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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 



1. CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part application of U.S. Provisional Application 
5 Serial No. 60/339,739 filed December 10, 2001 entitled "Novel Nucleic Acids and Secreted 
Polypeptides", Attorney Docket No. 811; U.S. Application Serial No. 10/128,558 filed April 
22, 2002 entitled "Novel Nucleic Acids and Polypeptides", Attorney Docket No. 812A, 
which in turn claims the benefit of U.S. Provisional Application Serial No. 60/339,453 filed 
December 11, 2001 entitled "Novel Nucleic Acids and Polypeptides", Attorney Docket No. 

10 812; U.S. Provisional Application Serial No. 60/365,384 filed March 14, 2002 entitled 
"Novel Nucleic Acids and Secreted Polypeptides", Attorney Docket No. 814; U.S. 
Provisional Application Serial No. 60/365,091 filed March 14, 2002 entitled "Novel Nucleic 
Acids and Polypeptides", Attorney Docket No. 815; U.S. Provisional Application Serial No. 
60/372,615 filed April 12, 2002 entitled "Novel Nucleic Acids and Secreted Polypeptides", 

15 Attorney Docket No. 817; U.S. Provisional Application Serial No. 60/376,045 filed April 24, 
2002 entitled "Novel Nucleic Acids and Secreted Polypeptides", Attorney Docket No. 
817CIP; U.S. Provisional Application Serial No. 60/372,381 filed April 12, 2002 entitled 
"Novel Nucleic Acids and Polypeptides", Attorney Docket No. 818; PCT Application Serial 
No. PCT/US00/35017 filed December 22, 2000 entitled "Novel Contigs Obtained from 

20 Various Libraries", Attorney Docket No. 784CIP3 A/PCT, which in turn is a continuation-in- 
part application of U.S. Application Serial No. 09/552,317 filed April 25, 2000 entitled 
"Novel Contigs Obtained from Various Libraries", Attorney Docket No. 784CIP, which in 
turn is a continuation-in-part application of U.S. Application Serial No. 09/488,725 filed 
January 21, 2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney 

25 Docket No. 784; PCT Application Serial No. PCT/US0 1/02623 filed January 25, 2001 
entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
785CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/491,404 filed January 25, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 785; PCT Application Serial No. PCT/US0 1/03 800 filed 

30 February 5, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney 
Docket No. 787CIP3/PCT, which in turn is a continuation-in-part application of U.S. 
Application Serial No. 09/560,875 filed April 27, 2000 entitled "Novel Contigs Obtained 
from Various Libraries", Attorney Docket No. 787CIP, which in turn is a continuation-in- 
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part application of U.S. Application Serial No. 09/496,914 filed February 03 5 2000 entitled 
"Novel Contigs Obtained from Various Libraries", Attorney Docket No. 787; PCT 
Application Serial No. PCT/US0 1/04927 filed February 26, 2001 entitled "Novel Contigs 
Obtained from Various Libraries", Attorney Docket No. 788CIP3/PCT, which in turn is a 
5 continuation-in-part application of U.S. Application Serial No. 09/577,409 filed May 1 8, 
2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 
788CIP, which in turn is a continuation-in-part application of U.S. Application Serial No. 
09/515,126 filed February 28, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 788; PCT Application Serial No. PCT/US0 1/04941 filed 

10 March 5, 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket 
No. 789CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application 
Serial No. 09/574,454 filed May 19, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 789CIP, which in turn is a continuation-in-part application 
of U.S. Application Serial No. 09/519,705 filed March 07, 2000 entitled "Novel Contigs 

1 5 Obtained from Various Libraries", Attorney Docket No. 789; PCT Application Serial No. 
PCT/US01/08631 filed March 30, 2001 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 790CIP3/PCT, which in turn is a continuation-in-part 
application of U.S. Application Serial No. 09/649,167 filed August 23, 2000 entitled "Novel 
Contigs Obtained from Various Libraries", Attorney Docket No. 790CIP, which in turn is a 

20 continuation-in-part application of U.S. Application Serial No. 09/540,217 filed March 31, 

2000 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 790; 
PCT Application Serial No. PCT/US0 1/08656 filed April 18, 2001 entitled "Novel Contigs 
Obtained from Various Libraries", Attorney Docket No. 791CIP3/PCT, which in turn is a 
continuation-in-part application of U.S. Application Serial No. 09/770,160 filed January 26, 

25 2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 

791CIP, which is in turn a continuation-in-part application of U.S. Application Serial No. 
09/552,929 filed April 18, 2000 entitled "Novel Contigs Obtained from Various Libraries", 
Attorney Docket No. 791; and PCT Application Serial No. PCT/US01/14827 filed May 16, 

2001 entitled "Novel Contigs Obtained from Various Libraries", Attorney Docket No. 

30 792CIP3/PCT, which in turn is a continuation-in-part application of U.S. Application Serial 
No. 09/577,408 filed May 18, 2000 entitled "Novel Contigs Obtained from Various 
Libraries", Attorney Docket No. 792; all of which are incorporated herein by reference in 
their entirety, specifically including, but not limited to , incorporation by reference of the 
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tables in each application displaying sequence information, ematrix signatures, pfam 
signatures, signal peptide information, transmembrane domain information, chromosomal 
localization and tissue distribution information, and/or 3-dimensional structural information. 



5 2. BACKGROUND OF THE INVENTION 

2.1 TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

10 

2.2 BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such 
as lymphokines, interferons, circulating soluble factors, chemokines, and interleukins) has 
matured rapidly over the past decade. The now routine hybridization cloning and expression 

1 5 cloning techniques clone novel polynucleotides "directly" in the sense that they rely on 

information directly related to the discovered protein (i.e., partial DNA/amino acid sequence 
of the protein in the case of hybridization cloning; activity of the protein in the case of 
expression cloning). More recent "indirect" cloning techniques such as signal sequence 
cloning, which isolates DNA sequences based on the presence of a now well-recognized 

20 secretory leader sequence motif, as well as various PCR-based or low stringency 

hybridization-based cloning techniques, have advanced the state of the art by making 
available large numbers of DNA/amino acid sequences for proteins that are known to have 
biological activity, for example, by virtue of their secreted nature in the case of leader 
sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 

25 techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, 
for example, diagnostics, forensics, gene mapping; identification of mutations responsible 
for genetic disorders or other traits, to assess biodiversity, and to produce many other types 
of data and products dependent on DNA and amino acid sequences. 

30 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
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cloned genes or degenerate variants thereof, especially naturally occurring variants such as 
allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize 
one or more epitopes present on such polypeptides, as well as hybridomas producing such 
antibodies. 

5 The compositions of the present invention additionally include vectors, including 

expression vectors, containing the polynucleotides of the invention, cells genetically engineered 
to contain such polynucleotides and cells genetically engineered to express such 
polynucleotides. 

The present invention relates to a collection or library of at least one novel nucleic acid 

10 sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public 
databases. The invention relates also to the proteins encoded by such polynucleotides, along 
with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These 
nucleic acid sequences are designated as SEQ ID NO: 1-911, or 1 823-2478 and are provided in 

1 5 the Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenine; C is 
cytosine; G is guanine; T is thymine; and N is any of the four bases or unknown. In the amino 
acids provided in the Sequence Listing, * corresponds to the. stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences 
that hybridize to the complement of SEQ ID NO: 1 -91 1, or 1 823-2478 under stringent 

20 hybridization conditions; nucleic acid sequences which are allelic variants or species 

homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that 
encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ 
ID NO: 1-91 1 , or 1 823-2478. A polynucleotide comprising a nucleotide sequence having at 
least 90% identity to an identifying sequence of SEQ ID NO: 1-911, or 1823-2478 or a 

25 degenerate variant or fragment thereof. The identifying sequence can be 1 00 base pairs in 
length. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-911, or 1823-2478. The 
sequence information can be a segment of any one of SEQ ID NO: 1-91 1, or 1823-2478 that 
30 uniquely identifies or represents the sequence information of SEQ ID NO: 1-911, or 1823-2478. 

A collection as used in this application can be a collection of only one polynucleotide. 
The collection of sequence information or identifying information of each sequence can be 
provided on a nucleic acid array. In one embodiment, segments of sequence information are 
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provided on a nucleic acid array to detect the polynucleotide that contains the segment. The 
array can be designed to detect full-match or mismatch to the polynucleotide that contains the 
segment The collection can also be provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; 
and host cells or organisms transformed with these expression vectors. Nucleic acid sequences 
(or their reverse or direct complements) according to the invention have numerous applications 
in a variety of techniques known to those skilled in the art of molecular biology, such as use as 
hybridization probes, use as primers for PCR, use in an array, use in computer-readable media, 
use in sequencing full-length genes, use for chromosome and gene mapping, use in the 
recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their 
chemical analogs and the like. 

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-91 1, or 1823- 
2478 or novel segments or parts of the nucleic acids of the invention are used as primers in 
expression assays that are well known in the art. In a particularly preferred embodiment, the 
nucleic acid sequences of SEQ ID NO: 1-911, or 1823-2478 or novel segments or parts of the 
nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well 
known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed 
sequence tags for physical mapping of the human genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-911, 
or 1823-2478; a polynucleotide comprising any of the fall length protein coding sequences of 
SEQ ID NO: 1-91 1, or 1 823-2478; and a polynucleotide comprising any of the nucleotide 
sequences of the mature protein coding sequences of SEQ ID NO: 1-911, or 1823-2478. The 
polynucleotides of the present invention also include, but are not limited to, a polynucleotide 
that hybridizes under stringent hybridization conditions to (a) the complement of any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-911, or 1823-2478; (b) anucleotide sequence 
encoding any one of the amino acid sequences set forth in SEQ ID NO: 1-911, or 1823-2478; 
(c) a polynucleotide which is an allelic variant of any polynucleotides recited above; (d) a 
polynucleotide which encodes a species homologue (e.g. orthologs) of any of the proteins 
recited above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain 
or truncation of any of the polypeptides comprising an amino acid sequence set forth in SEQ ID 
NO: 912-1822, or 2479-3134, or Tables 3A, 3B, 5, or 6. 
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The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising any of the amino acid sequences set forth in the Sequence Listing; or the 
corresponding full length or mature protein. Polypeptides of the invention (SEQ ID NO: 912- 
1 822, or 2479-3 134) also include polypeptides with biological activity that are encoded by (a) 
5 any of the polynucleotides having a nucleotide sequence set forth in SEQ ID NO: 1-911, or 
1823-2478; or (b) polynucleotides that hybridize to the complement of the polynucleotides of 
(a) under stringent hybridization conditions. Biologically active variants of any of the 
polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof (e.g., with 
at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence 
10 identity) that preferably retain biological activity are also contemplated. The polypeptides of the 
invention may be wholly or partially chemically synthesized but are preferably produced by 
recombinant means using the genetically engineered cells (e.g. host cells) of the invention. 

The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such 
15 as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The invention also provides host cells transformed or transfected with a 
polynucleotide of the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
20 under conditions permitting expression of the desired polypeptide, and purifying the 

polypeptide from the culture or from the host cells. Preferred embodiments include those in 
which the protein produced by such processes is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology. These techniques 
25 include use as hybridization probes, use as oligomers, or primers, for PGR, use for 

chromosome and gene mapping, use in the recombinant production of protein, and use in 
generation of anti-sense DNA or RNA, their chemical analogs and the like. For example, 
when the expression of an mRNA is largely restricted to a particular cell or tissue type, 
polynucleotides of the invention can be used as hybridization probes to detect the presence 
30 of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
expressed sequence tags for identifying expressed genes or, as well known in the art and 
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exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
procedures and methods that are currently applied to other proteins. For example, a 
5 polypeptide of the invention can be used to generate an antibody that specifically binds the 
polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or 
quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as 
molecular weight markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical 

10 condition which comprises the step of administering to a mammalian subject a 

therapeutically effective amount of a composition comprising a polypeptide of the present 
invention and a pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, 
for example, in methods for the prevention and/or treatment of disorders involving aberrant 

1 5 protein expression or biological activity. 

The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for 
example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited 
herein and for the identification of subjects exhibiting a predisposition to such conditions. 

20 The invention provides a method for detecting the polynucleotides of the invention in a 
sample, comprising contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of interest for a period sufficient to form the complex and 
under conditions sufficient to form a complex and detecting the complex such that if a 
complex is detected, the polynucleotide of interest is detected. The invention also provides a 

25 method for detecting the polypeptides of the invention in a sample comprising contacting the 
sample with a compound that binds to and forms a complex with the polypeptide under 
conditions and for a period sufficient to form the complex and detecting the formation of the 
complex such that if a complex is formed, the polypeptide is detected. 

The invention also provides kits comprising polynucleotide probes and/or 

30 monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the 
invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, 
and monitoring the progress of patients, involved in clinical trials for the treatment of 
disorders as recited above. 
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The invention also provides methods for the identification of compounds that 
modulate (i.e., increase or decrease) the expression or activity of the polynucleotides and/or 
polypeptides of the invention. Such methods can be utilized, for example, for the 
identification of compounds that can ameliorate symptoms of disorders as recited herein. 
5 Such methods can include, but are not limited to, assays for identifying compounds and 
other substances that interact with (e.g., bind to) the polypeptides of the invention. The 
invention provides a method for identifying a compound that binds to the polypeptides of the 
invention comprising contacting the compound with a polypeptide of the invention in a cell 
for a time sufficient to form a polypeptide/compound complex, wherein the complex drives 

10 expression of a reporter gene sequence in the cell; and detecting the complex by detecting 

the reporter gene sequence expression such that if expression of the reporter gene is detected 
the compound that binds to a polypeptide of the invention is identified. 

The methods of the invention also provide methods for treatment which involve the 
administration of the polynucleotides or polypeptides of the invention to individuals 

15 exhibiting symptoms or tendencies. In addition, the invention encompasses methods for 
treating diseases or disorders as recited herein comprising administering compounds and 
other substances that modulate the overall activity of the target gene products. Compounds 
and other substances can affect such modulation either on the level of target gene/protein 
expression or target protein activity. 

20 The polypeptides of the present invention (e.g. SEQ ID NO: 912-1822, or 2479- 

3134) and the polynucleotides encoding them (e.g. SEQ ID NO: 1 -9 1 1 , or 1 823-2478) are 
also useful for the same functions known to one of skill in the art as the polypeptides and 
polynucleotides to which they have homology (set forth in Tables 2A and 2B); for which 
they have a signature region (as set forth in Tables 3 A and 3B); or for which they have 

25 homology to a gene family (as set forth in Tables 4A and 4B). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful 
for a variety of applications, as described herein, including use in arrays for detection. 

4. DETAILED DESCRIPTION OF THE INVENTION 
30 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms 
"a", "an" and "the" include plural references unless the context clearly dictates otherwise. 
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The term "active" refers to those forms of the polypeptide which retain the biologic 
and/or immunologic activities of any naturally occurring polypeptide. According to the 
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
5 Likewise "immunologically active" or "immunological activity" refers to the capability of 
the natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are 
engaged in extracellular or intracellular membrane trafficking, including the export of 
10 secretory or enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3 5 binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded 
molecules may be "partial" such that only certain portion(s) of the nucleic acids bind or it 
15 may be "complete" such that total complementarity exists between the single stranded 

molecules. The degree of complementarity between the nucleic acid strands has significant 
effects on the efficiency and strength of the hybridization between the nucleic acid strands. 

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ 
20 line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a 
steady and continuous source of germ cells for the production of gametes. The term 
"primordial germ cells (PGCs)" refers to a small population of cells set aside from other cell 
lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis 
that have the potential to differentiate into germ cells and other cells. PGCs are the source 
25 from which GSCs and ES cells are derived. The PGCs, the GSCs and the ES cells are 

capable of self-renewal Thus these cells not only populate the germ line and give rise to a 
plurality of terminally differentiated cells that comprise the adult specialized organs, but are 
able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides 
30 which modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. 
EMFs include, but are not limited to, promoters, and promoter modulating sequences 
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(inducible elements). One class of EMFs are nucleic acid fragments which induce the 
expression of an operably linked ORF in response to a specific regulatory factor or 
physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
5 "oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or 
the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the 
sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like 
material. In the sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and 

10 N is A, C, G, or T (U) or unknown. It is contemplated that where the polynucleotide is 
RNA, the T (thymine) in the sequences provided herein is substituted with U (uracil). 
Generally, nucleic acid segments provided by this invention may be assembled from 
fragments of the genome and short oligonucleotide linkers, or from a series of 
oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which is 

1 5 capable of being expressed in a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of 
nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7 

20 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 1 1 
nucleotides and most preferably at least about 17 nucleotides. The fragment is preferably 
less than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably 
less than about 100 nucleotides, more preferably less than about 50 nucleotides and most 
preferably less than 30 nucleotides. Preferably the probe is from about 6 nucleotides to 

25 about 200 nucleotides, preferably from about 1 5 to about 50 nucleotides, more preferably 
from about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides. 
Preferably the fragments can be used in polymerase chain reaction (PCR), various 
hybridization procedures or microarray procedures to identify or amplify identical or related 
parts of mRNA or DNA molecules. A fragment or segment may uniquely identify each 

30 polynucleotide sequence of the present invention. Preferably the fragment comprises a 
sequence substantially similar to any one of SEQ ID NO: 1-911, or 1823-2478. 

Probes may, for example, be used to determine whether specific mRNA molecules 
are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal 
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DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PGR Methods Appl 1 :241-250). 
They may be labeled by nick translation, Klenow fill-in reaction, PGR, or other methods 
well known in the art. Probes of the present invention, their preparation and/or labeling are 
elaborated in Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold 
5 Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in 

Molecular Biology, John Wiley & Sons, New York NY, both of which are incorporated 
herein by reference in their entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NO: 1-91 1, or 1823-2478. The 

10 sequence information can be a segment of any one of SEQ ID NO: 1-91 1, or 1823-2478 that 
uniquely identifies or represents the sequence information of that sequence of SEQ ID NO: 
1-91 1, or 1823-2478, or those segments identified in Tables 3 A, 3B, 5, or 6. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty- 
mer is fully matched in the human genome is 1 in 300. In the human genome, there are three 

1 5 billion base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there 
are 300 times more twenty-mers than there are base pairs in a set of human chromosomes. 
Using the same analysis, the probability for a seventeen-mer to be fully matched in the 
human genome is approximately 1 in 5. When these segments are used in arrays for 
expression studies, fifteen-mer segments can be used. The probability that the fifteen-mer is 

20 fully matched in the expressed sequences is also approximately one in five because 

expressed sequences comprise less than approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a segment 
can be a twenty-five mer. The probability that the twenty-five mer would appear in a human 
genome with a single mismatch is calculated by multiplying the probability for a full match 

25 (l-^4 25 ) times the increased probability for mismatch at each nucleotide position (3 x 25). The 
probability that an eighteen mer with a single mismatch can be detected in an array for 
expression studies is approximately one in five. The probability that a twenty-mer with a single 
mismatch can be detected in a human genome is approximately one in five. 

The term "open reading frame," ORF, means a series of nucleotide triplets coding for 

30 amino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related 
nucleic acid sequences. For example, a promoter is operably associated or operably linked 
with a coding sequence if the promoter controls the transcription of the coding sequence. 
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While operably linked nucleic acid sequences can be contiguous and in the same reading 
frame, certain genetic elements e.g. repressor genes are not contiguously linked to the coding 
sequence but still control transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number 
5 of differentiated cell types that are present in an adult organism. A pluripotent cell is 
restricted in its differentiation capability in comparison to a totipotent cell. 

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an 
oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally 
occurring or synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a 

10 stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7 

amino acids, more preferably at least about 9 amino acids and most preferably at least about 
17 or more amino acids. The peptide preferably is not greater than about 200 amino acids, 
more preferably less than 150 amino acids and most preferably less than 100 amino acids. 
Preferably the peptide is from about 5 to about 200 amino acids. To be active, any 

1 5 polypeptide must have sufficient length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells 
that have not been genetically engineered and specifically contemplates various polypeptides 
arising from post-translational modifications of the polypeptide including, but not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

20 The term "translated protein coding portion" means a sequence which encodes for the 

full-length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a 
peptide or protein without a signal or leader sequence. The "mature protein portion" means 
that portion of the protein which does not include a signal or leader sequence. The peptide 

25 may have been produced by processing in the cell which removes any leader/signal 

sequence. The mature protein portion may or may not include the initial methionine residue. 
The methionine residue may be removed from the protein during processing in the cell. The 
peptide may be produced synthetically or the protein may have been produced using a 
polynucleotide only encoding for the mature protein coding sequence. 

30 The term "derivative" refers to polypeptides chemically modified by such techniques 

as ubiquitination, labeling (e.g., with radionuclides or various en2ymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
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substitution by chemical synthesis of amino acids such as ornithine, which do not normally 
occur in human proteins. 

The term 'Variant" (or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, 
5 e g., recombinant DNA techniques. Guidance in determining which amino acid residues 
may be replaced, added or deleted without abolishing activities of interest, may be found by 
comparing the sequence of the particular polypeptide with that of homologous peptides and 
minimizing the number of amino acid sequence changes made in regions of high homology 
(conserved regions) or by replacing amino acids with consensus sequence. 
10 Alternatively, recombinant variants encoding these same or similar polypeptides may 

be synthesized or selected by making use of the "redundancy" in the genetic code. Various 
codon substitutions, such as the silent changes which produce various restriction sites, may 
be introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be 
1 5 reflected in the polypeptide or domains of other peptides added to the polypeptide to modify 
the properties of any part of the polypeptide, to change characteristics such as ligand-binding 
affinities, interchain affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative 
20 amino acid replacements. "Conservative" amino acid substitutions may be made on the 

basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino 
acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, 
25 asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, 
and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. "Insertions" or "deletions" are preferably in the range of about 1 to 20 amino acids, 
more preferably 1 to 10 amino acids. The variation allowed may be experimentally 
determined by systematically making insertions, deletions, or substitutions of amino acids in 
30 a polypeptide molecule using recombinant DNA techniques and assaying the resulting 
recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such 
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alterations can, for example, alter one or more of the biological functions or biochemical 
characteristics of the polypeptides of the invention. For example, such alterations may 
change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or 
degradation/turnover rate. Further, such alterations can be selected so as to generate 
5 polypeptides that are better suited for expression, scale up and the like in the host cells 
chosen for expression. For example, cysteine residues can be deleted or substituted with 
another amino acid residue in order to eliminate disulfide bridges. 

The terms "purified" or "substantially purified" as used herein denotes that the 
indicated nucleic acid or polypeptide is present in the substantial absence of other biological 
10 macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, 
more preferably at least 99% by weight, of the indicated biological macromolecules present 
(but water, buffers, and other small molecules, especially molecules having a molecular 
weight of less than 1000 daltons, can be present). 
1 5 The term "isolated" as used herein refers to a nucleic acid or polypeptide separated 

from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic 
acid or polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide 
is found in the presence of (if anything) only a solvent, buffer, ion, or other component 
normally present in a solution of the same. The terms "isolated" and "purified" do not 
20 encompass nucleic acids or polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or 
mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins 
made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant 
25 microbial" defines a polypeptide or protein essentially free of native endogenous substances 
and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed 
in most bacterial cultures, e.g., E. colU will be free of glycosylation modifications; 
polypeptides or proteins expressed in yeast will have a glycosylation pattern in general 
different from those expressed in mammalian cells. 
30 The term "recombinant expression vehicle or vector" refers to a plasmid or phage or 

virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression 
vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element 
or elements having a regulatory role in gene expression, for example, promoters or 
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enhancers, (2) a structural or coding sequence which is transcribed into mRNA and 
translated into protein, and (3) appropriate transcription initiation and termination sequences. 
Structural units intended for use in yeast or eukaryotic expression systems preferably include 
a leader sequence enabling extracellular secretion of translated protein by a host cell. 
5 Alternatively, where recombinant protein is expressed without a leader or transport 

sequence, it may include an amino terminal methionine residue. This residue may or may 
not be subsequently cleaved from the expressed recombinant protein to provide a final 
product. 

The term "recombinant expression system" means host cells which have stably 

1 0 integrated a recombinant transcriptional unit into chromosomal DNA or carry the 

recombinant transcriptional unit extrachromosomally. Recombinant expression systems as 
defined herein will express heterologous polypeptides or proteins upon induction of the 
regulatory elements linked to the DNA segment or synthetic gene to be expressed. This term 
also means host cells which have stably integrated a recombinant genetic element or 

1 5 elements having a regulatory role in gene expression, for example, promoters or enhancers. 
Recombinant expression systems as defined herein will express polypeptides or proteins 
endogenous to the cell upon induction of the regulatory elements linked to the endogenous 
DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic. 
The term "secreted" includes a protein that is transported across or through a 

20 membrane, including transport as a result of signal sequences in its amino acid sequence 
when it is expressed in a suitable host cell. "Secreted" proteins include without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in 
which they are expressed. "Secreted" proteins also include without limitation proteins that 
are transported across the membrane of the endoplasmic reticulum. "Secreted" proteins are 

25 also intended to include proteins containing non-typical signal sequences (e.g. Interleukin-1 
Beta, see Krasney, P. A. and Young, P.R. (1992) Cytokine 4(2): 134 -143) and factors 
released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. 
(1998) Annu. Rev. Immunol. 16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 

30 sequence" which will direct the polypeptide through the membrane of a cell. Such a 

sequence may be naturally present on the polypeptides of the present invention or provided 
from heterologous protein sources by recombinant DNA techniques. 
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The term "stringent" is used to refer to conditions that are commonly understood in 
the art as stringent. Stringent conditions can include highly stringent conditions (i.e., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 
mM EDTA at 65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent 
5 conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization 
conditions are described herein in the examples. 

In instances of hybridization of deoxyoligonucleotides, additional exemplary 
stringent hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20- 

10 base oligonucleotides), and 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" or "substantially similar" can refer both to 
nucleotide and amino acid sequences, for example a mutant sequence, that varies from a 
reference sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between the reference and 

15 subject sequences. Typically, such a substantially equivalent sequence varies from one of 
those listed herein by no more than about 35% {i.e., the number of individual residue 
substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared 
to the corresponding reference sequence, divided by the total number of residues in the 
substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 

20 65% sequence identity to the listed sequence. In one embodiment, a substantially 

equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more 
than 30% (70% sequence identity); in a variation of this embodiment, by no more than 25% 
(75% sequence identity); and in a further variation of this embodiment, by no more than 
20% (80% sequence identity) and in a further variation of this embodiment, by no more than 

25 10% (90% sequence identity) and in a further variation of this embodiment, by no more that 
5% (95%> sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences 
according to the invention preferably have at least 80%> sequence identity with a listed amino 
acid sequence, more preferably at least 85% sequence identity, more preferably at least 90% 
sequence identity, more preferably at least 95% sequence identity, more preferably at least 

30 98%) sequence identity, and most preferably at least 99% sequence identity. Substantially 
equivalent nucleotide sequence of the invention can have lower percent sequence identities, 
taking into account, for example, the redundancy or degeneracy of the genetic code. 
Preferably, the nucleotide sequence has at least about 65% identity, more preferably at least 
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about 75% identity, more preferably at least about 80% sequence identity, more preferably at 
least 85% sequence identity, more preferably at least 90% sequence identity, more preferably 
at least about 95% sequence identity, more preferably at least 98% sequence identity, and 
most preferably at least 99% sequence identity. For the purposes of the present invention, 
5 sequences having substantially equivalent biological activity and substantially equivalent 
expression characteristics are considered substantially equivalent. For the purposes of 
determining equivalence, truncation of the mature sequence (e.g., via a mutation which 
creates a new stop codon) should be disregarded. Sequence identity may be determined, 
e.g., using the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). 

10 Identity between sequences can also be determined by other methods known in the art, e.g. 
by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the 
cell types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that 

15 the DNA is replicable, either as an extrachromosomal element, or by chromosomal 

integration. The term "transfection" refers to the taking up of an expression vector by a 
suitable host cell, whether or not any coding sequences are in fact expressed. The term 
"infection" refers to the introduction of nucleic acids into a suitable host cell by use of a 
virus or viral vector. 

20 As used herein, an "uptake modulating fragment," UMF, means a series of 

nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can be 
readily identified using known UMFs as a target sequence or target motif with the 
computer-based systems described below. The presence and activity of a UMF can be 
confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic acid 

25 molecule is then incubated with an appropriate host under appropriate conditions and the 
uptake of the marker sequence is determined. As described above, a UMF will increase the 
frequency of uptake of a linked marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless 
the context dictates otherwise. 



4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 



WO 03/054152 PCT/US02/39555 

18 

The isolated polynucleotides of the invention include a polynucleotide comprising 
the nucleotide sequences of SEQ ID NO: 1-91 1, or 1 823-2478; a polynucleotide encoding 
any one of the peptide sequences of SEQ ID NO: 1-911, or 1823-2478; and a polynucleotide 
.comprising the nucleotide sequence encoding the mature protein coding sequence of the 
5 polynucleotides of any one of SEQ ID NO: 1-91 1, or 1 823-2478. The polynucleotides of the 
present invention also include, but are not limited to, a polynucleotide that hybridizes under 
stringent conditions to (a) the complement of any of the nucleotides sequences of SEQ ID 
NO: 1-911, or 1823-2478; (b) nucleotide sequences encoding any one of the amino acid 
sequences set forth in the Sequence Listing, ; (c) a polynucleotide which is an allelic variant 

10 of any polynucleotide recited above; (d) a polynucleotide which encodes a species 

homologue of any of the proteins recited above; or (e) a polynucleotide that encodes a 
polypeptide comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 
912-1822, or 2479-3134 (for example, as set forth in Tables 3 A, 3B, 5, or 6). Domains of 
interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like 

15 polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, 
or combinations thereof; domains in immunoglobulin-like proteins include the variable 
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and 
substrate binding domains; and domains in ligand polypeptides include receptor-binding 
domains. 

20 The polynucleotides of the invention include naturally occurring or wholly or 

partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The 
polynucleotides may include entire coding region of the cDNA or may represent a portion of 
the coding region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences 

25 disclosed herein. The corresponding genes can be isolated in accordance with known methods 
using the sequence information disclosed herein. Such methods include the preparation of 
probes or primers from the disclosed sequence information for identification and/or 
amplification of genes in appropriate genomic libraries or other sources of genomic materials. 
Further 5' and 3 f sequence can be obtained using methods known in the art. For example, full 

30 length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 
1-911, or 1823-2478 can be obtained by screening appropriate cDNA or genomic DNA 
libraries under suitable hybridization conditions using any of the polynucleotides of SEQ ID 
NO: 1-91 1, or 1823-2478 or a portion thereof as a probe. Alternatively, the polynucleotides of 
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SEQ ID NO: 1-911, or 1823-2478 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

Hie nucleic acid sequences of the invention can be assembled from ESTs and sequences 
(including cDNA and genomic sequences) obtained from one or more public databases, such as 
5 dbEST, gbpri, and UrnGene. The EST sequences can provide identifying sequence 

information, representative fragment or segment information, or novel segment information for 
the full-length gene. 

The polynucleotides of the invention also provide polynucleotides including 
nucleotide sequences that are substantially equivalent to the polynucleotides recited above. 

10 Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 
70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least 
about 85%, 86%>, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, 
and even more typically at least about 95%, 96%, 97%, 98%, 99% sequence identity to a 
polynucleotide recited above. 

1 5 Included within the scope of the nucleic acid sequences of the invention are nucleic 

acid sequence fragments that hybridize under stringent conditions to any of the nucleotide 
sequences of SEQ ID NO: 1-911, or 1823-2478, or complements thereof, which fragment is 
greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 
nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 

20 nucleotides or more that are selective for (i.e. specifically hybridize to) any one of the 
polynucleotides of the invention are contemplated. Probes capable of specifically 
hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention 
from other polynucleotide sequences in the same family of genes or can differentiate human 
genes from genes of other species, and are preferably based on unique nucleotide sequences. 

25 The sequences falling within the scope of the present invention are not limited to these 

specific sequences, but also include allelic and species variations thereof. Allelic and species 
variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1- 
91 1, or 1823-2478, a representative fragment thereof, or a nucleotide sequence at least 90% 
identical, preferably 95% identical, to SEQ ID NO: 1-91 1, or 1823-2478 with a sequence from 

30 another isolate of the same species. Furthermore, to accommodate codon variability, the 

invention includes nucleic acid molecules coding for the same amino acid sequences as do the 
specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of 
one codon for another codon that encodes the same amino acid is expressly contemplated. 
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The nearest neighbor or homology results for the nucleic acids of the present invention, 
including SEQ ID NO: 1-911, or 1823-2478 can be obtained by searching a database using an 
algorithm or a program. Preferably, a BLAST (Basic Local Alignment Search Tool) program is 
used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and 
5 Altschul S.F. et al. J. Mol. Biol. 21:403-410 (1990)). Alternatively a FASTA version 3 search 
against Genpept, using FASTXY algorithm may be performed. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are 
also provided by the present invention. Species homologs may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a 
10 suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which 
also encode proteins which are identical, homologous or related to that encoded by the 
polynucleotides. 

15 The nucleic acid sequences of the invention are further directed to sequences which 

encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 
native or variant polynucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic 

20 acids encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These 
nucleic acid alterations can be made at sites that differ in the nucleic acids from different 
species (variable positions) or in highly conserved regions (constant regions). Sites at such 
locations will typically be modified in series, e.g., by substituting first with conservative 

25 choices (e.g. , hydrophobic amino acid to a different hydrophobic amino acid) and then with 
more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then 
deletions or insertions may be made at the target site. Amino acid sequence deletions 
generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are 
typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal 

30 fusions ranging in length from one to one hundred or more residues, as well as intrasequence 
insertions of single or multiple amino acid residues. Intrasequence insertions may range 
generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of 
terminal insertions include the heterologous signal sequences necessary for secretion or for 
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intracellular targeting in different host cells and sequences such as FLAG or poly-histidine 
sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter 
5 a polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

nucleotides on both sides of the changed amino acid to form a stable duplex on either side of 
the site of being changed. In general, the techniques of site-directed mutagenesis are well 
known to those of skill in the art and this technique is exemplified by publications such as, 
Edelman et al., DNA 2: 1 83 (1983). A versatile and efficient method for producing 

10 site-specific changes in a polynucleotide sequence was published by Zoller and Smith, 
Nucleic Acids Res, 10:6487-6500 (1982). PGR may also be used to create amino acid 
sequence variants of the novel nucleic acids. When small amounts of template DNA are 
used as starting material, primer(s) that differs slightly in sequence from the corresponding 
region in the template DNA can generate the desired amino acid variant. PGR amplification 

1 5 results in a population of product DNA fragments that differ from the polynucleotide 

template encoding the polypeptide at the position specified by the primer. The product DNA 
fragments replace the corresponding region in the plasmid and this gives a polynucleotide 
encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 

20 technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques 
well known in the art, such as, for example, the techniques in Sambrook et al., supra, and 
Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be used in the practice of the invention for 

25 the cloning and expression of these novel nucleic acids. Such DNA sequences include those 
which are capable of hybridizing to the appropriate novel nucleic acid sequence under 
stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention could be 
used to generate polynucleotides encoding chimeric or fusion proteins comprising one or 
30 more domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of 
the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, 
amplified, or synthetic) or RNA. Methods and algorithms for obtaining such 
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polynucleotides are well known to those of skill in the art and can include, for example, 
methods for determining hybridization conditions that can routinely isolate polynucleotides 
of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 
5 protein coding sequences corresponding to any one of SEQ ID NO: 1-91 1, or 1823-2478, or 
functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate 
host cells. Also included are the cDNA inserts of any of the clones identified herein. 

A polynucleotide according to the invention can be joined to any of a variety of other 

10 nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et 
al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). 
Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, 
e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well 
known in the art. Accordingly, the invention also provides a vector including a 

15 polynucleotide of the invention and a host cell containing the polynucleotide. In general, the 
vector contains an origin of replication functional in at least one organism, convenient 
restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to 
the invention include expression vectors, replication vectors, probe generation vectors, and 
sequencing vectors. A host cell according to the invention can be a prokaryotic or 

20 eukaryotic cell and can be a unicellular organism or part of a multicellular organism. 

The present invention further provides recombinant constructs comprising a nucleic 
acid having any of the nucleotide sequences of SEQ ID NO: 1-911, or 1823-2478 or a 
fragment thereof or any other polynucleotides of the invention. In one embodiment, the 
recombinant constructs of the present invention comprise a vector, such as a plasmid or viral 

25 vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1- 
91 1, or 1823-2478 or a fragment thereof is inserted, in a forward or reverse orientation. In 
the case of a vector comprising one of the ORFs of the present invention, the vector may 
further comprise regulatory sequences, including for example, a promoter, operably linked to 
the ORF. Large numbers of suitable vectors and promoters are known to those of skill in the 

30 art and are commercially available for generating the recombinant constructs of the present 
invention. The following vectors are provided by way of example: Bacterial: pBs, 
phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a 
(Stratagene), pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); Eukaryotic: 
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pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, 
5 Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinant^. 
Many suitable expression control sequences are known in the art. General methods of 
expressing recombinant proteins are also known and are exemplified in R. Kaufman, 
Methods in Enzymology 185, 537-566 (1990). As defined herein "operably linked" means 
that the isolated polynucleotide of the invention and an expression control sequence are 
1 0 situated within a vector or cell in such a way that the protein is expressed by a host cell 

which has been transformed (transfected) with the ligated polynucleotide/expression control 
sequence. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 

1 5 appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters include 
lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate 
early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse 
metallothionein-I. Selection of the appropriate vector and promoter is well within the leyel 
of ordinary skill in the art. Generally, recombinant expression vectors will include origins of 

20 replication and selectable markers permitting transformation of the host cell, e.g. , the 

ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived 
from a highly expressed gene to direct transcription of a downstream structural sequence. 
Such promoters can be derived from operons encoding glycolytic enzymes such as 3- 
phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among 

25 others. The heterologous structural sequence is assembled in appropriate phase with 

translation initiation and termination sequences, and preferably, a leader sequence capable of 
directing secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an amino 
terminal identification peptide imparting desired characteristics, e.g., stabilization or 

3 0 simplified purification of expressed recombinant product. Useful expression vectors for 
bacterial use are constructed by inserting a structural DNA sequence encoding a desired 
protein together with suitable translation initiation and termination signals in operable 
reading phase with a functional promoter. The vector will comprise one or more phenotypic 
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selectable markers and an origin of replication to ensure maintenance of the vector and to, if 
desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may 
5 also be employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR3 22 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 

10 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, 
USA). These pBR322 "backbone" sections are combined with an appropriate promoter and 
the structural sequence to be expressed. Following transformation of a suitable host strain 
and growth of the host strain to an appropriate cell density, the selected promoter is induced 
or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells 

1 5 are cultured for an additional period. Cells are typically harvested by centrifugation, 

disrupted by physical or chemical means, and the resulting crude extract retained for further 
purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al, Nat. Biotech 17, 870-872 (1999), incorporated herein by 
20 reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 
against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intra-muscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form 
of naked DNA. 

25 

4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO: 1-911, or 1823-2478, or fragments, analogs or 
30 derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that is 
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA 
sequence. In specific aspects, antisense nucleic acid molecules are provided that comprise a 
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sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an 
entire coding strand, or to only a portion thereof. Nucleic acid molecules encoding 
fragments, homologs, derivatives and analogs of a protein of any of SEQ ID NO: 1-91 1, or 
1823-2478 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID 
NO: 1-911, or 1823-2478 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence of the invention. The term "coding 
region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 
of the invention. The term "noncoding region" refers to 5' and 3 1 sequences that flank the 
coding region that are not translated into amino acids (i.e., also referred to as 5' and 3* 
untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., 
SEQ ID NO: 1-91 1, or 1823-2478, antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of an mRNA, but more 
preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding 
region of an mRNA. For example, the antisense oligonucleotide can be complementary to 
the region surrounding the translation start site of an mRNA. An antisense oligonucleotide 
can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An 
antisense nucleic acid of the invention can be constructed using chemical synthesis or 
enzymatic ligation reactions using procedures known in the art. For example, an antisense 
nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using 
naturally occurring nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 
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1- methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3- 
methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 

methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil 5 beta-D-mannosylqueosine, 
5*-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N64sopentenyladenine, 
5 uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl- 

2- thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced 
biologically using an expression vector into which a nucleic acid has been subcloned in an 

1 0 antisense orientation (i. e. , RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 

1 5 genomic DNA encoding a protein according to the invention to thereby inhibit expression of 
the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of 

20 administration of antisense nucleic acid molecules of the invention includes direct injection 
at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target 
selected cells and then administered systemically. For example, for systemic administration, 
antisense molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules to 

25 peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic 
acid molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol III 
promoter are preferred. 

30 In yet another embodiment, the antisense nucleic acid molecule of the invention is an 

oc-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual a-units, 
the strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids Res 15: 
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6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res 15: 6131-6148) or a 
chimeric RNA -DNA analogue (Inoue et al (1987) FEES Lett 215: 327-330). 



5 4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 

10 Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 

mRNA transcripts to thereby inhibit translation of an mRNA. A ribozyme having specificity 
for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a 
DNA disclosed herein (i.e., SEQ ID NO: 1-911, or 1823-2478). For example, a derivative of 
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 

15 active site is complementary to the nucleotide sequence to be cleaved in a mRNA. See, e.g. , 
Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,1 16,742. Alternatively, 
mRNA of the invention can be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 
261:1411-1418. 

20 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the gene in target cells. See generally, Helene. 
(1991) Anticancer DrugDes. 6: 569-84; Helene. et al. (1992) Ann. N.Y.Acad. Sci. 
660:27-36; and Maher (1992) Bioassays 14: 807-15. 

25 In various embodiments, the nucleic acids of the invention can be modified at the 

base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup 
et al. (1996) BioorgMed Chern 4: 5-23). As used herein, the terms "peptide nucleic acids" 

30 or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 
specific hybridization to DNA and RNA under conditions of low ionic strength. The 
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synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup et ah (1996) above; Perry-O'Keefe et ah (1996) PNAS 93: 
14670-675. 

PNAs of the invention can be used in therapeutic and diagnostic applications. For 
5 example, PNAs can be used as antisense or antigene agents for sequence-specific modulation 
of gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs of the invention can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; as artificial restriction enzymes 
when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); 

10 or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above; 
Perry-O'Keefe (1996), above). 

In another embodiment, PNAs of the invention can be modified, e.g., to enhance 
their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 

1 5 delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 

20 base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) 
above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup 
(1996) above and Finn et ah (1996) Nucl Acids Res 24: 3357-63. For example, a DNA chain 
can be synthesized on a solid support using standard phosphoramidite coupling chemistry, 
and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine 

25 phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et ah (1 989) 
Nucl Acid Res 17: 5973-88). PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 3 f DNA segment (Finn et ah 
(1996) above). Alternatively, chimeric molecules can be synthesized with a 5' DNA 
segment and a 3' PNA segment. See, Petersen et ah (1975) BioorgMed Chem Lett 5: 

30 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such 
as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et ah, 1989, Proc. Natl. Acad. Sci. U.S.A. 
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86:6553-6556; Lemaitre et ah, 1987, Proc. Natl Acad. Set 84:648-652; PCT Publication 
No. W088/09810) or the blood-brain barrier (see, e.g. 9 PCT Publication No. W089/10134). 
In addition, oligonucleotides can be modified with hybridization triggered cleavage agents 
(See, e.g., Krol et ah, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., 
5 Zon, 1988, Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to 
another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport 
agent, a hybridization-triggered cleavage agent, etc. 

4.5 HOSTS 

10 The present invention further provides host cells genetically engineered to contain 

the polynucleotides of the invention. For example, such host cells may contain nucleic acids 
of the invention introduced into the host cell using known transformation, transfection or 
infection methods. The present invention still further provides host cells genetically 
engineered to express the polynucleotides of the invention, wherein such polynucleotides are 

15 in operative association with a regulatory sequence heterologous to the host cell which 
drives expression of the polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by 
homologous recombination) to provide increased polypeptide expression by replacing, in 

20 whole or in part, the naturally occurring promoter with all or part of a heterologous promoter 
so that the cells express the polypeptide at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to the encoding sequences. See, for 
example, PCT International Publication No. WO94/12650, PCT International Publication 
No. WO92/20808, and PCT International Publication No. WO91/09955. It is also 

25 contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA 
(e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate 
synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be 
inserted along with the heterologous promoter DNA. If linked to the coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification 

30 of the desired protein coding sequences in the cells. 

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
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calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation 
(Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one 
of the polynucleotides of the invention, can be used in conventional manners to produce the 
gene product encoded by the isolated fragment (in the case of an ORF) or can be used to 
5 produce a heterologous protein under the control of the EMF. 

Any host/vector system can be used to express one or more of the ORFs of the 
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, 
Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and 
B. suhtilis. The most preferred cells are those which do not normally express the particular 

10 polypeptide or protein or which expresses the polypeptide or protein at low natural level. 
Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under 
the control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. Appropriate cloning and expression vectors for use with prokaryotic and 

15 eukaryotic hosts are described by Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is 
hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 lines 

20 of monkey kidney fibroblasts, described by Gluzman, Cell 23: 175 (1981). Other cell lines 
capable of expressing a compatible vector are, for example, the CI 27, monkey COS cells, 
Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A43 1 cells, 
human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal 
diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, 

25 HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. Mammalian expression 
vectors will comprise an origin of replication, a suitable promoter and also any necessary 
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, 

30 and polyadenylation sites may be used to provide the required nontranscribed genetic 

elements. Recombinant polypeptides and proteins produced in bacterial culture are usually 
isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous 
ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, 
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as necessary, in completing configuration of the mature protein. Finally, high performance 
liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. 
5 Alternatively, it may be possible to produce the protein in lower eukaryotes such as 

yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, 
or any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
10 strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
enzymatic methods. 

15 In another embodiment of the present invention, cells and tissues may be engineered 

to express an endogenous gene comprising the polynucleotides of the invention under the 
control of inducible regulatory elements, in which case the regulatory sequences of the 
endogenous gene may be replaced by homologous recombination. As described herein, gene 
targeting can be used to replace a gene's existing regulatory region with a regulatory 

20 sequence isolated from a different gene or a novel regulatory sequence synthesized by 

genetic engineering methods. Such regulatory sequences may be comprised of promoters, 
enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional 
initiation sites, and regulatory protein binding sites or combinations of said sequences. 
Alternatively, sequences which affect the structure or stability of the RNA or protein 

25 produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequence include polyadenylation signals, mRNA stability elements, splice sites, leader 
sequences for enhancing or modifying transport or secretion properties of the protein, or 
other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

30 The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 
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element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different 
cell-type specificity than the naturally occurring elements. Here, the naturally occurring 
sequences are deleted and new sequences are added. In all cases, the identification of the 
5 targeting event may be facilitated by the use of one or more selectable marker genes that are 
contiguous with the targeting DNA, allowing for the selection of cells in which the 
exogenous DNA has integrated into the host cell genome. The identification of the targeting 
event may also be facilitated by the use of one or more marker genes exhibiting the property 
of negative selection, such that the negatively selectable marker is linked to the exogenous 

1 0 DNA, but configured such that the negatively selectable marker flanks the targeting 

sequence, and such that a correct homologous recombination event with sequences in the 
host cell genome does not result in the stable integration of the negatively selectable marker. 
Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) 
gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

1 5 The gene targeting or gene activation techniques which can be used in accordance 

with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 
to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by 

20 reference herein in its entirety. 



4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a 
polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 912- 

25 1 822, or 2479-3 1 34 or an amino acid sequence encoded by any one of the nucleotide 

sequences SEQ ID NO: 1-911, or 1823-2478 or the corresponding full length or mature 
protein. Polypeptides of the invention also include polypeptides preferably with biological or 
immunological activity that are encoded by: (a) a polynucleotide having any one of the 
nucleotide sequences set forth in SEQ ID NO: 1-91 1, or 1823-2478 or (b) polynucleotides 

30 encoding any one of the amino acid sequences set forth as SEQ ID NO: 912-1822, or 2479- 
3 134 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either 
(a) or (b) under stringent hybridization conditions. The invention also provides biologically 
active or immunologically active variants of any of the amino acid sequences set forth as 
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SEQ ID NO: 912-1822, or 2479-3134 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least 
about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 
90%, 91%, 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least 
5 about 98%, or most typically at least about 99% amino acid identity) that retain biological 
activity. Polypeptides encoded by allelic variants may have a similar, increased, or 
decreased activity compared to polypeptides comprising SEQ ID NO: 912-1822, or 2479- 
3134. 

Fragments of the proteins of the present invention which are capable of exhibiting 

10 biological activity are also encompassed by the present invention. Fragments of the protein 
may be in linear form or they may be cyclized using known methods, for example, as 
described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. 
McDowell, et al., J. Amer. Chem. Soc. 1 14, 9245-9253 (1992), both of which are 
incorporated herein by reference. Such fragments may be fused to carrier molecules such as 

1 5 immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. Fragments are also identified in Tables 3A, 3B, 5, or 6. 

The present invention also provides both full-length and mature forms (for example, 
without a signal sequence or precursor sequence) of the disclosed proteins. The protein 
coding sequence is identified in the sequence listing by translation of the disclosed 

20 nucleotide sequences. The predicted signal sequence is set forth in Table 6. The mature 
form of such protein may be obtained and confirmed by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell and sequencing of the cleaved 
product. One of skill in the art will recognize that the actual cleavage site may be different 
than that predicted in Table 6. The sequence of the mature form of the protein is also 

25 determinable from the amino acid sequence of the full-length form. Where proteins of the 
present invention are membrane bound, soluble forms of the proteins are also provided. In 
such forms, part or all of the regions causing the proteins to be membrane bound are deleted 
so that the proteins are fully secreted from the cell in which they are expressed (See, e.g., 
Sakal et al., Prep. Biochem. Biotechnol. (2000), 30(2), pp. 107-23, incorporated herein by 

30 reference). 

Protein compositions of the present invention may further comprise an acceptable 
carrier, such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
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The present invention further provides isolated polypeptides encoded by the nucleic 
acid fragments of the present invention or by degenerate variants of the nucleic acid 
fragments of the present invention. By "degenerate variant" is intended nucleotide 
fragments which differ from a nucleic acid fragment of the present invention {e.g., an ORF) 
5 by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical 
polypeptide sequence. Preferred nucleic acid fragments of the present invention are the 
ORFs that encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino 

10 acid sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or 
tertiary structural and/or conformational characteristics with proteins may possess biological 
properties in common therewith, including protein activity. This technique is particularly 
useful in producing small peptides and fragments of larger polypeptides. Fragments are 

1 5 useful, for example, in generating antibodies against the native polypeptide. Thus, they may 
be employed as biologically active or immunological substitutes for natural, purified 
proteins in screening of therapeutic compounds and in immunological processes for the 
development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified 

20 from cells which have been altered to express the desired polypeptide or protein. As used 
herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, 
through genetic manipulation, is made to produce a polypeptide or protein which it normally 
does not produce or which the cell normally produces at a lower level. One skilled in the art 
can readily adapt procedures for introducing and expressing either recombinant or synthetic 

25 sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one 
of the polypeptides or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising 
growing a culture of host cells of the invention in a suitable culture medium, and purifying 
the protein from the cells or the culture in which the cells are grown. For example, the 

30 methods of the invention include a process for producing a polypeptide in which a host cell 
containing a suitable expression vector that includes a polynucleotide of the invention is 
cultured under conditions that allow expression of the encoded polypeptide. The 
polypeptide can be recovered from the culture, conveniently from the culture medium, or 
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from a lysate prepared from the host cells and further purified. Preferred embodiments 
include those in which the protein produced by such process is a full length or mature form 
of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells 
5 which naturally produce the polypeptide or protein. One skilled in the art can readily follow 
known methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange 
chromatography, and immuno-affmity chromatography. See, e.g., Scopes, Protein 

10 Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in 

Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular 
Biology. Polypeptide fragments that retain biological/immunological activity include 
fragments comprising greater than about 100 amino acids, or greater than about 200 amino 
acids, and fragments that encode specific protein domains. 

1 5 The purified polypeptides can be used in in vitro binding assays which are well 

known in the art to identify molecules which bind to the polypeptides. These molecules 
include but are not limited to, for e.g., small molecules, molecules from combinatorial 
libraries, antibodies or other proteins. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 

20 well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the 
peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that 
are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or other 

25 cell by the specificity of the binding molecule for SEQ ID NO: 912-1822, or 2479-3134. 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are 
characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

30 The proteins provided herein also include proteins characterized by amino acid 

sequences similar to those of purified proteins but into which modification are naturally 
provided or deliberately engineered. For example, modifications, in the peptide or DNA 
sequence, can be made by those skilled in the art using known techniques. Modifications of 
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interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For example, 
one or more of the cysteine residues may be deleted or replaced with another amino acid to 
alter the conformation of the molecule. Techniques for such alteration, substitution, 
5 replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. 
Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or 
deletion retains the desired activity of the protein. Regions of the protein that are important 
for the protein function can be determined by various methods known in the art including the 
alanine-scanning method which involved systematic substitution of single or strings of 

10 amino acids with alanine, followed by testing the resulting alanine-containing variant for 

biological activity. This type of analysis determines the importance of the substituted amino 
acid(s) in biological activity. Regions of the protein that are important for protein function 
may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be 

1 5 expected to retain protein activity in whole or in part and are useful for screening or other 
immunological methodologies may also be easily made by those skilled in the art given the 
disclosures herein. Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of 
the invention to suitable control sequences in one or more insect expression vectors, and 

20 employing an insect expression system. Materials and methods for baculovirus/insect cell 
expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, 
Calif., U.S.A. (the MaxBat™ kit), and such methods are well known in the art, as described 
in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), 
incorporated herein by reference. As used herein, an insect cell capable of expressing a 

25 polynucleotide of the present invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells 
under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or cell 
extracts) using known purification processes, such as gel filtration and ion exchange 

30 chromatography. The purification of the protein may also include an affinity column 

containing agents which will bind to the protein; one or more column steps over such affinity 
resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; 
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one or more steps involving hydrophobic interaction chromatography using such resins as 
phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography. 

Alternatively, the protein of the invention may also be expressed in a form which will 
facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
5 maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as 
a His tag. Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and 
Invitrogen, respectively. The protein can also be tagged with an epitope and subsequently 
purified by using a specific antibody directed to such epitope. One such epitope ("FLAG®") 

10 is commercially available from Kodak (New Haven, Conn.). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
methyl or other aliphatic groups, can be employed to further purify the protein. Some or all 
of the foregoing purification steps, in various combinations, can also be employed to provide 

15 a substantially homogeneous isolated recombinant protein. The protein thus purified is 

substantially free of other mammalian proteins and is defined in accordance with the present 
invention as an "isolated protein." 

The polypeptides of the invention include analogs (variants). This embraces 
fragments, as well as peptides in which one or more amino acids has been deleted, inserted, 

20 or substituted. Also, analogs of the polypeptides of the invention embrace fusions of the 

polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide 
or analog is fused to another moiety or moieties, e.g., targeting moiety or another therapeutic 
agent. Such analogs may exhibit improved properties such as activity and/or stability. 
Examples of moieties which may be fused to the polypeptide or an analog include, for 

25 example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, 
e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, 
dendritic cells, granulocytes, etc., as well as receptor and ligands expressed on pancreatic or 
immune cells. Other moieties which may be fused to the polypeptide include therapeutic 
agents which are used for treatment, for example, immunosuppressive drugs such as 

30 cyclosporin, SK506, azathioprine, CD3 antibodies and steroids. Also, polypeptides may be 
fused to immune modulators, and other cytokines such as alpha or beta interferon. 
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4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE 
IDENTITY AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between 
the sequences tested. Methods to determine identity and similarity are codified in computer 
5 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, 
S.F. et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic 
Acids Res. vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu 

10 et al., J. Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif 
software (Nevill-Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by 
reference), Pfam software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 
(1998), herein incorporated by reference) and the Kyte-Doolittle hydrophobocity prediction 
algorithm (J. Mol Biol, 157, pp. 105-31 (1982), the GeneAtlas software (Molecular 

15 Simulations Inc. (MSI), San Diego, CA) (Sanchez and Sali (1998) Proc. Natl. Acad. Sci., 95, 
13597-13602; Kitson DH et al, (2000) "Remote homology detection using structural 
modeling - an evaluation" Submitted; Fischer and Eisenberg (1996) Protein Sci. 5, 947- 
955), Neural Network SignalP VI. 1 program (from Center for Biological Sequence 
Analysis, The Technical University of Denmark) incorporated herein by reference). 

20 Polypeptide sequences were examined by a proprietary algorithm, SeqLoc that separates the 
proteins into three sets of locales: intracellular, membrane, or secreted. This prediction is 
based upon three characteristics of each polypeptide, including percentage of cysteine 
residues, Kyte-Doolittle scores for the first 20 amino acids of each protein, and Kyte- 
Doolittle scores to calculate the longest hydrophobic stretch of the said protein. Values of 

25 predicted proteins are compared against the values from a set of 592 proteins of known 

cellular localization from the Swissprot database (http://www.expasv.ch/sproft . Predictions 
are based upon the maximum likelihood estimation. 

Pesence of transmembrane region(s) was detected using the TMpred program 
(htt |i://www.ch.embnet.org/software/TMPRED form.htmlY 

30 The BLAST programs are publicly available from the National Center for 

Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. 
NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 
(1990). 
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4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
another polypeptide. Within a fusion protein the polypeptide according to the invention can 
5 correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 
invention. In another embodiment, a fusion protein comprises at least two biologically 
active portions of a protein according to the invention. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the polypeptide according to the invention 

1 0 and the other polypeptide are fused in-frame to each other. The polypeptide can be fused to 
the N-terminus or C-terminus, or to the middle. 

For example, in one embodiment a fusion protein comprises a polypeptide according 
to the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 

15 polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., 
glutathione S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in 
which the polypeptide sequences according to the invention comprise one or more domains 
fused to sequences derived from a member of the immunoglobulin protein family. The 

20 immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in 
vivo. The immunoglobulin fusion proteins can be used to affect the bioavailability of a 
cognate ligand. Inhibition of the ligand/protein interaction may be useful therapeutically for 

25 both the treatment of proliferative and differentiative disorders, e.g., cancer as well as 
modulating (e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies in a 
subject, to purify ligands, and in screening assays to identify molecules that inhibit the 
interaction of a polypeptide of the invention with a ligand. 

30 A chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 
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appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of 
gene fragments can be carried out using anchor primers that give rise to complementary 
5 overhangs between two consecutive gene fragments that can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) 
Current Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion moiety 
{e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be 
10 cloned into such an expression vector such that the fusion moiety is linked in-frame to the 
protein of the invention. 

4.8 GENE THERAPY 

Mutations in the polynucleotides of the invention gene may result in loss of normal 

15 function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides 
of the invention. Delivery of a functional gene encoding polypeptides of the invention to 
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more 
particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo 

20 by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for 
example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For 
additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 
(1989); Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). 
Introduction of any one of the nucleotides of the present invention or a gene encoding the 

25 polypeptides of the present invention can also be accomplished with extrachromosomal 
substrates (transient expression) or artificial chromosomes (stable expression). Cells may 
also be cultured ex vivo in the presence of proteins of the present invention in order to 
proliferate or to produce a desired effect on or activity in such cells. Treated cells can then 
be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that in other 

30 human disease states, preventing the expression of or inhibiting the activity of polypeptides 
of the invention will be useful in treating the disease states. It is contemplated that antisense 
therapy or gene therapy could be applied to negatively regulate the expression of 
polypeptides of the invention. 
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Other methods inhibiting expression of a protein include the introduction of antisense 
molecules to the nucleic acids of the present invention, their complements, or their translated 
RNA sequences, by methods known in the art. Further, the polypeptides of the present 
invention can be inhibited by using targeted deletion methods, or the insertion of a negative 
5 regulatory element such as a silencer, which is tissue specific. 

The present invention still further provides cells genetically engineered in vivo to 
express the polynucleotides of the invention, wherein such polynucleotides are in operative 
association with a regulatory sequence heterologous to the host cell which drives expression of 
the polynucleotides in the cell. These methods can be used to increase or decrease the 

1 0 expression of the polynucleotides of the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of 
cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be 
modified (e.g., by homologous recombination) to provide increased polypeptide expression by 
replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous 

1 5 promoter so that the cells express the protein at higher levels. The heterologous promoter is 

inserted in such a manner that it is operatively linked to the desired protein encoding sequences. 
See, for example, PCT International Publication No. WO 94/12650, PCT International 
Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also 
contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., 

20 ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, 

aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with 
the heterologous promoter DNA. If linked to the desired protein coding sequence, 
amplification of the marker DNA by standard selection methods results in co-amplification of 
the desired protein coding sequences in the cells. 

25 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control 
of inducible regulatory elements, in which case the regulatory sequences of the endogenous 
gene may be replaced by homologous recombination. As described herein, gene targeting can 
be used to replace a gene's existing regulatory region with a regulatory sequence isolated from 

30 a different gene or a novel regulatory sequence synthesized by genetic engineering methods. 
Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment 
regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding 
sites or combinations of said sequences. Alternatively, sequences which affect the structure or 
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stability of the RNA or protein produced may be replaced, removed, added, or otherwise 
modified by targeting. These sequences include polyadenylation signals, mRNA stability 
elements, splice sites, leader sequences for enhancing or modifying transport or secretion 
properties of the protein, or other sequences which alter or improve the function or stability of 
5 protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the 
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple 
deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory 

1 0 element. Alternatively, the targeting event may replace an existing element; for example, a 
tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type 
specificity than the naturally occurring elements. Here, the naturally occurring sequences are 
deleted and new sequences are added. In all cases, the identification of the targeting event may 
be facilitated by the use of one or more selectable marker genes that are contiguous with the 

1 5 targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated 
into the cell genome. The identification of the targeting event may also be facilitated by the use 
of one or more marker genes exhibiting the property of negative selection, such that the 
negatively selectable marker is linked to the exogenous DNA, but configured such that the 
negatively selectable marker flanks the targeting sequence, and such that a correct homologous 

20 recombination event with sequences in the host cell genome does not result in the stable 

integration of the negatively selectable marker. Markers useful for this purpose include the 
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 

25 this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 
Chappel; U.S. Patent No. 5,578,461 to Sherwinet al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (W09 1/06667) by Skoultchi et al., each of which is incorporated by 
reference herein in its entirety. 

30 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 
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inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 
5 are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
systems to identify compounds that modulate lipid metabolism. Transgenic animals, 

10 preferably non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 
polynucleotides of the invention is either activated or inactivated to alter the level of 
expression of the polypeptides of the invention. Inactivation can be carried out using 

1 5 homologous recombination methods described above. Activation can be achieved by 

supplementing or even replacing the homologous promoter to provide for increased protein 
expression. The homologous promoter can be supplemented by insertion of one or more 
heterologous enhancer elements known to confer promoter activation in a particular tissue. 
The polynucleotides of the present invention also make possible the development, 

20 through, e.g., homologous recombination or knock out strategies, of animals that fail to 

express polypeptides of the invention or that express a variant polypeptide. Such animals are 
useful as models for studying the in vivo activities of polypeptide as well as for studying 
modulators of the polypeptides of the invention. 

In preferred methods to determine biological functions of the polypeptides of the 

25 invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination 

30 are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, 
can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. 
Transgenic animals are useful to determine the roles polypeptides of the invention play in 
biological processes, and preferably in disease states. Transgenic animals are useful as model 
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systems to identify compounds that modulate lipid metabolism. Transgenic animals, 
preferably non-human mammals, are produced using methods as described in U.S. Patent No 
5,489,743 and PCT Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
5 invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or 
even replacing the homologous promoter to provide for increased protein expression. The 
homologous promoter can be supplemented by insertion of one or more heterologous 
10 enhancer elements known to confer promoter activation in a particular tissue. 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one 
or more of the uses or biological activities (including those associated with assays cited 

15 herein) identified herein. Uses or activities described for proteins of the present invention 
may be provided by administration or use of such proteins or of polynucleotides encoding 
such proteins (such as, for example, in gene therapies or vectors suitable for introduction of 
DNA). The mechanism underlying the particular condition or pathology will dictate whether 
the polypeptides of the invention, the polynucleotides of the invention or modulators 

20 (activators or inhibitors) thereof would be beneficial to the subject in need of treatment. 

Thus, "therapeutic compositions of the invention" include compositions comprising isolated 
polynucleotides (including recombinant DNA molecules, cloned genes and degenerate 
variants thereof) or polypeptides of the invention (including full length protein, mature 
protein and truncations or domains thereof), or compounds and other substances that 

25 modulate the overall activity of the target gene products, either at the level of target 

gene/protein expression or target protein activity. Such modulators include polypeptides, 
analogs, (variants), including fragments and fusion proteins, antibodies and other binding 
proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides 
of the invention (identified, e.g., via drug screening assays as described herein); antisense 

30 polynucleotides and polynucleotides suitable for triple helix formation; and in particular 
antibodies or other binding partners that specifically recognize one or more epitopes of the 
polypeptides of the invention. 
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The polypeptides of the present invention may likewise be involved in cellular 
activation or in one of the other physiological pathways described herein. 

4.10.1 RESEARCH USES AND UTILITIES 

5 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express recombinant 
protein for analysis, characterization or therapeutic use; as markers for tissues in which the 
corresponding protein is preferentially expressed (either constitutively or at a particular stage 
of tissue differentiation or development or in disease states); as molecular weight markers on 

1 0 gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in patients to identify 
potential genetic disorders; as probes to hybridize and thus discover novel, related DNA 
sequences; as a source of information to derive PGR primers for genetic fingerprinting; as a 
probe to "subtract-out" known sequences in the process of discovering other novel 

15 polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or other 
support, including for examination of expression patterns; to raise anti-protein antibodies 
using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or 
elicit another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), 

20 the polynucleotide can also be used in interaction trap assays (such as, for example, that 

described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding interaction. 

The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 

25 screening; to raise antibodies or to elicit another immune response; as a reagent (including 
the labeled reagent) in assays designed to quantitatively determine levels of the protein (or 
its receptor) in biological fluids; as markers for tissues in which the corresponding 
polypeptide is preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in a disease state); and, of course, to isolate correlative 

30 receptors or ligands. Proteins involved in these binding interactions can also be used to 
screen for peptide or small molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent 
grade or kit format for commercialization as research products. 
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Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. 
Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular 
5 Cloning Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein or 

1 0 amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of 
carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be added to 
the feed of a particular organism or can be administered as a separate solid or liquid 
preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the case 
of microorganisms, the polypeptide or polynucleotide of the invention can be added to the 

1 5 medium in or on which the microorganism is cultured. 

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 
ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
20 proliferation (either inducing or inhibiting) or cell differentiation (either inducing or 

inhibiting) activity or may induce production of other cytokines in certain cell populations. 
A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
Many protein factors discovered to date, including all known cytokines, have exhibited 
activity in one or more factor-dependent cell proliferation assays, and hence the assays serve 
25 as a convenient confirmation of cytokine activity. The activity of therapeutic compositions 
of the present invention is evidenced by any one of a number of routine factor dependent cell 
proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, 
B9/11, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, 
Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in 
30 the following: 

Assays for T-cell or thymocyte proliferation include without limitation those 
described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 
133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778-3783, 1992; Bowman et al, I. 
5 Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells 
or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of 

1 0 mouse and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. 
e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 
include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K, Davis, L. S. and Lipsky, P. E. In Current 

15 Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 
Sons, Toronto. 1991; deVries et al, J. Exp. Med. 173:1205-1211, 1991; Moreau et al., 
Nature 336:690-692, 1988; Greenberger et al, Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 
1983; Measurement of mouse and human interleukin 6-Nordan, R. In Current Protocols in 
Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; 

20 Smith et al, Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

Interleukin 1 1-Bennett, F, Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols 
in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9— Ciarletta, A, Giannotti, J., Clark, S. C. 
and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, 

25 John Wiley and Sons, Toronto. 1 99 1 . 

Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, 

30 E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 

(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and their 
cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al, Proc. 
Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al, Eur. J. Immun. 1 1:405-41 1, 
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1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al. 3 J. Immunol. 140:508-512, 
1988. 



4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

5 A polypeptide of the present invention may exhibit stem cell growth factor activity 

and be involved in the proliferation, differentiation and survival of pluripotent and totipotent 
stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells 
and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells 
in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or 

10 pluripotential state which would be useful for re-engineering damaged or diseased tissues, 
transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. 
The ability to produce large quantities of human cells has important working applications for 
the production of human proteins which currently must be obtained from non-human sources 
or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other 

15 neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, 
tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, 
gastrointestinal cells and others; and organs for transplantation such as kidney, liver, 
pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines 

20 may be administered in combination with the polypeptide of the invention to achieve the 

desired effect, including any of the growth factors listed herein, other stem cell maintenance 
factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), 
Flt-3 ligand (Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL- 
6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, 

25 thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), 
neural growth factors and basic fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion 
of these cells in culture will facilitate the production of large quantities of mature cells. 
Techniques for culturing stem cells are known in the art and administration of polypeptides 

30 of the invention, optionally with other growth factors and/or cytokines, is expected to 
enhance the survival and proliferation of the stem cell populations. This can be 
accomplished by direct administration of the polypeptide of the invention to the culture 
medium. Alternatively, stroma cells transfected with a polynucleotide that encodes for the 
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polypeptide of the invention can be used as a feeder layer for the stem cell populations in 
culture or in vivo. Stromal support cells for feeder layers may include embryonic bone 
marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic 
fibroblasts (see U.S. Patent No. 5,690,926). 
5 Stem cells themselves can be transfected with a polynucleotide of the invention to 

induce autocrine expression of the polypeptide of the invention. This will allow for 
generation of undifferentiated totipotential/pluripotential stem cejl lines that are useful as is 
or that can then be differentiated into the desired mature cell types. These stable cell lines 
can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create 

1 0 cDNA libraries and templates for polymerase chain reaction experiments. These studies 

would allow for the isolation and identification of differentially expressed genes in stem cell 
populations that regulate stem cell proliferation and/or maintenance. 

Expansion and maintenance of totipotent stem cell populations will be useful in the 
treatment of many pathological conditions. For example, polypeptides of the present 

1 5 invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells 
that can be used to augment or replace cells damaged by illness, autoimmune disease, 
accidental damage or genetic disorders. The polypeptide of the invention may be useful for 
inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, 
i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as 

20 well as mechanical and traumatic disorders which involve degeneration, death or trauma to 
neural cells or nerve tissue. In addition, the expanded stem cell populations can also be 
genetically altered for gene therapy purposes and to decrease host rejection of replacement 
tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 

25 manipulated to achieve controlled differentiation of the stem cells into more differentiated 
cell types. A broadly applicable method of obtaining pure populations of a specific 
differentiated cell type from undifferentiated stem cell populations involves the use of a cell- 
type specific promoter driving a selectable marker. The selectable marker allows only cells 
of the desired type to survive. For example, stem cells can be induced to differentiate into 

30 cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. 

Invest, 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of 
Tissue Engineering eds. Lanza et ah, Academic Press (1997)). Alternatively, directed 
differentiation of stem cells can be accomplished by culturing the stem cells in the presence 
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of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the 
invention which would inhibit the effects of endogenous stem cell factor activity and allow 
differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the 
5 invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of 
various cell sources (including hematopoietic stem cells and embryonic stem cells) and 
cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 
92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in 
combination with other growth factors or cytokines. The ability of the polypeptide of the 
10 invention to induce stem cells proliferation is determined by colony formation on semi-solid 
support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991). 

4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of 

15 hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. 
Even marginal biological activity in support of colony forming cells or of factor-dependent 
cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth 
and proliferation of erythroid progenitor cells alone or in combination with other cytokines, 
thereby indicating utility, for example, in treating various anemias or for use in conjunction 

20 with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or 
erythroid cells; in supporting the growth and proliferation of myeloid cells such as 
granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, 
in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in 
supporting the growth and proliferation of megakaryocytes and consequently of platelets 

25 thereby allowing prevention or treatment of various platelet disorders such as 

thrombocytopenia, and generally for use in place of or complimentary to platelet 
transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above-mentioned hematopoietic cells and 
therefore find therapeutic utility in various stem cell disorders (such as those usually treated 

30 with transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell compartment post 
irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
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transplantation or with peripheral progenitor cell transplantation (homologous or 
heterologous)) as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
5 cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., 
Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 
10 1993. 

Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. 
R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; 

15 Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic 
colony forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., 
New York, N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; 
Cobblestone area forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. 

20 R. I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term 
bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, 
T. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, 
Inc., New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In 
Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., 

25 New York, N.Y. 1 994. 



4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, 
tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and 
30 tissue repair and replacement, and in healing of burns, incisions and ulcers. 

A polypeptide of the present invention which induces cartilage and/or bone growth in 
circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
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polypeptide, antibody, binding partner, or other modulator of the invention may have 
prophylactic use in closed as well as open fracture reduction and also in the improved 
fixation of artificial joints. De novo bone formation induced by an osteogenic agent 
contributes to the repair of congenital, trauma induced, or oncologic resection induced 
5 craniofacial defects, and also is useful in cosmetic plastic surgery. 

A polypeptide of this invention may also be involved in attracting bone-forming 
cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by 

1 0 blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast 
activity, etc.) mediated by inflammatory processes may also be possible using the 
composition of the invention. 

Another category of tissue regeneration activity that may involve the polypeptide of 
the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue 

15 or other tissue formation in circumstances where such tissue is not normally formed, has 
application in the healing of tendon or ligament tears, deformities and other tendon or 
ligament defects in humans and other animals. Such a preparation employing a 
tendon/ligament-like tissue inducing protein may have prophylactic use in preventing 
damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or 

20 ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. De 
novo tendon/ligament-like tissue formation induced by a composition of the present 
invention contributes to the repair of congenital, trauma induced, or other tendon or ligament 
defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair 
of tendons or ligaments. The compositions of the present invention may provide 

25 environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or 

ligament-forming cells, induce differentiation of progenitors of tendon- or ligament-forming 
cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to 
effect tissue repair. The compositions of the invention may also be useful in the treatment of 
tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions 

30 may also include an appropriate matrix and/or sequestering agent as a carrier as is well 
known in the art. 

The compositions of the present invention may also be useful for proliferation of 
neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central 
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and peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a composition may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
5 localized neuropathies, and central nervous system diseases, such as Alzheimer's, 

Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present invention 
include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and 
cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from 
1 0 chemotherapy or other medical therapies may also be treatable using a composition of the 
invention. 

Compositions of the invention may also be useful to promote better or faster closure 

of non-healing wounds, including without limitation pressure ulcers, ulcers associated with 

vascular insufficiency, surgical and traumatic wounds, and the like. 
1 5 Compositions of the present invention may also be involved in the generation or 

regeneration of other tissues, such as organs (including, for example, pancreas, liver, 

intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular 

(including vascular endothelium) tissue, or for promoting the growth of cells comprising 

such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic 
20 scarring may allow normal tissue to regenerate. A polypeptide of the present invention may 

also exhibit angiogenic activity. 

A composition of the present invention may also be useful for gut protection or 

regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 

conditions resulting from systemic cytokine damage. 
25 A composition of the present invention may also be useful for promoting or 

inhibiting differentiation of tissues described above from precursor tissues or cells; or for 

inhibiting the growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
Assays for tissue generation activity include, without limitation, those described in: 
30 International Patent Publication No. W095/1 6035 (bone, cartilage, tendon); International 

Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 

W09 1/07491 (skin, endothelium). 
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Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), 
Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

5 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays are 
described herein. A polynucleotide of the invention can encode a polypeptide exhibiting 

10 such activities. A protein may be useful in the treatment of various immune deficiencies and 
disorders (including severe combined immunodeficiency (SCID)), e.g., in regulating (up or 
down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic 
activity of NK cells and other cell populations. These immune deficiencies may be genetic or 
be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from 

15 autoimmune disorders. More specifically, infectious diseases causes by viral, bacterial, 

fungal or other infection may be treatable using a protein of the present invention, including 
infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria 
spp. and various fungal infections such as candidiasis. Of course, in this regard, proteins of 
the present invention may also be useful where a boost to the immune system generally may 

20 be desirable, i.e., in the treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus 
erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre 
syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, 

25 graft-versus-host disease and autoimmune inflammatory eye disease. Such a protein (or 
antagonists thereof, including antibodies) of the present invention may also to be useful in 
the treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug 
reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, 
hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic 

30 contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, allergic conjunctivitis, 
atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant papillary conjunctivitis and 
contact allergies), such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein (or antagonists 
thereof) of the present invention. The therapeutic effects of the polypeptides or antagonists 
thereof on allergic reactions can be evaluated by in vivo animals models such as the 
cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin 
5 prick test (Hoffmann et al., Allergy 54: 446-54, 1 999), guinea pig skin sensitization test 
(Vohr et al., Arch, ^oxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

Using the proteins of the invention it may also be possible to modulate immune 
responses, in a number of ways. Down regulation may be in the form of inhibiting or 

10 blocking an immune response already in progress or may involve preventing the induction of 
an immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T 
cell responses is generally an active, non-antigen-specific, process which requires continuous 
exposure of the T cells to the suppressive agent. Tolerance, which involves inducing 

15 non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that 
it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. 
Operationally, tolerance can be demonstrated by the lack of a T cell response upon 
reexposure to specific antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

20 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin 
and organ transplantation and in graft-versus-host disease (GVHD). For example, blockage 
of T cell function should result in reduced tissue destruction in tissue transplantation. 
Typically, in tissue transplants, rejection of the transplant is initiated through its recognition 

25 as foreign by T cells, followed by an immune reaction that destroys the transplant. The 

administration of a therapeutic composition of the invention may prevent cytokine synthesis 
by immune cells, such as T cells, and thus acts as an immunosuppressant. Moreover, a lack 
of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in 
a subject. Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may 

30 avoid the necessity of repeated administration of these blocking reagents. To achieve 

sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the 
function of a combination of B lymphocyte antigens. 
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The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
5 used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 

described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. 
Sci USA, 89:1 1 102-1 1 105 (1992). In addition, murine models of GVHD (see Paul ed., 
Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of therapeutic compositions of the invention on the development of that 
10 disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation 
of T cells that are reactive against self-tissue and which promote the production of cytokines 
and autoantibodies involved in the pathology of the diseases. Preventing the activation of 

15 autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents 
which block stimulation of T cells can be used to inhibit T cell activation and prevent 
production of autoantibodies or T cell-derived cytokines which may be involved in the 
disease process. Additionally, blocking reagents may induce antigen-specific tolerance of 
autoreactive T cells which could lead to long-term relief from the disease. The efficacy of 

20 blocking reagents in preventing or alleviating autoimmune disorders can be determined 
using a number of well-characterized animal models of human autoimmune diseases. 
Examples include murine experimental autoimmune encephalitis, systemic lupus 
erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune collagen 
arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia 

25 gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a 
means of up regulating immune responses, may also be useful in therapy. Upregulation of 
immune responses may be in the form of enhancing an existing immune response or eliciting 
30 an initial immune response. For example, enhancing an immune response may be useful in 
cases of viral infection, including systemic viral diseases such as influenza, the common 
cold, and encephalitis. 
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Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory 
form of a soluble peptide of the present invention and reintroducing the in vitro activated T 
5 cells into the patient. Another method of enhancing anti-viral immune responses would be to 
isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of 
the present invention as described herein such that the cells express all or a portion of the 
protein on their surface, and reintroduce the transfected cells into the patient. The infected 
cells would now be capable of delivering a costimulatory signal to, and thereby activate, T 
10 cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal 
to T cells to induce a T cell mediated immune response against the transfected tumor cells. 
In addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 
reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected 

1 5 with nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) 
of an MHC class I alpha chain protein and p 2 microglobulin protein or an MHC class II 
alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I 
or MHC class II proteins on the cell surface. Expression of the appropriate class I or class II 
MHC in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., 

20 B7-1 , B7-2, B7-3) induces a T cell mediated immune response against the transfected tumor 
cell. Optionally, a gene encoding an antisense construct which blocks expression of an MHC 
class II associated protein, such as the invariant chain, can also be cotransfected with a DNA 
encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of 
tumor associated antigens and induce tumor specific immunity. Thus, the induction of a T 

25 cell mediated immune response in a human subject may be sufficient to overcome 
tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among other means, be measured by 
the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 

30 those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 
D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
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78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., 
J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et 
al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 
5 1994. 

Assays for T-cell-dependent immunoglobulin responses and isotype switching 
(which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described in: 
Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 

10 antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. 
E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 
proteins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, 

15 D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 
Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 
1992. 

20 Dendritic cell-dependent assays (which will identify, among others, proteins 

expressed by dendritic cells that activate naive T-cells) include, without limitation, tfiose 
described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al., Journal of Immunology 
154:5071-5079, 1995; Porgador et al., Journal of Experimental Medicine 182:255-260, 

25 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 

264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 
1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., 
Journal of Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 

30 proteins that prevent apoptosis after superantigen induction and proteins that regulate 

lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et 
al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, 
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Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; 
Gorczyca et al., International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and development 
include, without limitation, those described in: Antica et aL, Blood 84:1 1 1-1 17, 1994; Fine 
5 etal., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; 
Toki et al., Proc. Nat Acad Sci. USA 88:7548-7551, 1991. 

4.10.8 ACTIVIN/INfflBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
10 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate 
the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present 
invention, alone or in heterodimers with a member of the inhibin family, may be useful as a 
1 5 contraceptive based on the ability of inhibins to decrease fertility in female mammals and 
decrease spermatogenesis in male mammals. Administration of sufficient amounts of other 
inhibins can induce infertility in these mammals. Alternatively, the polypeptide of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin 
group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
20 molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 
U.S. Pat. No. 4,798,885. A polypeptide of the invention may also be useful for advancement 
of the onset of fertility in sexually immature mammals, so as to increase the lifetime 
reproductive performance of domestic animals such as, but not limited to, cows, sheep and 
pigs. 

25 The activity of a polypeptide of the invention may, among other means, be measured 

by the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 

30 Natl. Acad. Sci. USA 83:3091-3095, 1986. 



4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 
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A polypeptide of the present invention may be involved in chemotactic or 
chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, 
neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A 
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. 
5 Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a 

desired cell population to a desired site of action. Chemotactic or chemokinetic compositions 
(e.g. proteins, antibodies, binding partners, or modulators of the invention) provide particular 
advantages in treatment of wounds and other trauma to tissues, as well as in treatment of 
localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to 
1 0 tumors or sites of infection may result in improved immune responses against the tumor or 
infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
1 5 movement of cells. Whether a particular protein has chemotactic activity for a population of 
cells can be readily determined by employing such protein or peptide in any known assay for 
cell chemotaxis. 

Therapeutic compositions of the invention can be used in the following: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

20 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene 

25 Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta 
Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston etal. J. of Immunol. 153:1762-1768, 1994. 

30 4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 
attributes. Compositions may be useful in treatment of various coagulation disorders 
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(including hereditary disorders, such as hemophilias) or to enhance coagulation and other 
hemostatic events in treating wounds resulting from trauma, surgery or other causes. A 
composition of the invention may also be useful for dissolving or inhibiting formation of 
thromboses and for treatment and prevention of conditions resulting therefrom (such as, for 
5 example, infarction of cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, 
10 Prostaglandins 35:467-474, 1988. 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation 
or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 

1 5 invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. 
For example, the presence or increased expression of a polynucleotide/polypeptide of the 
invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing 
malignancy. Conversely, a defect in the gene or absence of the polypeptide may be 
associated with a cancer condition. Identification of single nucleotide polymorphisms 

20 associated with cancer or a predisposition to cancer may also be useful for diagnosis or 
prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor 
growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. 

25 Therapeutic compositions of the invention may be effective in adult and pediatric oncology 
including in solid phase tumors/malignancies, locally advanced tumors, human soft: tissue 
sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies 
including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck 
cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including 

30 small cell carcinoma and non-small cell cancers, breast cancers including small cell 

carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, 
stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal 
neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and 
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prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine 
(including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers 
including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
5 nervous system, bone cancers including osteomas, skin cancers including malignant 

melanoma, tumor progression of human skin keratinocytes, squamous cell carcinoma, basal 
cell carcinoma, hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention 
(including inhibitors and stimulators of the biological activity of the polypeptide of the 

10 invention) may be administered to treat cancer. Therapeutic compositions can be 

administered in therapeutically effective dosages alone or in combination with adjuvant 
cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser 
therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor 
growth, inhibiting metastasis, or otherwise improving overall clinical condition, without 

1 5 necessarily eradicating the cancer. 

The composition can also be administered in therapeutically effective amounts as a 
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a 
pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer 

20 treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a 
treatment in combination with the polypeptide or modulator of the invention include: 
Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, 
Carmustine, Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HC1 
(Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HC1, Doxorubicin HC1, 

25 Estramustine phosphate sodium, Etoposide (VI 6-21 3), Floxuridine, 5-Fluorouracil (5-Fu), 
Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon 
Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine 
HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), 
Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, Streptozocin, 

30 Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 

Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 
Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
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treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing 
cancers. Under these circumstances, it may be beneficial to treat these individuals with 
therapeutically effective doses of the polypeptide of the invention to reduce the risk of 
5 developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays 
of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) 
Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 

10 and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 
52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays 
as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis 
assays such as induction of vascularization of the chick chorioallantoic membrane or 
induction of vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. 

15 Biol., 40: 1 189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. 
Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection 
catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

20 A polypeptide of the present invention may also demonstrate activity as receptor, 

receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of 
the invention can encode a polypeptide exhibiting such characteristics. Examples of such 
receptors and ligands include, without limitation, cytokine receptors and their ligands, 
receptor kinases and their ligands, receptor phosphatases and their ligands, receptors 

25 involved in cell-cell interactions and their ligands (including without limitation, cellular 

adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs 
involved in antigen presentation, antigen recognition and development of cellular and 
humoral immune responses. Receptors and ligands are also useful for screening of potential 
peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A protein of 

30 the present invention (including, without limitation, fragments of receptors and ligands) may 
themselves be useful as inhibitors of receptor/ligand interactions. 

The activity of a polypeptide of the invention may, among other means, be measured 
by the following methods: 
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Suitable assays for receptor-ligand activity include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. 
Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 
5 7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., 
J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; 
Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be 

10 identified through binding assays, affinity chromatography, dihybrid screening assays, 
BIAcore assays, gel overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or 
a partial antagonist require the use of other proteins as competing ligands. The polypeptides 
of the present invention or ligand(s) thereof may be labeled by being coupled to 

15 radioisotopes, colorimetric molecules or a toxin molecules by conventional methods. 

("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 
(1990) Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not 
limited to, tritium and carbon- 14 . Examples of colorimetric molecules include, but are not 
limited to, fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric 

20 molecules. Examples of toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the 
novel polypeptides or binding fragments thereof in any of a variety of drug screening 

25 techniques. The polypeptides or fragments employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the polypeptide or a fragment 
thereof. Drugs are screened against such transformed cells in competitive binding assays. 

30 Such cells, either in viable or fixed form, can be used for standard binding assays. One may 
measure, for example, the formation of complexes between polypeptides of the invention or 
fragments and the agent being tested or examine the diminution in complex formation 
between the novel polypeptides and an appropriate cell line, which are well known in the art. 
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Sources for test compounds that may be screened for ability to bind to or modulate 
(i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic 
and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 
5 Chemical libraries may be readily synthesized or purchased from a number of 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

10 screening may be created by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of the organisms themselves. Natural product 
libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants 
thereof. For a review, see Science 282:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides 

15 or organic compounds and can be readily prepared by traditional automated synthesis 

methods, PGR, cloning or proprietary synthetic methods. Of particular interest are peptide 
and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, 
protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide 
libraries. For a review of combinatorial chemistry and libraries created therefrom, see 

20 Myers, Curr. Opin. Biotechnol 8:701-707 (1997). For reviews and examples of 

peptidomimetic libraries, see Al-Obeidi et al., Mol Biotechnol, 9(3):205-23 (1998); Hruby 
et al., Curr Opin Chem Biol, 1(1): 1 14-19 (1997); Dorner et al., Bioorg Med Chem, 
4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein 

25 permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" 
to bind a polypeptide of the invention. The molecules identified in the binding assay are then 
tested for antagonist or agonist activity in in vivo tissue culture or animal models that are 
well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or 
animals and then tested for either cell/animal death or prolonged survival of the animal/cells. 

30 The binding molecules thus identified may be complexed with toxins, e.g., ricin or 

cholera, or with other compounds that are toxic to cells such as radioisotopes. The 
toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of 
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the binding molecule for a polypeptide of the invention. Alternatively, the binding 
molecules may be complexed with imaging agents for targeting and imaging purposes. 

4.10,14 ASSAY FOR RECEPTOR ACTIVITY 

5 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For 
example, expression cloning using mammalian or bacterial cells, or dihybrid screening 
assays can be used to identify polynucleotides encoding binding partners. As another 

1 0 example, affinity chromatography with the appropriate immobilized polypeptide of the 
invention can be used to isolate polypeptides that recognize and bind polypeptides of the 
invention. There are a number of different libraries used for the identification of 
compounds, and in particular small molecules, that modulate (i.e., increase or decrease) 
biological activity of a polypeptide of the invention. Ligands for receptor polypeptides of the 

1 5 invention can also be identified by adding exogenous ligands, or cocktails of ligands to two 
cells populations that are genetically identical except for the expression of the receptor of the 
invention: one cell population expresses the receptor of the invention whereas the other does 
not. The responses of the two cell populations to the addition of ligands(s) are then 
compared. Alternatively, an expression library can be co-expressed with the polypeptide of 

20 the invention in cells and assayed for an autocrine response to identify potential ligand(s). As 
still another example, BIAcore assays, gel overlay assays, or other methods known in the art 
can be used to identify binding partner polypeptides, including, (1) organic and inorganic 
chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of 
random peptides, oligonucleotides or organic molecules. 

25 The role of downstream intracellular signaling molecules in the signaling cascade of 

the polypeptide of the invention can be determined. For example, a chimeric protein in 
which the cytoplasmic domain of the polypeptide of the invention is fused to the 
extracellular portion of a protein, whose ligand has been identified, is produced in a host 
cell. The cell is then incubated with the ligand specific for the extracellular portion of the 

30 chimeric protein, thereby activating the chimeric receptor. Known downstream proteins 
involved in intracellular signaling can then be assayed for expected modifications i.e. 
phosphorylation. Other methods known to those in the art can also be used to identify 
signaling molecules involved in receptor activity. 
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4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. 
The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 
5 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Compositions with such activities can be used to treat inflammatory 

10 conditions including chronic or acute conditions), including without limitation intimation 
associated with infection (such as septic shock, sepsis or systemic inflammatory response 
syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung 
injury, inflammatory bowel disease, Crohn's disease or resulting from over production of 

1 5 cytokines such as TNF or IL-1 . Compositions of the invention may also be useful to treat 

anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this 
invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, 
acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic 
inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus 

20 host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, 
other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 
acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

25 4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a 
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of 
the invention. Such leukemias and related disorders include but are not limited to acute 
leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, 
30 promyelocyte, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic 

myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such 
disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 
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4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
5 therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient 
(including human and non-human mammalian patients) according to the invention include 
but are not limited to the following lesions of either the central (including spinal cord, brain) 
10 or peripheral nervous systems : 

(i) traumatic lesions, including lesions caused by physical injury or associated 
with surgery, for example, lesions which sever a portion of the nervous system, or 
compression injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
15 results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by 
human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme 

20 disease, tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration 
associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or 
amyotrophic lateral sclerosis; 

25 (v) lesions associated with nutritional diseases or disorders, in which a portion of 

the nervous system is destroyed or injured by a nutritional disorder or disorder of 
metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, 
Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary 
degeneration of the corpus callosum), and alcoholic cerebellar degeneration; 

30 (vi) neurological lesions associated with systemic diseases including but not 

limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, 
carcinoma, or sarcoidosis; 
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(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
injured by a demyelinating disease including but not limited to multiple sclerosis, human 

5 immunodeficiency virus-associated myelopathy, transverse myelopathy or various 

etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival 
or differentiation of neurons. For example, and not by way of limitation, therapeutics which 
10 elicit any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, 
e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

1 5 (iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method 
set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons 
may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or 

20 Brown et al. (1981, Ann. Rev. Neurosci. 4: 17-42); increased production of 

neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody 
binding, Northern blot assay, etc., depending on the molecule to be measured; and motor 
neuron dysfunction may be measured by assessing the physical manifestation of motor 
neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability. 

25 In specific embodiments, motor neuron disorders that may be treated according to the 

invention include but are not limited to disorders such as infarction, infection, exposure to 
toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor 
neurons as well as other components of the nervous system, as well as disorders that 
selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited 

30 to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, 
infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio- 
Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory 
Neuropathy (Charcot-Marie-Tooth Disease). 
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4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following 
additional activities or effects: inhibiting the growth, infection or function of 5 or killing, 
5 infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; 
effecting (suppressing or enhancing) bodily characteristics, including, without limitation, 
height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or 
organ or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or circadian cycles or rhythms; 

10 effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 
carbohydrate, vitamins, minerals, co-factors or other nutritional factors or component(s); 
effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 

1 5 violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of 
the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 

20 example, the ability to bind antigens or complement); and the ability to act as an antigen in a 
vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

25 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for 
diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential 
predisposition or susceptibility to various disease states (such as disorders involving 
inflammation or immune response) or a differential response to drug administration, and this 

30 genetic information can be used to tailor preventive or therapeutic treatment appropriately. 
For example, the existence of a polymorphism associated with a predisposition to 
inflammation or autoimmune disease makes possible the diagnosis of this condition in 
humans by identifying the presence of the polymorphism. 
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Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, 
optionally involving isolation or amplification of the DNA, and identifying the presence of 
the polymorphism in the DNA. For example, PGR may be used to amplify an appropriate 
5 fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be 
subjected to allele-specific oligonucleotide hybridization (in which appropriate 
oligonucleotides are hybridized to the DNA under conditions permitting detection of a single 
base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that 
hybridizes immediately adjacent to the position of the polymorphism is extended with one or 

10 more labeled nucleotides). In addition, traditional restriction fragment length polymorphism 
analysis (using restriction enzymes that provide differential digestion of the genomic DNA 
depending on the presence or absence of the polymorphism) may be performed. Arrays with 
nucleotide sequences of the present invention can be used to detect polymorphisms. The 
array can comprise modified nucleotide sequences of the present invention in order to detect 

15 the nucleotide sequences of the present invention. In the alternative, any one of the 

nucleotide sequences of the present invention can be placed on the array to detect changes 
from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, 

20 e.g., by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against 
rheumatoid arthritis is determined in an experimental animal model system. The 

25 experimental model system is adjuvant induced arthritis in rats, and the protocol is described 
by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. 
Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a single 
injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in 
complete Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected 

30 at the base of the tail with an adjuvant mixture. The polypeptide is administered in phosphate 
buffered solution (PBS) at a dose of about 1-5 mg/kg. The control consists of administering 
PBS only. 
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The procedure for testing the effects of the test compound would consist of 
intradermal^ injecting killed Mycobacterium tuberculosis in CFA followed by immediately 
administering the test compound and subsequent treatment every other day until day 24. At 
14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an overall arthritis 
5 score may be obtained as described by J. Holoskitz above. An analysis of the data would 
reveal that the test compound would have a dramatic affect on the swelling of the joints as 
measured by a decrease of the arthritis score. 

4.11 THERAPEUTIC METHODS 

10 The compositions (including polypeptide fragments, analogs, variants and antibodies 

or other binding partners or modulators including antisense polynucleotides) of the invention 
have numerous applications in a variety of therapeutic methods. Examples of therapeutic 
applications include, but are not limited to, those exemplified herein. 



15 4.11.1 EXAMPLE 

One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode 
of administration is not particularly important, parenteral administration is preferred. An 

20 exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, 
weight, condition and response of the individual patient. Typically, the amount of 
polypeptide administered per dose will be in the range of about 0.01|u,g/kg to 100 mg/kg of 

25 body weight, with the preferred dose being about O.lfig/kg to 10 mg/kg of patient body 

weight. For parenteral administration, polypeptides of the invention will be formulated in an 
injectable form combined with a pharmaceutically acceptable parenteral vehicle. Such 
vehicles are well known in the art and examples include water, saline, Ringer's solution, 
dextrose solution, and solutions consisting of small amounts of the human serum albumin. 

30 The vehicle may contain minor amounts of additives that maintain the isotonicity and 

stability of the polypeptide or other active ingredient. The preparation of such solutions is 
within the skill of the art. 
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4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

A protein or other composition of the present invention (from whatever source 
derived, including without limitation from recombinant and non-recombinant sources and 
5 including antibodies and other binding partners of the polypeptides of the invention) may be 
administered to a patient in need, by itself, or in pharmaceutical compositions where it is 
mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of 
disorders. Such a composition may optionally contain (in addition to protein or other active 
ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other 

10 materials well known in the art. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredient(s). The characteristics of the carrier will depend on the route of administration. 
The pharmaceutical composition of the invention may also contain cytokines, lymphokines, 
or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, 

15 IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-12, IL-13, IL-14, IL-15, IFN, TNFO, TNF1, TNF2, 
G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further 
compositions, proteins of the invention may be combined with other agents beneficial to the 
treatment of the disease or disorder in question. These agents include various growth factors 
such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming 

20 growth factors (TGF-a and TGF-P), insulin-like growth factor (IGF), as well as cytokines 
described herein. 

The pharmaceutical composition may further contain other agents which either 
enhance the activity of the protein or other active ingredient or complement its activity or 
use in treatment. Such additional factors and/or agents may be included in the 

25 pharmaceutical composition to produce a synergistic effect with protein or other active 

ingredient of the invention, or to minimize side effects. Conversely, protein or other active 
ingredient of the present invention may be included in formulations of the particular clotting 
factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti- inflammatory agent to minimize side effects of the clotting factor, cytokine, 

30 lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or 

anti-inflammatory agent (such as IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, 
immunosuppressive agents). A protein of the present invention may be active in multimers 
(e.g., heterodimers or homodimers) or complexes with itself or other proteins. As a result, 
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pharmaceutical compositions of the invention may comprise a protein of the invention in 
such multimeric or complexed form. 

As an alternative to being included in a pharmaceutical composition of the invention 
including a first protein, a second protein or a therapeutic agent may be concurrently 
5 administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 
Techniques for formulation and administration of the compounds of the instant application 
may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 
latest edition. A therapeutically effective dose further refers to that amount of the compound 

10 sufficient to result in amelioration of symptoms, e.g. , treatment, healing, prevention or 

amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 1 
ingredient, administered alone, a therapeutically effective dose refers to that ingredient 
alone. When applied to a combination, a therapeutically effective dose refers to combined 

15 amounts of the active ingredients that result in the therapeutic effect, whether administered 
in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therapeutically effective amount of protein or other active ingredient of the present invention 
is administered to a mammal having a condition to be treated. Protein or other active 

20 ingredient of the present invention may be administered in accordance with the method of 
the invention either alone or in combination with other therapies such as treatments 
employing cytokines, lymphokines or other hematopoietic factors. When co- administered 
with one or more cytokines, lymphokines or other hematopoietic factors, protein or other 
active ingredient of the present invention may be administered either simultaneously with 

25 the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or 

anti-thrombotic factors, or sequentially. If administered sequentially, the attending physician 
will decide on the appropriate sequence of administering protein or other active ingredient of 
the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic factors. 

30 

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, or intestinal administration; parenteral delivery, including intramuscular, 
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subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of protein 
or other active ingredient of the present invention used in the pharmaceutical composition or 
to practice the method of the present invention can be carried out in a variety of conventional 
5 ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, 

intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient 
is preferred. 

Alternately, one may administer the compound in a local rather than systemic 
manner, for example, via injection of the compound directly into a arthritic joints or in 

10 fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the 

scarring process frequently occurring as complication of glaucoma surgery, the compounds 
may be administered topically, for example, as eye drops. Furthermore, one may administer 
the drug in a targeted drug delivery system, for example, in a liposome coated with a specific 
antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted 

15 to and taken up selectively by the afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an 
effective dosage to the desired site of action. The determination of a suitable route of 
administration and an effective dosage for a particular indication is within the level of skill 
in the art. Preferably for wound treatment, one administers the therapeutic compound 

20 directly to the site. Suitable dosage ranges for the polypeptides of the invention can be 
extrapolated from these dosages or from similar studies in appropriate animal models. 
Dosages can then be adjusted as necessary by the clinician to provide maximal therapeutic 
benefit. 

25 4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus 
may be formulated in a conventional manner using one or more physiologically acceptable 
carriers comprising excipients and auxiliaries which facilitate processing of the active 
compounds into preparations which can be used pharmaceutical^. These pharmaceutical 
30 compositions may be manufactured in a manner that is itself known, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, 
encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon 
the route of administration chosen. When a therapeutically effective amount of protein or 
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other active ingredient of the present invention is administered orally, protein or other active 
ingredient of the present invention will be in the form of a tablet, capsule, powder, solution 
or elixir. When administered in tablet form, the pharmaceutical composition of the invention 
may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, 
5 and powder contain from about 5 to 95% protein or other active ingredient of the present 
invention, and preferably from about 25 to 90% protein or other active ingredient of the 
present invention. When administered in liquid form, a liquid carrier such as water, 
petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or 
sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical 

10 composition may further contain physiological saline solution, dextrose or other saccharide 
solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When 
administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% 
by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

1 5 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, 
protein or other active ingredient of the present invention will be in the form of a 
pyrogen-free, parenterally acceptable aqueous solution. The preparation of such parenterally 
acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity, 

20 stability, and the like, is within the skill in the art. A preferred pharmaceutical composition 
for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein 
or other active ingredient of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 

25 pharmaceutical composition of the present invention may also contain stabilizers, 

preservatives, buffers, antioxidants, or other additives known to those of skill in the art. For 
injection, the agents of the invention may be formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer. For transmucosal administration, penetrants appropriate to the 

30 barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. 

For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such 
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carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a 
patient to be treated. Pharmaceutical preparations for oral use can be obtained from a solid 
excipient, optionally grinding a resulting mixture, and processing the mixture of granules, 
5 after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 

excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or 
sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, 
sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, 

10 disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or 
alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with 
suitable coatings. For this purpose, concentrated sugar solutions may be used, which may 
optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, 
and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 

1 5 Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made 
of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol 
or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler 

20 such as lactose, binders such as starches, and/or lubricants such as talc or magnesium 

stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved 
or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene 
glycols. In addition, stabilizers may be added. All formulations for oral administration 
should be in dosages suitable for such administration. For buccal administration, the 

25 compositions may take the form of tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 

30 or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin 
for use in an inhaler or insufflator may be formulated containing a powder mix of the 
compound and a suitable powder base such as lactose or starch. The compounds may be 
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formulated for parenteral administration by injection, e.g., by bolus injection or continuous 
infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampules 
or in multi-dose containers, with an added preservative. The compositions may take such 
forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 
5 formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions 
of the active compounds in water-soluble form. Additionally, suspensions of the active 
compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such 

10 as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. Alternatively, the active ingredient may be in 

1 5 powder form for constitution with a suitable vehicle, e,g. , sterile pyrogen-free water, before 
use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. In addition to the formulations described previously, the compounds may 

20 also be formulated as a depot preparation. Such long acting formulations may be 

administered by implantation (for example subcutaneously or intramuscularly) or by 
intramuscular injection. Thus, for example, the compounds may be formulated with suitable 
polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion 
exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

25 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co- 

solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic 
polymer, and an aqueous phase. The co-solvent system may be the VPD co-solvent system. 
VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 
80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD 

30 co-solvent system (VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water 

solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces 
low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system 
may be varied considerably without destroying its solubility and toxicity characteristics. 
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Furthermore, the identity of the co-solvent components may be varied: for example, other 
low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of 
polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene 
glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for 
5 dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds 
may be employed. Liposomes and emulsions are well known examples of delivery vehicles 
or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also 
may be employed, although usually at the cost of greater toxicity. Additionally, the 
compounds may be delivered using a sustained-release system, such as semipermeable 

10 matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of 
sustained-release materials have been established and are well known by those skilled in the 
art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 

1 5 ingredient stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
carriers or excipients. Examples of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, 
gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the 

20 invention may be provided as salts with pharmaceutically compatible counter ions. Such 
pharmaceutically acceptable base addition salts are those salts which retain the biological 
effectiveness and properties of the free acids and which are obtained by reaction with 
inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, 
trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, 

25 potassium benzoate, triethanol amine and the like. 

The pharmaceutical composition of the invention may be in the form of a complex of 
the protein(s) or other active ingredient(s) of present invention along with protein or peptide 
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 

30 receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) 

following presentation of the antigen by MHC proteins. MHC and structurally related 
proteins including those encoded by class I and class II MHC genes on host cells will serve 
to present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
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supplied as purified MHC -peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the invention. 
5 The pharmaceutical composition of the invention may be in the form of a liposome in 

which protein of the present invention is combined, in addition to other pharmaceutically 
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. 
Suitable lipids for liposomal formulation include, without limitation, monoglycerides, 

10 diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. 

Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, 
for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of 
which are incorporated herein by reference. 

The amount of protein or other active ingredient of the present invention in the 

15 pharmaceutical composition of the present invention will depend upon the nature and 
severity of the condition being treated, and on the nature of prior treatments which the 
patient has undergone. Ultimately, the attending physician will decide the amount of protein 
or other active ingredient of the present invention with which to treat each individual patient. 
Initially, the attending physician will administer low doses of protein or other active 

20 ingredient of the present invention and observe the patient's response. Larger doses of 
protein or other active ingredient of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used to 
practice the method of the present invention should contain about 0.01 jxg to about 100 mg 

25 (preferably about 0.1 jug to about 10 mg, more preferably about 0.1 |ug to about 1 mg) of 
protein or other active ingredient of the present invention per kg body weight. For 
compositions of the present invention which are useful for bone, cartilage, tendon or 
ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the 

30 therapeutic composition for use in this invention is, of course, in a pyrogen-free, 

physiologically acceptable form. Further, the composition may desirably be encapsulated or 
injected in a viscous form for delivery to the site of bone, cartilage or tissue damage. 
Topical administration may be suitable for wound healing and tissue repair. Therapeutically 
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useful agents other than a protein or other active ingredient of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and/or cartilage formation, the composition 
5 would include a matrix capable of delivering the protein-containing or other active 

ingredient-containing composition to the site of bone and/or cartilage damage, providing a 
structure for the developing bone and cartilage and optimally capable of being resorbed into 
the body. Such matrices may be formed of materials presently in use for other implanted 
medical applications. 

10 The choice of matrix material is based on biocompatibility, biodegradability, 

mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential matrices 
for the compositions may be biodegradable and chemically defined calcium sulfate, 
tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and poly anhydrides. 

1 5 Other potential materials are biodegradable and biologically well-defined, such as bone or 
dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix 
components. Other potential matrices are nonbiodegradable and chemically defined, such as 
sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised 
of combinations of any of the above-mentioned types of material, such as polylactic acid and 

20 hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in 
composition, such as in calcium-aluminate-phosphate and processing to alter pore size, 
particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole 
weight) copolymer of lactic acid and glycolic acid in the form of porous particles having 
diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize 

25 a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent 
the protein compositions from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, 

30 hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being 
cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents 
include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, 
carboxyvinyl polymer and poly (vinyl alcohol). The amount of sequestering agent useful 
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herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which 
represents the amount necessary to prevent desorption of the protein from the polymer 
matrix and to provide appropriate handling of the composition, yet not so much that the 
progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the 
5 opportunity to assist the osteogenic activity of the progenitor cells. In further compositions, 
proteins or other active ingredients of the invention may be combined with other agents 
beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. 
These agents include various growth factors such as epidermal growth factor (EGF), platelet 
derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-P), and 

1 0 insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
patients for such treatment with proteins or other active ingredients of the present invention. 
The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 

1 5 regeneration will be determined by the attending physician considering various factors which 
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site 
of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue 
(e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of 
administration and other clinical factors. The dosage may vary with the type of matrix used 

20 in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. 
For example, the addition of other known growth factors, such as IGF I (insulin like growth 
factor I), to the final composition, may also effect the dosage. Progress can be monitored by 
periodic assessment of tissue/bone growth and/or repair, for example, X-rays, 
histomorphometric determinations and tetracycline labeling. 

25 Polynucleotides of the present invention can also be used for gene therapy. Such 

polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in 

30 the presence of proteins of the present invention in order to proliferate or to produce a 

desired effect on or activity in such cells. Treated cells can then be introduced in vivo for 
therapeutic purposes. 
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4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve 
its intended purpose. More specifically, a therapeutically effective amount means an amount 
5 effective to prevent development of or to alleviate the existing symptoms of the subject 
being treated. Determination of the effective amount is well within the capability of those 
skilled in the art, especially in light of the detailed disclosure provided herein. For any 
compound used in the method of the invention, the therapeutically effective dose can be 
estimated initially from appropriate in vitro assays. For example, a dose can be formulated in 

10 animal models to achieve a circulating concentration range that can be used to more 

accurately determine useful doses in humans. For example, a dose can be formulated in 
animal models to achieve a circulating concentration range that includes the IC50 as 
determined in cell culture (i.e., the concentration of the test compound which achieves a 
half-maximal inhibition of the protein's biological activity). Such information can be used 

15 to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in 
cell cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% 

20 of the population) and the ED50 (the dose therapeutically effective in 50% of the population). 
The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio between LD 50 and ED 50 . Compounds which exhibit high therapeutic 
indices are preferred. The data obtained from these cell culture assays and animal studies 
can be used in formulating a range of dosage for use in human. The dosage of such 

25 compounds lies preferably within a range of circulating concentrations that include the ED 50 
with little or no toxicity. The dosage may vary within this range depending upon the dosage 
form employed and the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of the patient's 
condition. See, e,g., Fingl et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 

30 1 p. 1 . Dosage amount and interval may be adjusted individually to provide plasma levels of 
the active moiety which are sufficient to maintain the desired effects, or minimal effective 
concentration (MEC). The MEC will vary for each compound but can be estimated from in 
vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics 
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and route of administration. However, HPLC assays or bioassays can be used to determine 
plasma concentrations. 

Dosage intervals can also be determined using MEC value. Compounds should be 
administered using a regimen which maintains plasma levels above the MEC for 10-90% of 
5 the time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 
related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the 
invention will be in the range of about 0.01 p.g/kg to 100 mg/kg of body weight daily, with 
1 0 the preferred dose being about 0.1 jag/kg to 25 mg/kg of patient body weight daily, varying 
in adults and children. Dosing may be once daily, or equivalent doses may be delivered at 
longer or shorter intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
1 5 administration and the judgment of the prescribing physician. 



4.12.4 PACKAGING 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one or more unit dosage forms containing the active ingredient. The pack may, 
20 for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser 

device may be accompanied by instructions for administration. Compositions comprising a 
compound of the invention formulated in a compatible pharmaceutical carrier may also be 
prepared, placed in an appropriate container, and labeled for treatment of an indicated 
condition. 

25 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of 
the invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
30 contain an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 
Fab, Fab' and F (a b*)2 fragments, and an F a b expression library. In general, an antibody molecule 
obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ 
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from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgGi, IgG 2? and others. Furthermore, in humans, the light 
chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 
5 An isolated related protein of the invention may be intended to serve as an antigen, or 

a portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length protein can be used or, 
alternatively, the invention provides antigenic peptide fragments of the antigen for use as 

10 immunogens. An antigenic peptide fragment comprises at least 6 amino acid residues of the 
amino acid sequence of the full length protein, such as an amino acid sequence shown in 
SEQ ID NO: 912-1822, or 2479-3134, or Tables 3A, 3B, 5, or 6, and encompasses an 
epitope thereof such that an antibody raised against the peptide forms a specific immune 
complex with the full length protein or with any fragment that contains the epitope. 

15 Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 
amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 
located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 

20 antigenic peptide is a surface region of the protein, e.g., a hydrophilic region. A 

hydrophobicity analysis of the human related protein sequence will indicate which regions of 
a related protein are particularly hydrophilic and, therefore, are likely to encode surface 
residues useful for targeting antibody production. As a means for targeting antibody 
production, hydropathy plots showing regions of hydrophilicity and hydrophobicity may be 

25 generated by any method well known in the art, including, for example, the Kyte Doolittle or 
the Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and 
Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol. 
Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
Antibodies that are specific for one or more domains within an antigenic protein, or 

30 derivatives, fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 
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The term "specific for" indicates that the variable regions of the antibodies of the 
invention recognize and bind polypeptides of the invention exclusively (i.e., able to 
distinguish the polypeptide of the invention from other similar polypeptides despite sequence 
identity, homology, or similarity found in the family of polypeptides), but may also interact 
with other proteins (for example, S. aureus protein A or other antibodies in ELISA 
techniques) through interactions with sequences outside the variable region of the antibodies, 
and in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely practiced in 
the art. For a comprehensive discussion of such assays, see Harlow et al. (Eds), Antibodies 
A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring Harbor, NY (1988), 
Chapter 6. Antibodies that recognize and bind fragments of the polypeptides of the 
invention are also contemplated, provided that the antibodies are first and foremost specific 
for, as defined above, full-length polypeptides of the invention. As with antibodies that are 
specific for full length polypeptides of the invention, antibodies of the invention that 
recognize fragments are those which can distinguish polypeptides from the same family of 
polypeptides despite inherent sequence identity, homology, or similarity found in the family 
of proteins. 

Antibodies of the invention are useful for, for example, therapeutic purposes (by 
modulating activity of a polypeptide of the invention), diagnostic purposes to detect or 
quantitate a polypeptide of the invention, as well as purification of a polypeptide of the 
invention. Kits comprising an antibody of the invention for any of the purposes described 
herein are also comprehended. In general, a kit of the invention also includes a control 
antigen for which the antibody is immunospecific. The invention further provides a 
hybridoma that produces an antibody according to the invention. Antibodies of the 
invention are useful for detection and/or purification of the polypeptides of the invention. 

Monoclonal antibodies binding to the protein of the invention may be useful 
diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting and 
preventing the metastatic spread of the cancerous cells, which may be mediated by the 
protein. 
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The labeled antibodies of the present invention can be used for in vitro, in vivo, and 
in situ assays to identify cells or tissues in which a fragment of the polypeptide of interest is 
expressed. The antibodies may also be used directly in therapies or other diagnostics. The 
present invention further provides the above-described antibodies immobilized on a solid 
support. Examples of such solid supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and Sepharose®, acrylic resins and such as polyacrylamide 
and latex beads. Techniques for coupling antibodies to such solid supports are well known 
in the art (Weir, D.M. et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10 (1986); Jacoby, W.D. et al., Meth. 
Enzym. 34 Academic Press, N.Y. (1974)). The immobilized antibodies of the present 
invention can be used for in vitro, in vivo, and in situ assays as well as for immuno-affinity 
purification of the proteins of the present invention. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: 
A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, incorporated herein by reference). Some of these antibodies are 
discussed below. 

4.13.1 POLYCLONAL ANTIBODIES 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with the 
native protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples 
of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, 
serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can 
further include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 
aluminum hydroxide), surface-active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as 
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Bacille Calmette-Guerin and Coiynebacterium parvum, or similar immunostimulatory 
agents. Additional examples of adjuvants that can be employed include MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
5 isolated from the mammal (e.g., from the blood) and further purified by well known 

techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
10 chromatography. Purification of immunoglobulins is discussed, for example, by D. 

Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 



4.13.2 MONOCLONAL ANTIBODIES 

15 The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 

used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) 
of the monoclonal antibody are identical in all the molecules of the population. MAbs thus 

20 contain an antigen-binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256, 495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 

25 immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes can be 
immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells 
30 of human origin are desired, or spleen cells or lymph node cells are used if non-human 

mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
(Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59- 



WO 03/054152 PCT/US02/39555 

89 

103). Immortalized cell lines are usually transformed mammalian cells, particularly 
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells can be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfused, 
5 immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 

10 level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, 
San Diego, California and the American Type Culture Collection, Manassas, Virginia. 
Human myeloma and mouse-human heteromyeloma cell lines also have been described for 

15 the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); 
Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel 
Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the 

20 binding specificity of monoclonal antibodies produced by the hybridoma cells is determined 
by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107, 220 (1980). Preferably, 

25 antibodies having a high degree of specificity and a high binding affinity for the target 
antigen are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods. Suitable culture media for this 
purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 
30 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
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such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
5 the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as 
a preferred source of such DNA. Once isolated, the DNA can be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster 

10 ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, 
to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA 
also can be modified, for example, by substituting the coding sequence for human heavy and 
light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the 

1 5 immunoglobulin coding sequence all or part of the coding sequence for a non- 

immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted 
for the constant domains of an antibody of the invention, or can be substituted for the 
variable domains of one antigen-combining site of an antibody of the invention to create a 
chimeric bivalent antibody. 

20 

4.13.3 HUMANIZED ANTIBODIES 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 

25 the administered immunoglobulin. Humanized forms of antibodies are chimeric 

immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab f , 
F(ab') 2 or other antigen-binding subsequences of antibodies) that are principally comprised 
of the sequence of a human immunoglobulin, and contain minimal sequence derived from a 
non-human immunoglobulin. Humanization can be performed following the method of 

30 Winter and co-workers (Jones et al., Nature, 321, 522-525 (1986); Riechmann et al., Nature, 
332, 323-327 (1988); Verhoeyen et al., Science, 239, 1534-1536 (1988)), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. (See 
also U.S. Patent No. 5,225,539). In some instances, Fv framework residues of the human 



WO 03/054152 PCT/US02/39555 

91 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
can also comprise residues that are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
5 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework regions are those of a human immunoglobulin 
consensus sequence. The humanized antibody optimally also will comprise at least a portion 
of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin 
(Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2, 593-596 
10 (1992)). 

4,13.4 HUMAN ANTIBODIES 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from 

1 5 human genes. Such antibodies are termed "human antibodies", or "fully human antibodies" 
herein. Human monoclonal antibodies can be prepared by the trioma technique; the human 
B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 

20 monoclonal antibodies may be utilized in the practice of the present invention and may be 
produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80, 
2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et 
al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 
In addition, human antibodies can also be produced using additional techniques, 

25 including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227, 381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 

30 humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779- 
783 (1992)); Lonberg et al. (Nature 368, 856-859 (1994)); Morrison (Nature 368, 812-13 
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(1994) ); Fishwild et al, (Nature Biotechnology 14, 845-51 (1996)); Neuberger (Nature 
Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13, 65-93 

(1995) ). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
that are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains 
in the nonhuman host have been incapacitated, and active loci encoding human heavy and 
light chain immunoglobulins are inserted into the host's genome. The human genes are 
incorporated, for example, using yeast artificial chromosomes containing the requisite 
human DNA segments. An animal which provides all the desired modifications is then 
obtained as progeny by crossbreeding intermediate transgenic animals containing fewer than 
the full complement of the modifications. The preferred embodiment of such a nonhuman 
animal is a mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 
96/33735 and WO 96/34096. This animal produces B cells that secrete fully human 
immunoglobulins. The antibodies can be obtained directly from the animal after 
immunization with an immunogen of interest, as, for example, a preparation of a polyclonal 
antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment genes 
from at least one endogenous heavy chain locus in an embryonic stem cell to prevent 
rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem cell 
a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable 
marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,916,771 . It includes introducing an expression vector that 
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contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a light 
chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The 
hybrid cell expresses an antibody containing the heavy chain and the light chain. 
5 In a further improvement on this procedure, a method for identifying a clinically 

relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

1 0 4.13.5 FAB FRAGMENTS AND SINGLE CHAIN ANTIBODIES 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F ab expression 
libraries (see e.g., Huse, et al., 1989 Science 246, 1275-1281) to allow rapid and effective 

15 identification of monoclonal F ab fragments with the desired specificity for a protein or 

derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F( a b')2 fragment produced by pepsin digestion of an antibody molecule; 
(ii) an F a b fragment generated by reducing the disulfide bridges of an F (ab -)2 fragment; (iii) an 

20 F ab fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) F v fragments. 

4.13.6 BISPECIFIC ANTIBODIES 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
25 that have binding specificities for at least two different antigens. In the present case, one of 
the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 
receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
30 recombinant production of bispecific antibodies is based on the co-expression of two 

immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305, 537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
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produce a potential mixture often different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually accomplished 
by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993, and in Traunecker et al, 1991 EMBO J., 10, 3655-3659. 
5 Antibody variable domains with the desired binding specificities (antibody-antigen 

combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 
region (CHI) containing the site necessary for light-chain binding present in at least one of 

10 the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzymology, 121, 210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a 

15 pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
that are recovered from recombinant cell culture. The preferred interface comprises at least 
a part of the CH3 region of an antibody constant domain. In this method, one or more small 
amino acid side chains from the interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or 

20 similar size to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 
threonine). This provides a mechanism for increasing the yield of the heterodimer over other 
unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full-length antibodies or antibody fragments 

25 (e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from 

antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229, 81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 
fragments. These fragments are reduced in the presence of the dithiol complexing agent 

30 sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. 
The Fab 5 fragments generated are then converted to thionitrobenzoate (TNB) derivatives. 
One of the Fab' -TNB derivatives is then reconverted to the Fab 5 -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other Fab' -TNB 
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derivative to form the bispecific antibody. The bispecific antibodies produced can be used 
as agents for the selective immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med., 175, 217-225 (1992) 
5 describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each 
Fab 5 fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as 
trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

10 Various techniques for making and isolating bispecific antibody fragments directly 

from recombinant cell culture have also been described. For example, bispecific antibodies 
have been produced using leucine zippers. Kostelny et ah, J. Immunol. 148(5), 1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were reduced 

15 at the hinge region to form monomers and then re-oxidized to form the antibody 

heterodimers. This method can also be utilized for the production of antibody homodimers. 
The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90, 
6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody 
fragments. The fragments comprise a heavy-chain variable domain (V H ) connected to a 

20 light-chain variable domain (V L ) by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the V H and V L domains of one fragment are 
forced to pair with the complementary V L and V H domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody 
fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et 

25 al., J. Immunol. 152, 5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147, 60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm 

30 of an immunoglobulin molecule can be combined with an arm which binds to a triggering 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), 
or Fc receptors for IgG (FC7R), such as FcyRI (CD64), FC7RII (CD32) and FC7RIII (CD 16) 
so as to focus cellular defense mechanisms to the cell expressing the particular antigen. 
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Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a 
particular antigen. These antibodies possess an antigen-binding arm and an arm which binds 
a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPT A, DOTA, or TETA. 
Another bispecific antibody of interest binds the protein antigen described herein and further 
5 binds tissue factor (TF). 

4.13.7 HETEROCONJXJGATE ANTIBODIES 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 

10 antibodies have, for example, been proposed to target immune system cells to unwanted cells 
(U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins can be constructed using a disulfide exchange reaction 

15 or by forming a thioether bond. Examples of suitable reagents for this purpose include 
iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. 
Patent No. 4,676,980. 

4.13.8 EFFECTOR FUNCTION ENGINEERING 

20 It can be desirable to modify the antibody of the invention with respect to effector 

function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- 

25 mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
al., J. Exp Med., 176, 1191-1195 (1992) and Shopes, J. Immunol., 148, 2918-2922 (1992). 
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using 
heterobifunctional cross-linkers as described in Wolff et al. Cancer Research, 53, 2560- 
2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and can 

30 thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al,, 
Anti-Cancer Drug Design, 3, 219-230 (1989). 



4.13.9 IMMUNOCONJUGATES 
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The invention also pertains to immunoconjugates comprising an antibody conjugated 
to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 
toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive 
isotope (i.e., a radioconjugate). 
5 Chemotherapeutic agents useful in the generation of such immunoconjugates have 

been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 

10 (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 
tricothecenes. A variety of radionuclides are available for the production of radioconjugated 
antibodies. Examples include 212 Bi, 131 I, 131 In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 

1 5 Afunctional protein-coupling agents such as N-succinimidyl-3 -(2-pyridyldithiol) propionate 
(SPDP), iminothiolane (IT), Afunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates 

20 (such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro- 
2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al., Science, 238: 1098 (1987). Carbon- 14 -labeled l-isothiocyanatobenzyl-3- 
methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 
conjugation of radionucleotide to the antibody. See W094/1 1026. 

25 In another embodiment, the antibody can be conjugated to a "receptor" (such 

streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 



4.14 COMPUTER READABLE SEQUENCES 
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In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media" 
refers to any medium which can be read and accessed directly by a computer. Such media 
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc 
5 storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture 
comprising computer readable medium having recorded thereon a nucleotide sequence of the 

10 present invention. As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently known 
methods for recording information on computer readable medium to generate manufactures 
comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 

1 5 computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. The choice of the data storage structure will generally be based on the means 
chosen to access the stored information. In addition, a variety of data processor programs 
and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented in a 

20 word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any 
number of data processor structuring formats (e.g. text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence information of 

25 the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-91 1, or 1823-2478 or a 
representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the 
nucleotide sequences of SEQ ID NO: 1-911, or 1823-2478 in computer readable form, a 
skilled artisan can routinely access the sequence information for a variety of purposes. 

30 Computer software is publicly available which allows a skilled artisan to access sequence 
information provided in a computer readable medium. The examples which follow 
demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol. 
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search 



WO 03/054152 PCT/US02/39555 

99 

algorithms on a Sybase system is used to identify open reading frames (ORFs) within a 
nucleic acid sequence. Such ORFs may be protein-encoding fragments and may be useful in 
producing commercially important proteins such as enzymes used in fermentation reactions 
and in the production of commercially useful metabolites. 
5 As used herein, "a computer-based system" refers to the hardware means, software 

means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, and 
data storage means. A skilled artisan can readily appreciate that any one of the currently 

10 available computer-based systems are suitable for use in the present invention. As stated 
above, the computer-based systems of the present invention comprise a data storage means 
having stored therein a nucleotide sequence of the present invention and the necessary 
hardware means and software means for supporting and implementing a search means. As 
used herein, "data storage means" refers to memory which can store nucleotide sequence 

1 5 information of the present invention, or a memory access means which can access 

manufactures having recorded thereon the nucleotide sequence information of the present 
invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target structural 

20 motif with the sequence information stored within the data storage means. Search means are 
used to identify fragments or regions of a known sequence which match a particular target 
sequence or target motif. A variety of known algorithms are disclosed publicly and a variety 
of commercially available software for conducting search means are and can be used in the 
computer-based systems of the present invention. Examples of such software includes, but 

25 is not limited to, Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA 

(NPOLYPEPTIDEI A) . A skilled artisan can readily recognize that any one of the available 
algorithms or implementing software packages for conducting homology searches can be 
adapted for use in the present computer-based systems. As used herein, a "target sequence" 
can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more 

30 amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the 
less likely a target sequence will be present as a random occurrence in the database. The 
most preferred sequence length of a target sequence is from about 10 to 300 amino acids, 
more preferably from about 30 to 100 nucleotide residues. However, it is well recognized 



10 
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that searches for commercially important fragments, such as sequence fragments involved in 
gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) are chosen based on 
a three-dimensional configuration which is formed upon the folding of the target motif. 
There are a variety of target motifs known in the art. Protein target motifs include, but are 
not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs include, 
but are not limited to, promoter sequences, hairpin structures and inducible expression 
elements (protein binding sequences). 



4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used 
to control gene expression through triple helix formation or antisense DNA or RNA, both of 
which methods are based on the binding of a polynucleotide sequence to DNA or RNA. 

1 5 Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and 
are designed to be complementary to a region of the gene involved in transcription (triple 
helix-see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 15241, 456 
(1988); and Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense- 
Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of 

20 Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization 
blocks translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences of 
the present invention is necessary for the design of an antisense or triple helix 

25 oligonucleotide. 

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a 
30 nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise 
associated with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
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polynucleotide for a period sufficient to form the complex, and detecting the complex, so 
that if a complex is detected, a polynucleotide of the invention is detected in the sample. 
Such methods can also comprise contacting a sample under stringent hybridization 
conditions with nucleic acid primers that anneal to a polynucleotide of the invention under 
5 such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is 
amplified, a polynucleotide of the invention is detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the 
polypeptide for a period sufficient to form the complex, and detecting the complex, so that if 

10 a complex is detected, a polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying 
for binding of the nucleic acid probes or antibodies to components within the test sample. 
Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

15 Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. 
One skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or immunological assay formats can readily be adapted to employ the nucleic 
acid probes or antibodies of the present invention. Examples of such assays can be found in 

20 Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in 
Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 
(1985); Tijssen, P., Practice and Theory of immunoassays: Laboratory Techniques in 
Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The 

25 Netherlands (1985). The test samples of the present invention include cells, protein or 
membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
urine. The test sample used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample to 
be assayed. Methods for preparing protein extracts or membrane extracts of cells are well 

30 known in the art and can be readily be adapted in order to obtain a sample which is 
compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the 
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invention provides a compartment kit to receive, in close confinement, one or more 
containers which comprises: (a) a first container comprising one of the probes or antibodies 
of the present invention; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of a bound probe or 
antibody. 

In detail, a compartment kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers or 
strips of plastic or paper. Such containers allows one to efficiently transfer reagents from 
one compartment to another compartment such that the samples and reagents are not 
cross-contaminated, and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the antibodies used 
in the assay, containers which contain wash reagents (such as phosphate buffered saline, 
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound 
antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled 
secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, 
or antibody binding reagents which are capable of reacting with the labeled antibody. One 
skilled in the art will readily recognize that the disclosed probes and antibodies of the present 
invention can be readily incorporated into one of the established kit formats which are well 
known in the art. 

4.17 MEDICAL IMAGING 

The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or 
infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve 
chemical attachment of a labeling or imaging agent, administration of the labeled 
polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging the labeled 
polypeptide in vivo at the target site. 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present 
invention further provides methods of obtaining and identifying agents which bind to a 
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polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set forth 
in SEQ ID NO: 1-911, or 1823-2478, or bind to a specific domain of the polypeptide 
encoded by the nucleic acid. In detail, said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the 
5 present invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide 
of the invention for a time sufficient to form a polynucleotide/compound complex, and 

10 detecting the complex, so that if a polynucleotide/compound complex is detected, a 
compound that binds to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to 
a polypeptide of the invention can comprise contacting a compound with a polypeptide of 
the invention for a time sufficient to form a polypeptide/compound complex, and detecting 

1 5 the complex, so that if a polypeptide/compound complex is detected, a compound that binds 
to a polynucleotide of the invention is identified. 

Methods for identifying compounds that bind to a polypeptide of the invention can 
also comprise contacting a compound with a polypeptide of the invention in a cell for a time 
sufficient to form a polypeptide/compound complex, wherein the complex drives expression 

20 of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
sequence expression, so that if a polypeptide/compound complex is detected, a compound 
that binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 

25 activity observed in the absence of the compound). Alternatively, compounds identified via 
such methods can include compounds which modulate the expression of a polynucleotide of 
the invention (that is, increase or decrease expression relative to expression levels observed 
in the absence of the compound). Compounds, such as compounds identified via the 
methods of the invention, can be tested using standard assays well known to those of skill in 

30 the art for their ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be 
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selected and screened at random or rationally selected or designed using protein modeling 
techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents 
and the like are selected at random and are assayed for their ability to bind to the protein 
5 encoded by the ORF of the present invention. Alternatively, agents may be rationally 

selected or designed. As used herein, an agent is said to be "rationally selected or designed" 
when the agent is chosen based on the configuration of the particular protein. For example, 
one skilled in the art can readily adapt currently available procedures to generate peptides, 
pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in 

10 order to generate rationally designed antipeptide peptides, for example see Hurby et al., 
Application of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's 
Guide, W.H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 
28:9230-8 (1989), or pharmaceutical agents, or the like. 

In addition to the foregoing, one class of agents of the present invention, as broadly 

1 5 described, can be used to control gene expression through binding to one of the ORFs or 
EMFs of the present invention. As described above, such agents can be randomly screened 
or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 
sequence specific or element specific agents, modulating the expression of either a single 
ORF or multiple ORFs which rely on the same EMF for expression control. One class of 

20 DNA binding agents are agents which contain base residues which hybridize or form a triple 
helix formation by binding to DNA or RNA. Such agents can be based on the classic 
phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric 
derivatives which have base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 

25 designed to be complementary to a region of the gene involved in transcription (triple helix - 
see Lee et al., Nucl. Acids Res. 6, 3073 (1979); Cooney et al., Science 241, 456 (1988); and 
Dervan et al., Science 251, 1360 (1991)) or to the mRNA itself (antisense-Okano, J. 
Neurochem. 56, 560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally results in 

30 a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the sequences of 
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the present invention is necessary for the design of an antisense or triple helix 
oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention 
can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the 
5 ORFs of the present invention can be formulated using known techniques to generate a 
pharmaceutical composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic 
10 acid hybridization probes capable of hybridizing with naturally occurring nucleotide 

sequences. The hybridization probes of the subject invention may be derived from any of 
the nucleotide sequences SEQ ID NO: 1-91 1 ? or 1823-2478. Because the corresponding 
gene is only expressed in a limited number of tissues, a hybridization probe derived from 
any of the nucleotide sequences SEQ ID NO: 1-91 1, or 1823-2478 can be used as an 
15 indicator of the presence of RNA of cell type of such a tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PGR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 
additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used 
in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. 
20 The probe will comprise a discrete nucleotide sequence for the detection of identical 

sequences or a degenerate pool of possible sequences for identification of closely related 
genomic sequences. 

Other means for producing specific hybridization probes for nucleic acids include the 
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such 

25 vectors are known in the art and are commercially available and may be used to synthesize 
RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or 
SP6 RNA polymerase and the appropriate radioactively labeled nucleotides. The nucleotide 
sequences may be used to construct hybridization probes for mapping their respective 
genomic sequences. The nucleotide sequence provided herein may be mapped to a 

30 chromosome or specific regions of a chromosome using well-known genetic and/or 

chromosomal mapping techniques. These techniques include in situ hybridization, linkage 
analysis against known chromosomal markers, hybridization screening with libraries or 
flow-sorted chromosomal preparations specific to known chromosomes, and the like. The 
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technique of fluorescent in situ hybridization of chromosome spreads has been described, 
among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic 
Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 
5 chromosome mapping techniques may be correlated with additional genetic map data. 
Examples of genetic map data can be found in the 1994 Genome Issue of Science 
(265: 198 If). Correlation between the location of a nucleic acid on a physical chromosomal 
map and a specific disease (or predisposition to a specific disease) may help delimit the 
region of DNA associated with that genetic disease. The nucleotide sequences of the subject 
10 invention may be used to detect differences in gene sequences between normal, carrier or 
affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for 
example, directly synthesizing the oligonucleotide by chemical means, as is commonly 

1 5 practiced using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those 
of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy 
is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can 
be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6), 1469- 

20 72); using UV light (Nagata et al, 1985; Dahlen et al, 1987; Morrissey & Collins, (1989) Mol. 
Cell Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al, 1988; 
1989); all references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 
interaction as a linker. For example, Broude et al (1994) Proc. Natl. Acad. Sci. USA 91(8), 

25 3072-6, describe the use of biotinylated probes, although these are duplex probes, that are 
immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be 
purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating 
any surface with streptavidin. Biotinylated probes may be purchased from various sources, 
such as, e.g., Operon Technologies (Alameda, CA). 

30 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. 

Nunc Laboratories have developed a method by which DNA can be covalently bound to the 
microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with 
secondary amino groups (>NH) that serve as bridgeheads for further covalent coupling. 
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CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound 
to CovaLink exclusively at the 5-end by a phosphoramidate bond, allowing immobilization of 
more than 1 pmol of DNA (Rasmussen etal, (1991) Anal. Biochem. 198(1) 138-42). 

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5 f -end 
5 has been described (Rasmussen et al., (1991). In this technology, a phosphoramidate bond is 
employed (Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as 
immobilization using only a single covalent bond is preferred. The phosphoramidate bond joins 
the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer 
arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link 

10 an oligonucleotide to CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus 
must have a 5 f -end phosphate group. It is, perhaps, even possible for biotin to be covalently 
bound to CovaLink and then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/pl) and 
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1- 

15 methylimidazole, pH 7.0 (1-Melm7), is then added to a final concentration of 10 mM 1 -Melm 7 . 
A ss DNA solution is then dispensed into CovaLink NH strips (75 jul/well) standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), 
dissolved in 10 mM 1-Melm7, is made fresh and 25 pi added per well. The strips are incubated 
for 5 hours at 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; 

20 first the wells are washed 3 times, then they are soaked with washing solution for 5 min., and 
finally they are washed 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS 
heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is 
that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated 

25 herein by reference. This method of preparing an oligonucleotide bound to a support involves 
attaching a nucleoside 3 '-reagent through the phosphate group by a covalent phosphodiester link 
to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on 
the supported nucleoside and protecting groups removed from the synthetic oligonucleotide 
chain under standard conditions that do not cleave the oligonucleotide from the support. 

30 Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
arrays may be employed. For example, addressable laser-activated photodeprotection may be 
employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described 
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by Fodor et al (1991) Science 251(4995), 767-73, incorporated herein by reference. Probes 
may also be immobilized on nylon supports as described by Van Ness et al (1991) Nucleic 
Acids Res., 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) 
Anal. Biochem. 169(1), 104-8; all references being specifically incorporated herein. 
5 To link an oligonucleotide to a nylon support, as described by Van Ness et al (1991), 

requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of 
oligonucleotides with cyanuric chloride. 

One particular way to prepare support bound oligonucleotides is to utilize the 
light-generated synthesis described by Pease et al, (1994) Proc. Natl. Acad. Sci., USA 91(1 1), 

1 0 5022-6, incorporated herein by reference). These authors used current photolithographic 
techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These 
methods, in which light is used to direct the synthesis of oligonucleotide probes in high-density, 
miniaturized arrays, utilize photolabile 5'-protected N-acyl-deoxynucleoside phosphoramidites, 
surface linker chemistry and versatile combinatorial synthesis strategies. A matrix of 256 

1 5 spatially defined oligonucleotide probes may be generated in this manner. 

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, 
genomic DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC 
inserts, and KNA, including mRNA without any amplification steps. For example, Sambrook 
20 et al (1989) describes three protocols for the isolation of high molecular weight DNA from 
mammalian cells (p. 9.14-9.23). 

DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PGR or other amplification methods. 
Samples may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA 
25 samples may be prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of 
skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of 
Sambrook et al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1990) 
30 Nucleic Acids Res. 1 8(24), 7455-6, incorporated herein by reference). In this method, DNA 
samples are passed through a small French pressure cell at a variety of low to intermediate 
pressures. A lever device allows controlled application of low to intermediate pressures to the 
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cell. The results of these studies indicate that low-pressure shearing is a useful alternative to 

sonic and enzymatic DNA fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the 

two base recognition endonuclease, CWJI, described by Fitzgerald et al (1992) Nucleic Acids 
5 Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and 

fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun 

cloning and sequencing. 

The restriction endonuclease CvzJI normally cleaves the recognition sequence PuGCPy 

between the G and C to leave blunt ends. Atypical reaction conditions, which alter the 
1 0 specificity of this enzyme (Cvi JI**), yield a quasi-random distribution of DNA fragments form 

the small molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated 

the randomness of this fragmentation strategy, using a CWJI** digest of pUC19 that was size 

fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z 

minus Ml 3 cloning vector. Sequence analysis of 76 clones showed that CvzJI** restricts 
1 5 pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated 

at a rate consistent with random fragmentation. 

As reported in the literature, advantages of this approach compared to sonication and 

agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 jug instead of 

2-5 |Ltg); and fewer steps are involved (no preligation, end repair, chemical extraction, or 
20 agarose gel electrophoresis and elution are needed). 

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, 

it is important to denature the DNA to give single stranded pieces available for hybridization. 

This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is 

then cooled quickly to 2°C to prevent renaturation of the DNA fragments before they are 
25 contacted with the chip. Phosphate groups must also be removed from genomic DNA by 

methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon 
membrane. Spotting may be performed by using arrays of metal pins (the positions of which 
30 correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a 
DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density 
of the wells is achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the 
type of label used. By avoiding spotting in some preselected number of rows and columns, 



WO 03/054152 PCT/US02/39555 

110 

separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic 
segment of DNA (or the same gene) from different individuals, or may be different, overlapped 
genomic clones. Each of the subarrays may represent replica spotting of the same samples. In 
one example, a selected gene segment may be amplified from 64 patients. For each patient, the 
5 amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). 
A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be 
spotted on one 8 x 12 cm membrane. Subarrays may contain 64 samples, one from each patient. 
Where the 96 subarrays are identical, the dot span may be 1 mm 2 and there may be a 1 mm 
space between subarrays. 

1 0 Another approach is to use membranes or plates (available from NUNC, Naperville, 

Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the 
membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell 
plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure 
to flat phosphor-storage screens or x-ray films. 

1 5 The present invention is illustrated in the following examples. Upon consideration of 

the present disclosure, one of skill in the art will appreciate that many other embodiments and 
variations may be made in the scope of the present invention. Accordingly, it is intended that 
the broader aspects of the present invention not be limited to the disclosure of the following 
examples. The present invention is not to be limited in scope by the exemplified embodiments 

20 which are intended as illustrations of single aspects of the invention, and compositions and 
methods which are functionally equivalent are within the scope of the invention. Indeed, 
numerous modifications and variations in the practice of the invention are expected to occur to 
those skilled in the art upon consideration of the present preferred embodiments. Consequently, 
the only limitations which should be placed upon the scope of the invention are those which 

25 appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated 
by reference in their entirety. 

5.0 EXAMPLES 

5.1 EXAMPLE 1 

30 Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from 
various human tissues and in some cases isolated from a genomic library derived from human 
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chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing 
techniques. The inserts of the library were amplified with PCR using primers specific for the 
vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon 
membrane filters and screened with oligonucleotide probes (e.g., 7-mers) to obtain signature 
5 sequences. The clones were clustered into groups of similar or identical sequences. 
Representative clones were selected for sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied 
1 0 Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. 

5.2 EXAMPLE 2 
Assemblage of Novel Nucleic Acids 

The contigs or nucleic acids of the present invention, designated as SEQ ID NO: 1823- 
2478 were assembled using an EST sequence as a seed. Then a recursive algorithm was used to 

1 5 extend the seed EST into an extended assemblage, by pulling additional sequences from 
different databases (i.e., Hyseq's database containing EST sequences, dbEST, gb pri, and 
UniGene, and exons from public domain genomic sequences predicated by GenScan) that 
belong to this assemblage. The algorithm terminated when there were no additional sequences 
from the above databases that would extend the assemblage. Further, inclusion of component 

20 sequences into the assemblage was based on a BLASTN hit to the extending assemblage with 
BLAST score greater than 300 and percent identity greater than 95%. 

5.3 EXAMPLE 3 
Novel Nucleic Acids 

The novel nucleic acids of the present invention were assembled from sequences that 
25 were obtained from a cDNA library by methods described in Example 1 above, and in some 
cases sequences obtained from one or more public databases. The nucleic acids were 
assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the 
seed EST into an extended assemblage, by pulling additional sequences from different 
databases (Hyseq's database containing EST sequences, dbEST, gb pri, and UniGene) that 
30 belong to this assemblage. The algorithm terminated when there was no additional sequences 
from the above databases that would extend the assemblage. Inclusion of component sequences 
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into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST 
score greater than 300 and percent identity greater than 95%. 

Using PHRAP (Univ. of Washington) or CAP4 (Paracel), a full-length gene cDNA 
sequence and its corresponding protein sequence were generated from the assemblage. Any 
5 frame shifts and incorrect stop codons were corrected by hand editing. During editing, the 
sequences were checked using FASTY and/or BLAST against Genebank (i.e., dbEST, gb pri, 
UniGene, and Genpept) and the Geneseq (Derwent). Other computer programs which may 
, have been used in the editing process were phredPhrap and Consed (University of Washington) 
and ed-ready, ed-ext and cg-zip-2 (Hyseq, Inc.). The full-length nucleotide and amino acid 
10 sequences, including splice variants resulting from these procedures are shown in the Sequence 
Listing as SEQ ID NO: 1-1822. 

Table 1 shows the various tissue sources of SEQ ID NO: 1-911. 

The homologs for polypeptides SEQ ID NO: 912-1822, that correspond to nucleotide 
sequences SEQ ID NO: 1-91 1 were obtained by a BLASTP version 2.0al 1 9MP- WashU 
15 searches against Genpept and Geneseq (Derwent) using BLAST algorithm. The results 

showing homologues for SEQ ID NO: 912-1822 from Genpept 127-129 are shown in Table 
2A. The results showing homologues for SEQ ID NO: 912-1822 from Genpept 131 are 
shown in Table 2B. 

Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J. 

20 Comp. Biol., Vol. 6, 219-235 (1999), http://motif.stanford.edu/ematrix-search/ herein 
incorporated by reference), all the polypeptide sequences were examined to determine 
whether they had identifiable signature regions. Scoring matrices of the eMatrix software 
package are derived from the BLOCKS, PRINTS, PFAM, PRODOM, and DOMO 
databases. Table 3 shows the accession number of the homologous eMatrix signature found 

25 in the indicated polypeptide sequence, its description, and the results obtained which include 
accession number subtype; raw score; p-value; and the position of signature in amino acid 
sequence. The results showing homologous signatures for SEQ ID NO: 912-1822 from 
eMatrix version 1.0 are shown in Table 3 A. The results showing homologous signatures for 
SEQ ID NO: 912-1822 from eMatrix version 2.0 are shown in Table 3B. 

30 Using the Pfam software program (Sonrihammer et al., Nucleic Acids Res., Vol. 

26(1) pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences 
were examined for domains with homology to certain peptide domains. Table 4A shows the 
name of the Pfam model found, the description, the e-value and the Pfam score for the 
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identified model within the sequence using Pfam versions 7.0 and 7.2. Table 4B shows the 
name of the Pfam model found, the description, the e-value and the Pfam score for the 
identified model within the sequence using Pfam version 7.5. Further description of the 
Pfam models can be found at http://pfam.wustl.edu/ . 
5 The GeneAtlas™ software package (Molecular Simulations Inc. (MSI), San Diego, 

CA) was used to predict the three-dimensional structure models for the polypeptides 
encoded by SEQ ID NO: 1-911 (i.e. SEQ ID NO: 912-1 822). Models were generated by (1) 
PSI-BLAST which is a multiple alignment sequence profile-based searching developed by 
Altschul et al, (Nucl. Acids. Res. 25, 3389-3408 (1997)), (2) High Throughput Modeling 

1 0 (HTM) (Molecular Simulations Inc. (MSI) San Diego, CA,) which is an automated sequence 
and structure searching procedure nitt p://www.msi.comO , and (3) SeqFold™ which is a fold 
recognition method described by Fischer and Eisenberg (J. Mol. Biol. 209, 779-791 (1998)). 
This analysis was carried out, in part, by comparing the polypeptides of the invention with 
the known NMR (nuclear magnetic resonance) and x-ray crystal three-dimensional structures 

15 as templates. Table 5 shows: "PDB ID", the Protein DataBase (PDB) identifier given to 
template structure; "Chain ID", identifier of the subcomponent of the PDB template 
structure; "Compound Information", information of the PDB template structure and/or its 
subcomponents; "PDB Function Annotation" gives function of the PDB template as 
annotated by the PDB files (http:/www.rcsb.org/PDB/) : start and end amino acid position of 

20 the protein sequence aligned; PSI-BLAST score, the verify score, the SeqFold score, and the 
Potential(s) of Mean Force (PMF). The verify score is produced by GeneAtlas™ software 
(MSI), is based on Dr. Eisenberg's Profile-3D threading program developed in Dr. David 
Eisenberg's laboratory (US patent no. 5,436,850 and Luthy, Bowie, and Eisenberg, Nature, 
356:83-85 (1992)) and a publication by R. Sanchez and A. Sali, Proc. Natl. Acad. Sci. USA, 

25 95: 13597-12502. The verify score produced by GeneAtlas normalizes the verify score for 
proteins with different lengths so that a unified cutoff can be used to select good models as 
follows: Verify score (normalized) = (raw score - 1/2 high score)/(l/2 high score) 

The PFM score, produced by GeneAtlas™ software (MSI), is a composite scoring 
function that depends in part on the compactness of the model, sequence identity in the 

30 alignment used to build the model, pairwise and surface mean force potentials (MFP). As 
given in table 5, a verify score between 0 to 1.0, with 1 being the best, represents a good 
model. Similarly, a PMF score between 0 to 1.0, with 1 being the best, represents a good 
model. A SeqFold™ score of more than 50 is considered significant. A good model may 
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also be determined by one of skill in the art based all the information in Table 5 taken in 
totality. 

Table 6 shows the position of the signal peptide in each of the polypeptides and the 
maximum score and mean score associated with that signal peptide using Neural Network 
5 SignalP VL 1 program (from Center for Biological Sequence Analysis, The Technical 
University of Denmark). The process for identifying prokaryotic and eukaryotic signal 
peptides and their cleavage sites are also disclosed by Henrik Nielson, Jacob Engelbrecht, 
Soren Brunak, and Gunnar von Heijne in the publication " Identification of prokaryotic and 
eukaryotic signal peptides and prediction of their cleavage sites" Protein Engineering, Vol. 
10 10, no. 1, pp. 1-6 (1997), incorporated herein by reference. A maximum S score and a mean 
S score, as described in the Nielson et al reference, was obtained for the polypeptide 
sequences. 

Table 7 correlates nucleotide sequences of the invention to a specific chromosomal 
location when assignable. 
15 Table 8 shows the number of transmembrane regions, their location(s), and TMPred 

score obtained, for each of the SEQ ID NO: 912-1822 that had a TMPred score of 500 or 
greater, using the TMpred program 

Th11p://www.ch.embnet.org/software/TMPRED form.html) . 

Table 9 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 

20 911, their corresponding polypeptide sequences SEQ ID NO : 9 1 2- 1 822, their corresponding 
priority contig nucleotide sequences SEQ ID NO: 1823-2478, their corresponding priority 
contig polypeptide sequences SEQ ID NO: 2479-3 134, and the US serial number of the 
priority application (all of which are herein incorporated in their entirety), in which the 
contig sequence was filed. 

25 Table 10 is a correlation table of the novel polynucleotide sequences SEQ ID NO: 1- 

911, the novel polypeptide sequences SEQ ID NO: 912-1822, and the US application serial 
number and corresponding SEQ ID NO in which the sequence was previously filed. 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 


^Mixture of 16 
tissues - mRNA 


Various Vendors 


CGdOll 


58 74 147 267 277 432 489 496 618 


^Mixture of 1 6 
tissues - mRNA 


Various Vendors 


CGd012 


3-4 6 9 19-20 41 56-58 62 71-72 74 78 
81 91-92 114 118-119 128 133 141 
147-149 153 155 165-167 176-177 185 
187 195 215 218 220 228-229 237 245 
249 257 259 267 271 277-280 302 3 10 
313 326 353 355 366 368-369 380 385 
391 394 398 405-406 409-410 417 
432 439-440 442 449 452 460 462 464 
471-472 475 480 489 496 498 505 524- 
528 535 546 555 581 585 594 610 618 
636 804 811 816 845 871 


^Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd013 


55 71 78 92 165 228 271 276 279-280 
310 353 355 422 424 496 526 531 606 
618 908 


* Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd015 


10 97 100 194-195 246 258-260 266 
270 282 311 355-357 398 408 423 428- 
430 439 495 526 544 805 


* Mixture of 16 
tissues - mRNA 


Various Vendors 


CGd016 


10 14 20 51 71 75 83 116 139 149 168 
174-175 194-195 209 230 233 238-239 
248 257 266-267 279-280 289 31 1-312 
355 403 420-421 493 503 513 515 532 
543 689 707 909 


* Mixture of 16 
tissues - mRNA 


Various Vendors j 


CGdOlO 


176 185 352 380 386 403 422 425-426 
517 537 606 


adrenal gland 


Clontech 


ADR002 


7 21 36 40 43 64 69 88 101 120 130 
147 157 162-163 169 184 187 195 203 
218 225 230 236 275 283 289 298-299 
317 355 381^-382 392 425 427 430 
446 449 460 471 477 503 510 524 531 
536 540 544 578 622 627 648 668 729 
788-789 


adult bladder 


Invitrogen 


BLD001 


19 33 54 128 195 220 312 355 395 421 
423 446 464 503 543 618 829 


adult brain 


BioChain 


ABR012 


184 246 355 364 


adult brain 


BioChain 


ABR013 


102 226 355 423 432 


adult brain 


GIBCO 


AB3001 


3 87 97 100 236 269 304-306 355 377 
427 497 536 540 


adult brain 


GIBCO 


ABD003 


16 20-21 27 40 59 62 87 94 100 105 
117 120 128-129 134 140 157 184 191- 
192 195 201 234 238-240 245-246 250 
256 260 267-268 270 272 295 304-306 
316 335 351-353 355 362 369 377 384 
399 412 415 419 423 430 440 457-460 
464 470 477 496 498 503 519 536 539- 
540 543 555 589 637 


adult brain 


Invitrogen 


ABR014 


120 199 226 246 248 312 319 355 401 
445 477 503 611 


adult brain 


Invitrogen 


ABR015 


20 27 42 49 100 135 157 186 195 199 
226 246 256 430 432 446 472 484 543 
868 


adult brain 


Invitrogen 


ABR016 


16 100 120 195 226 267 355 459 496 
526 


adult brain 


Clontech 


ABR001 


21 27 40 59 87 101 1 12 128 226 268 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








272 355 364 382 387 428-429 464 503 
517 543 619-620 644 838 847 852 


adult brain 


Clontech 


ABR006 


17 23 30 33 41 43 59 64 88 94 98 120- 
121 124 131 133 154 179 183-184 194 
199 207 214 220 226 231 238-240 250 
256 257 266 278 281 304 312 351 353 
355 365 376 382 392 401 412 415 419 
427 430 433 445 460 464 479 516-517 
519 535-537 540 542-543 546 548 555 
582 728 747-748 754 816 


adult brain 


Clontech 


ABR008 


3 17 23 26 29 31 36 40-41 44 49 51 57 
59 62 78 83 87-88 90 100-102 109 115 
121 124 127-128 130 131 133-136 145- 
146 149 154 158 161-163 167 169-171 
182 184 186 195-196 202 207 21 1 214 
224 227 231 244-246 248 250-25 1 256 
261-262 267 276 281 285-286 296-297 
312 319 337 351 353 355 364-365 368 
372 376 380 383 385 390-391 394 401 
403 412 423 427 431 433 435-436 437 
444_445 447-448 452 457-458 460-461 
465-466 479-481 489 503 506 508 513 
517 526 535 545 577-578 585 592 598 
611 621 624-626 648 674-675 722 816 
833 837 839 847-848 852 857 865-866 
884 


adult brain 


Clontech 


ABR011 


26 285-286 431 


adult brain 


Invitrogen 


ABT004 


19-21 30 35 44 57 59 64 87 128 131 
140 144 157 161 187 198 202 226 230 
250 268 272 293 295 351 355 365 372 
387 395 415 424 427 428 442-443 446 
452 455 457-458 478 498 513 516 524 
540 543 816 831 


adult cervix 


BioChain 


CVX001 


10 22-23 36 41 43 62-63 77 100-101 
105-106 109 120 130 134 137 141 154 
160 162-163 178-180 183-184 187 192 
195 198 215 217 226 240 246 249 260 
262 267 272 285 297 304 310-31 1 316 
334-336 353 355 373-374 377 379-380 
385 395 400 408-409 412 417 425 
431 452 464 468 476-477 482 496-497 
503 508 512 516 540 546 578 592 626 
645 660-662 676 680 888 


adult colon 


Invitrogen 


CLN001 


19 21 57 109 128 130 131 148 168 230 
240 301 364 412 424 440 452 464 469 
543 664 


adult heart 


GIBCO 


AHR001 


3 19-20 22 26 41 45 59 62 67 94 97 99- 
101 106 109 120 124 126 128 134 138 
157 171 183 186-190 195 197-198 202 
205 21 1 215 226-227 238-240 248 249 
261-262 267-268 281 285 311 316 335 
353 373-374 379-380 399 412 419 423 
425 430 433 440 464 477 480 496 498 
503 506 543 546 553 566 570 580 


adult kidney 


GIBCO 


AKD001 


3 17 19-20 25 32 35-37 41 49 57 59 62 
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Tissue Origin 


Library /RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








64 87 94 98-102 105 109 120 128 134- 
137 145 153-154 157-158 160 176 179- 
180 182 184 186 192 195 198 215 226- 
227 234 238-240 246 248 256 257 262 
267 272 278 283 285 297 304 310-311 
316 326 353 355 362 364 372 375 377 
379 381-382 391 395 412 415 421-422 
423-424 430-432 434 437 440 444 452 
459 464 470 477 496 503 506 510 519 
527-528 540 546 551 566 578 585 606 
837 


adult kidney 


Invitrogen 


AKT002 


1 17 20 26 59 62 73 87 94 100-101 120 
133 135 154 160 177 187 195 208 226 
228 238-239 248 249 257 260 262-264 
267-268 276 293 299 353 355 359 362 
380 412 419 421 425 427 431 434 436 
439 452 462 464 498 503 508 515 519 
532 540 543 546 586 590 


adult liver 


Invitrogen 


ALV002 


1 3-4 14-16 19-20 32-33 40 48 54 63 
77 87 94 97 101 128-129 135-136 138- 
139 149 157 180 182 187 193 195-196 
203 226-227 246 248 257 262 277-278 
305-306 316 347 355-357 362 364 383 
391 412 423 427 430 446 477 496 503 
510 525 540 553 629 636 852 


adult liver 


Clontech 


ALV003 


14 24 34 40 94 160 195 227 257 278 
355-357 362 369 424 436 496 527-528 
750 901 


adult lung 


GIBCO 


ALG001 


20 36 40 100 105 120 128 130 154 157 
195 198 263-264 267 273 336 373-374 
420 437 446 464 477 503 543 


adult ovary 


Invitrogen 


AOV001 


1 13 15 17 19-22 36 41 49 51 57 59 62 
68 70-71 77 87-88 97 100-101 105-106 
109 112 116 120 125 128 130 134 144 
157-160 169 176-177 179 181-182 184 
187 191-193 195 198 203 211 213 215 
226 230 236-237 240 246 248 249 
261-264 267-268 271-272 279-280 
288-289 295 298-299 304 3 10 320 335 
355 362 364-365 376-377 380 383-386 
389 391 394-395 399 408 412 415 417 
422 424 434 436 440 446 452 456 459- 
460 477 479 496-497 503 506 510 518- 
519 535-536 540 543-544 546 555 591 
612-614 794 823 827 829-830 833-834 
860 


adult placenta 


Clontech 


APL001 


16 195 256 285 399 421 424 446 464 
477 792 


adult spleen 


Clontech 


SPLcOl 


17 22 36 46 88 91 93 97 103 1 12 135- 
136 142 147 162-163 187 195 256 263- 
264 268 281 303-306 311-312 353 376 
427 433 436 437 452 477 480 506 515 
518 535 537 548 


adult spleen 


GIBCO 


ASP001 


3 10 46 77 81 94 97 100 105 120 129 
134-136 160 166 186-187 195 226 240 
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Tissue Origin 


Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








246 248 256 257 267 303-304 311 316 
355 377 382 389 391 423 427 430 432 
436 452 477 536 540 543 582 586 


adult testis 


GIBCO 


ATS001 


20-21 100-101 105-106 109 128 154 
157 160 169 191-193 195 198 213 248 
256 267 269 295 316 336 355 380 388 
415 452 460 477 496 508 518 540-541 


bone marrow 


GF 


BMD002 


10 17 20 25 36 41 43 45-46 58-59 70- 
71 75 81 86 88 100 106 109 115 120 
124 130 134 136 138 147 154-155 157 
162-163 166-167 182-184 187-190 193 
195-196 198 202 216-217 220 226 231 
233 240 248 256 258-259 267-268 272 
279-280 338-339 359 364 373-374 1 
376-377 380 382 389 391 398 403 412 
415 417 423 427 432 436 437 444 446 
452 460 464 477 480 490 493 497 503 
505 513 517-518 526-528 532 537 540 
543 546 548 551 581-582 597 613 626 
694 697 704 723 729 741-742 744 751 
754-755 768 775 815 822 855 859 874 
885 891 i 


bone marrow 


Clontech 


BMD001 


10 16 20 22 25 36 41 49 59 62 71 81 97 
99-100 120 128 130 134 136 143 157 
179 184 187-190 195 198 207 211 215- 
217 236 247 256 258-260 263-265 267 
269 279-280 282 285 298 304 312 353 
364 373-375 377-378 380 389 398 419 
421-422 423 425 430 432 434 452 
477 480 497 503 506 518-519 536-537 
544 546 551 556 566 569 571-572 602 
604 615 728 815 818 


bone marrow 


Clontech 


BMD004 


259 398 


bone marrow 


Clontech 


BMD007 


36 259 398 


bone marrow 


BMD008 


259 


398 


bone marrow 


null 


STM001 


91 136 166 195 256 362 377 412 423- 
425 477 503 537 540 


CGSP009 




255 




cultured 
preadipocytes 


Stratagene 


ADP001 


9 17 76 100 120 134 136 179 195 226 
246 248 263-264 267 275 281 295 323 
345-346 377 391 423-425 427 430-431 
464 477 503 506 531 543 546 548 592 
701 876 


endothelial cells 


Stratagene 


EDT001 


3-4 10 20 22-23 35-36 41 47 51 57 59 
62 64 87 97 99-102 105-106 108-110 
120 128 134-135 144 153 157 167 177 
179 182-183 186 191 195 204 214-215 
226 240 246 248 251 256 262 267-268 
270 272-273 275 282 285 295 304 353 
355 362 364-365 373-374 376-377 379 
382 385-386 388 391 394 403 412 419 
423-425 427 430-432 440 444-445 477 
497 503 506 508 546 548 566 608 


esophagus 


BioChain 


ESO002 


532 607 


fetal brain 


Invitrogen 


FBT002 


3 20 42 57 64 100 130 134-135 154 



WO 03/054152 PCT/US02/39555 



Table 1 
119 



Tissue Origin 


Library/RNA Source 
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Name 


SEQ ID NOS: 








157 162-163 169 214 226 250 262 
265 267 272 283 294 336 351 355 365 
407 412 423 431 440 444-445 460 470 
477 498-499 526 544 578 610 


fetal brain 


GIBCO 


HFB001 


4 20 22 26-31 36 42 49 59 64 77 85 87 
97 100-102 105 108-109 120 128 134 
157 179 184 186 191-192 195 198-199 
211 226 238-239 246 248 256 261-262 
267-268 285 295 300 310 335 353 355 
365 376-377 379-380 390 415 419 421 
423-424 430 446 448 460 477-478 
489 496 503 506 513 516 536 542 546 
595 793 817 


fetal brain 


Clontech 


FBRs03 


49 


fetal brain 


Clontech 


FBR001 


30 87 100 182 199 268 415 435 516 
578 


fetal brain 


Clontech 


FBR004 


36 214 224 229 376 394 445 460 513 
516 578 611 787 816 861 897 


fetal brain 


Clontech 


FBR006 


3 12 15 20-21 23 25 33-34 36 44 49 58- 
59 62 78 83 88 91 99-100 103 106 124 
128 130 131 134-136 149 154 161 169 
182 184 186-187 193 227 231 236 242 
245 256 261 281 285 292 301 311-312 
320 353 355 360-361 364-365 376 383 
385 401-403 412 422 423-424 427 433 
435 441 444 446 452 457-458 460 465 
471 478-480 486 496 502-503 506 513 
515 517 526 535 537 546 548 566 756- 
757 759-760 784-785 858-859 886 


fetal heart 


Invitrogen 


FHR001 


3 7 12 16 19-20 26 31 84 94 101-102 
106 109 124 135-136 138 142 158 183- 
184 193 195 227 229 236 241 249 265 
267 270 273 277 285 307 310 353 355 
376 380 382 385 387 394 396 408 412 
419 425-426 430-432 444-445 452 
464 477 498 503 506-509 513 526 543 
546 548 592 598 648 761-763 765-766 
776-778 859-860 893 


fetal kidney 


Invitrogen 


FKD007 


26 111 162-163 


fetal kidney 


Clontech 


FKD001 


46 186-187 195 267 335 421 452 


fetal kidney 


Clontech 


FKD002 


12 44 49 71 93 97 100 102 106 128 130 
136 154 162-163 179 183 186 195 227 
267 276 279-280 285 310 316 376 384 
387 389 464 477 480 496 498-499 510 
535 554 582 729 768 894 897 


fetal liver 


Clontech 


FLV002 


20 34 40 68 78 85 93 125 182 194-195 
207 257 259 266 278 297 356-357 383 
398 457-458 477 496 548 767 


fetal liver 


Clontech 


FLV004 


5 12 14 20 24 32-33 40 49 94 100 115 
128 130 135 157 159-160 169 176 179- 
180 195-196 203 224 226-227 229 246 
249 253 257 259 267 278 295 305-306 
355-357 385 398 424 427 437 459 464 
477 496 498 537 546 581 603 64 1 743 
768 
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Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 


fetal liver 


Invitrogen 


FLV001 


20 32-33 36 85 94 128 144 149 154 
157 171 180 182 195 248 278 292 312 
316 355 362 423 430-431 436 441 472 
477 498 540 543 555 


fetal liver- 
spleen 


Columbia University 


FLS001 


2-3 10 12 14-17 19-24 26-27 32-36 39 
41 43 49 51 57 59 62 80 90-94 96-97 
99-100 105 107 109 120 128-130 134 
136 149 153 157-158 162-163 167 176- 
177 179-187 191 194-195 198 211 215 
226-228 230 234 237 240 243 245-246 
248 254-259 262 266-268 270 277- 
278 282-283 285 295 297 310-311 316 
320-326 330-331 335-336 338 353 
355-357 362 370-374 376 380 382 384 
391 394 398 412-413 415 419-421 423 
430 439-440 446 452 456 459-460 464 
472 477 479-480 489 496-499 503 506 
510 515 525-526 534-535 538 540 543- 
544 546 548 555 557-560 562-566 568 
573-574 576 578 591 593 600-601 732- 
735 809-812 814 820 827 


fetal liver- 
spleen 


Columbia University 


FLS002 


2-3 6 10 14-15 17-20 22-24 26 33 37 
41 48-49 56-57 62 69 77 81 85-86 90 
92 94 99-100 104 107-109 114 120 129 
134-135 144 149 157 168-169 174-175 
179-181 185-187 194 211 215 224 227 
230-231 233 237 240 245-246 257-259 
262 266-268 270 272 275 278 298 300 
304-306 311 313 316-317 320 326-336 
352-354 362 364-365 376-377 382-384 
398 41 1-412 415 419 421 425 430 433 
436 439 441 446 452 456 459-460 464 
477 479 491-492 496 499 503 505-506 
508 510 515 519 526 535 539 544 546 
566 575 578 591 593 601 654-657 659 
685 687-691 705 707 709 774 779-780 
809 812 820 833 837 854-856 872 896 


fetal liver- 
spleen 


Columbia University 


FLS003 


14 17 20 26 34 63 86 99 107 167 179 
187 195 211 220 227 231 248 257 275 
278 320-321 328 335 355-357 365 384 
394 4i 1-412 416 423 425-427 430-431 
440 464 486 497 506 510 525 546 558 
566 648 732 735 738-740 746 779-781 
787 809 812 814 837 


fetal lung 


Clontech 


FLG001 


22 49 160 176 195 237 259 267 353 
398 421 430 436 456 815 


fetal lung 


Clontech 


FLG004 


12 41 506 


fetal lung 


Invitrogen 


FLG003 


15 64 115 128 135 142 151 154 186- 
187 193 195 220 236 253 276 355 421 
446 464 506 519 527-528 537 578 626 


fetal muscle 


Invitrogen 


FMS001 


20 94 100 128 138 171 184 195 226 
246 259 267 311 398 412 443 478 498 
531 585 610 


fetal muscle 


Invitrogen 


FMS002 


12 21 38 57 71 86 100-101 104 109 
116 129-130 138 179 195-196 204 253 
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Library/RNA Source 


HYSEQ 
Library 
Name 


SEQ ID NOS: 








285 312-313 336 376 384-385 391 403 
423 427 430 446 452 473 498 507 517 
535 546 598 643 892 


fetal skin 


Invitrogen 


FSK001 


2 6 12 30-31 56 59 62-63 74 88 97 100 
109 115-118 128 134-135 138 141 151- 
152 154 160 168 178 181-182 185 187 
193 195 229 236 248 249 252 267 272- 
273 276 282 301 313-316 320 362 364- 
365 385 391 404-407 410 415 421 423 
427 430 446 452 460 471 473-474 477 
486-489 499 503 512 526-528 537 540 
543 548 551-552 594 598 635 681-682 
813 829 870-871 888 


fetal skin 


Invitrogen 


FSK002 


9 12 17 20 33 46 58 74 76-78 83 103 
118 124 128-129 135 142 162-163 167- 
169 177 193 195 203 229 231 244 249 
254 256 276 3 1 1 3 13-3 14 342 346 363 
372 376 382 391 393-394 410 421 423 
427 433 464 473 477-478 480-481 486- 
487 497-499 511-513 515 518 535 537- 
538 543 589 592 626 701 731 768-769 
771 781 895 898-899 


fetal spleen 


BioChain 


FSP001 


46 256 


induced neuron- 
cells 


Stratagene 


NTD001 


15 20 49 87 109 1 19-120 157 179 195 
208 226 234 238-239 267 282 340-344 
364 494-495 504 506 516 546 597 698- 
699 887 


infant brain 


Columbia University 


IB2003 


21-22 26 31 48 77 82 98-99 103 128 
131 134-135 154 179 198 211 214 230 
234 295 312 339 348-349 351 353 355 
362 365 391 415 423 431 445 452 460 
470 478 497 513 515-516 535 537 540 
543 566 578 639-640 726 816 833 849 


infant brain 


Columbia University 


IBS001 


21 59 77 98 131 154 224 244 371 513 
540 543 578 639 812 


infant brain 


Columbia University 


IBM002 


19 26 77 248 516 


infant brain 


Columbia University 


IB2002 


19 21-22 26-27 31 36 57 62 77 93 97- 
98 120 128-129 131 134 140 149 154 
169 179 182 187 208 214 226 230 234 
237 256 267-268 273 281 284 295 336 
351 353 355 365 371 387 415 421 429 
431 435 445-446 452 460-461 478-479 
496 503 508 513 515 536-537 543 546 
555 578 605 613 639-640 816 826 849 
853 


leukocyte 


GIBCO 


LUC001 


3 20 23 35-36 39 59 62 64 70-71 87-88 
94 100-101 105-106 120 128-130 134- 
137 139 148 154-155 160 165 176 179 
184 186-190 192-193 195-196 198 211 
216-217 225-226 236 240 246 251 256 
257 260 262-264 267-268 270 279-280 
285 298 304 311 353 355 362 375 377 
380 412 415 417 423-426 432 437 
440 444 452 456 460 477 479 497 503 
505-506 508 519 540 543 546 565 582- 
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Name 


SEQ ID NOS: 








585 587-588 596 646 821 


leukocyte 


Clontech 


LUC003 


41 47 59 71 100 144 186-187 195 240 
263-264 267 279-280 298 385 432 437 
446 497 503 


lung tumor 


Invitrogen 


LGT002 


20 35-37 41 43-44 47-49 54-55 59 62 
64 71 77 93-94 100 102 106 120 134 
136 139 154 157 176 186 194-195 198 
211-212 215 218 222 226 234 237-239 
246 249 256 257 260 262-264 267 276 
279-280 289 305-306 311 336 355 362 
364 380 382 385 388 394 396 417 419 
421 425 427 432-433 438 444 446 449 
477 483-485 496-497 503 506 508 527- 
528 540 543 589 606 611 616-617 651 
679 799 828 835 867-869 


lung, fibroblast 


Stratagene 


LFB001 


3-4 16 22 35-36 41 94 100 105 1 16 120 
128 134-135 177 191 195 235 256 267 
353 355 364 377 379 425 459 464 477 
497 503 703 813 829 


lymph node 


Clontech 


ALN001 


10 53 57 81 136 182 195 198 209 246 
263 267 353 355 389 432 503 506 


lymphocyte 


null 


DGD001 


267 311 373-374 412 430-432 460 464 
480 503 536 


lymphocytes 


ATCC 


LPC001 


35-36 41 57 77 99-100 105 115 120 
128 139 148 155 171 176 179 182 187- 
190 192 196 226 232-233 236-237 260 
263-264 267-268 270 379 417 427 450 
452 497-498 514 518 540 547-548 566 
583 587 619 693 851 


macrophage 


Invitrogen 


HMP001 


87 128 177 187 196-197 257 277 355 
362 385 412 415 420 423 446 464 497 
537 543 551 782-783 795 


mammary gland 


Invitrogen 


MMG001 


8 16 20 36 44 56-57 59-60 64 74 77 90 
94 96-97 100 109 120 128 130 134-136 
154 157 160 162-163 168 179 182 187 
193 195 198 211 219-220 226 240 248 
249 251 257 262 267-268 272-273 276 
281 287 300-301 316 355 362 364 372 
383 388 393-394 417 421 423 427 430- 
432 440 446 452 460 477-478 497 503 
506 516 527-528 537 540 543 555 585 
597 680 842-843 


melanoma 
from-cell-line- 
ATCC-#CRL 
Clontech 


MEL004 


3 20 26 41 
59 100 
120 130 
160 195 
223 237 
246 267 
269 275 
305-306 
376 409 
412 419 
423 437 
446 459 
464 477 
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Library/RNA Source 


HYSEQ 
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Name 


SEQ ID NOS: 






486 647 
669 862 




neuronal cells 


Stratagene 


NTU001 


21 49 51 57 59 83 96 120 134 179 195 
230-231 256 262 267 289 351 355 383 
412 423 427 497 503 527-528 727 875 


pituitary gland 


Clontech 


PIT004 


16 19 68 87 100 1 12 128 192 195 246 
262 268 275 355 372 394 416 427 430- 
431 477 503 578 


placenta 


Invitrogen 


APL002 


16 36 57 77 141 187 230 248 262 295 
355 423 430-431 446 496 527-528 555 


placenta 


Clontech 


PLA003 


20 33 70 88 90 109 115 120 124 129 
141 181 193 237 245 249 285 297 316 
355 382 391 403 411 421 446 464 477 
486 497 513 548 772-773 


prostate 


Clontech 


PRT001 


10 20 36 49 85 94 100 135 142 188- 
191 195 198 237 267 282 285 316 320 
353 379 394 421 423 425 437 460 506 
589 631-632 


rectum 


Invitrogen 


REC001 


35 77 120 128 136 153 159 168 182 
256 268 272 301 423 430 494 499 503 
505 537 543-544 552 578 664 


retinoic acid- 

induced- 

neuronal-cells 


Stratagene 


NTR001 


120 130 179 183 193 256 355 460 478 
503 890 


saliva gland 


Clontech 


SALs03 


475 


salivary gland 


Clontech 


SAL001 


19 38 66 134-136 139 191 195 206 240 
246 256 311 355 362 376 382-383 395 
421 432 456 460 464 475 477 494 
497-498 503 527-528 666 790 826 


skeletal muscle 


Clontech 


SKM001 


17 41 100 195 197-198 211 215 236 
238-239 298 302 369 415 443 452 
475 496 503 


skin fibroblast 


ATCC 


SFB001 


49 464 


skin fibroblast 


ATCC 


SFB002 


49 246 267 477 


skin fibroblast 


ATCC 


SFB003 


49 


small intestine 


Clontech 


SIN001 


3 6 10 46 57 59 62 77 90 94 97 100 102 
106 109 112 115 120 123 128 142 162- 
163 168 177 179 183 186 188-190 195 
224 226 240 246 248 262 267-268 275 
289 302 311 316 324 353 355 364 372- 
376 380 382-384 394 408 412 423 425 
430 432 439 442 444 452 459 464 477- 
478 499 503 532 536 540 547-548 626- 
627 633 648 729 736-737 745-746 823 
828 844 


spinal cord 


Clontech 


SPC001 


3 17 27 34 41 57 64 87 100-101 128 
157 183 187 192 195 199 204 226 229 
234 24U 24o 24o 20U-201 203-204 20 / 
304 308-309 3 1 1 355 369 382 384 388 
412 415 419 428-429 434 446 457-460 
478 496 503 519 648 652 


stomach 


Clontech 


STO001 


36 57 104 115 120 134-135 168 195 
267 311 330-331 364 414 423 425 452 
456 460 540 886 


thalamus 


Clontech 


THA002 


19 87 113 121 160 182 195 202 248 
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249 262 272 283 289 355 365 424 427 
451 457-458 460 477 480 503 535-536 
543 634 841 


thymus 


Clontech 


THMc02 


3 12 20 36 58-60 70-72 90 93 97 101- 
102 109 114-115 130 135 147-148 160 
162-163 167 169 171 179 183-184 187 
195-196 217 237 249 259 276 279-280 
303 307 312 320 355 376 383-384 398 
402 419 422 432-433 436 437 444 464 
471 479-480 506 508 513 515 535 537 
550 589 641-642 671-672 787 830 858- 
859 864 


thymus 


Clontech 


THM001 


20 53 59 71 100-101 105-106 110 114 
128 130 134 154 157 179 188-191 195 
215 224 230 240 248 259 267 279 335 
380 382 385 391 398 408 412 427 432 
437 452 459 476 497 543 546 61 1 833 
900 


thyroid gland 


Clontech 


THR001 


3 16 19-20 23 33 35 41 43-44 48-49 57 
59 87 94 100-101 120 133-135 147 149 
157 162-163 179 187 191 193 195 198 
206 209 215 217 229 246-247 256 261- 
262 267 272 276 283 287 297 304-306 
309 312 316 335 353 355 362 364 371- 
374 380 385 394 399 412 419 421 431 
442 444 452-453 460 462 475 477 479 
496-497 503 506 519-520 536 543 546 
548 596 649 653 704 731 812 845 851 


trachea 


Clontech 


TRC001 


100-101 120 130 184 195 267 300 355 
380 396-397 399 415 421 423 464 475 
503 527-528 546 709 725 


umbilical cord 


BioChain 


FUC001 


7 26 36 40 43 46 50-51 57 68 71 77-78 
96 100 102 109 115 120 128 142 158- 
159 172-173 179 184 186 191 195 198 
215 226 228 230 236-237 240 246 250 
267 273 275 279-280 283 287 300 304 
335 351 355 364-365 381 388 394 399 
408 425 430 440 452 459 464 475 
477 503 536-537 540 543 546 576 589 
630 635 669 682 684 904 


uterus 


Clontech 


UTR001 


16 22 64 77 87 100 128 134 195 226 
249 335 366 412 425 427 452 454 464 
546 551 


young liver 


GIBCO 


ALV001 


5 14 20 26 32-33 57 71 87 94 97 100 
102 105 120 128 134 154 157 179 192 
194-195 227 257 266 274 278 326 355- 
357 437 446 498 503 516 532 846 



*The 16 tissue/mRNAs and their vendor sources are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
Normal adult kidney mRNA (Invitrogen), 3) Normal fetal brain mRNA (Invitrogen), 4) Normal adult liver 
mRNA (Invitrogen), 5) Normal fetal kidney mRNA (Invitrogen), 6) Normal fetal liver mRNA (Invitrogen), 7) 
normal fetal skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) Human bone marrow 
mRNA (Clontech), 10) Human leukemia lymphoblastic mRNA (Clontech), 1 1) Human thymus mRNA 
(Clontech), 12) human lymph node mRNA (Clontech), 13) human so\spinal cord mRNA (Clontech), 14) 



WO 03/054152 PCT/US02/39555 

Table 1 
125 

human thyroid mRNA (Clontech), 15) human esophagus mRNA (BioChain), 16) human 

conceptional umbilical cord mRNA (BioChain). 
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912 


gil2803215 


Homo sapiens 


glutathione S-transferase theta 2, 
clone MGC:744 IMAGE:3164017, 
mRNA, complete cds. 


1263 


100 


912 


gi601918 


Homo sapiens 


glutathione S-transferase theta 2 
(GSTT2) mRNA, complete cds. 


1263 


100 


912 


gi9937244 


Homo sapiens 


glutathione S-transferase theta 2 
(GSTT2) and glutathione S- 
transferase theta 1 (GSTT1) genes, 
complete cds. 


1259 


99 


913 


gil3872813 


Homo sapiens 


partial mRNA for fibulin-6 (FIBL- 
6 gene). 


4548 


93 


913 


gil4575679 


Homo sapiens 


hemicentin mRNA, complete cds. 


4542 


93 


913 


gil4041957 


Homo sapiens 


cDNA FLJ14438 fis, clone 
HEMBB 10003 17, weakly similar 
to FIBULIN-1, ISOFORM D 
PRECURSOR. 


2795 


85 


914 


gi 13543645 


Homo sapiens 


D-dopachrome tautomerase, clone 
MGC: 14637 IMAGE:4082777, 
mRNA, complete cds. 


450 


94 


914 


gi 15930 140 


Homo sapiens 


Similar to D-dopachrome 
tautomerase, clone MGC:9158 
IMAGE:3902943, mRNA, 
complete cds. 


450 


94 


914 


gi2352915 


Homo sapiens 


D-dopachrome tautomerase (DDT) 
gene, exon 3 and complete cds. 


450 


94 


915 


gi 13543645 


Homo sapiens 


D-dopachrome tautomerase, clone 
MGC: 14637 IMAGE:4082777, 
mRNA, complete cds. 


537 


86 


915 


gil5930140 


Homo sapiens 


Similar to D-dopachrome 
tautomerase, clone MGC:9158 
IMAGE:3902943, mRNA, 
complete cds. 


537 


86 


915 


gi2352915 


Homo sapiens 


D-dopachrome tautomerase (DDT) 
gene, exon 3 and complete cds. 


537 


86 


916 


gi4190954 


Homo sapiens 


gene for hepatocyte growth factor 
activator, complete cds. 


2089 


100 


916 


gi219681 


Homo sapiens 


Human mRNA for hepatocyte 
growth factor (HGF) activator 
precursor, complete cds. 


2089 


100 


916 


AAR89197 


Homo sapiens 


Human hepatocellular growth 
factor single chain precursor 
protein. 


2089 


100 


917 


gi5441937 


Homo sapiens 


BAC clone CTB-15P3 from 7q22- 
q31.2, complete sequence. 


8964 


100 


917 


AAY15457 


Homo sapiens 


Human laminin beta 4 protein. 


6164 


96 


917 


AAY15459 


Homo sapiens 


SEQ ID5ofWG9919347. 


6046 


97 


918 


gil0998440 


Mus musculus 


EGF-related protein SCUBE1 


2116 


52 


918 


gi8052320 


Mus musculus 


Cegpl protein 


1124 


79 


918 


AAY07735 


Homo sapiens 


Human breast-specific BS200 
protein. 


1125 


79 


919 


gil3543621 


Homo sapiens 


parathyroid hormone-like 
hormone, clone MGC: 1461 1 
IMAGE:4050706, mRNA, 
complete cds. 


905 


100 


919 


gil90712 


Homo sapiens 


Human parathyroid hormone-like 
protein (PLP) gene, exon 4, clones 
lambda-PLPg(l,3,7-2). 


905 


100 
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919 


gil90713 


Homo sapiens 


Human parathyroid hormone-like 
protein (PLP) gene, exon 5, clones 
lambda-PLPg(l,3,7-2). 


905 


100 


920 


gi3449294 


Rattus 
norvegicus 


MEGF6 


2229 


78 


920 


AAY72715 


Homo sapiens 


HFICU08 clone human attractin- 
like protein. 


554 


43 


920 


AAG75479 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6243. 


518 


43 


921 


gi 12966 10 


Homo sapiens 


H. sapiens mRNA for chemokine 
CC-2and CC-1. 


346 


100 


921 


gi 1004267 


Homo sapiens 


H. sapiens gene for chemokine 
HCC-1. 


346 


100 


921 


gi 1004269 


Homo sapiens 


H.sapiens mRNA for chemokine 
HCC-1. 


346 


100 


922 


gi35330 


Homo sapiens 


H. sapiens mRNA for 
procarboxypeptidase Al. 


1126 


61 


922 


AAR97618 


Homo sapiens 


Human carboxypeptidase Al. 


1126 


61 


922 


AAY28915 


Homo sapiens 


Human regulatory protein HRGP- 
1. 


1126 


61 


923 


gi790817 


Homo sapiens 


Human microfibril-associated 
glycoprotein 4 (MFAP4) mRNA, 
3' end of cds. 


1198 


99 


923 


AAY42563 


Homo sapiens 


Human microfibril-associated 
glycoprotein 4 splice variant 
(MAG4V). 


1197 


100 


923 


AAY85177 


Homo sapiens 


Microfibril associated glycoprotein 
4 splice variant protein sequence. 


1197 


100 


924 


AAY06940 


Homo sapiens 


Human secretory protein ZSIG-1 1. 


1648 


100 


924 


AAY66650 


Homo sapiens 


Membrane-bound protein PR0536. 


1648 


100 


924 


AAY50944 


Homo sapiens 


Human adult heart cDNA clone 
vf 11 derived protein. 


1648 


100 


925 


gil3528981 


Homo sapiens 


apolipoprotein A-II, clone 
MGC: 12334 IMAGE: 3 934476, 
mRNA, complete cds. 


438 


90 


925 


gi28748 


Homo sapiens 


Human mRNA for apolipoprotein 
All precursor. 


438 


90 


925 


gi296633 


Homo sapiens 


Human DNA for apolipoprotein A- 
II. 


438 


90 


926 


AAY76156 


Homo sapiens 


Human secreted protein encoded 
by gene 33. 


419 


90 


926 


AAE04850 


Homo sapiens 


Human SGP014 phosphatase 
polypeptide related exon 4. 


63 


33 


926 


gi599904 


Albinaria turrita 


ATPase subunit 8 


40 


39 


927 


gil3097252 


Homo sapiens 


Similar to FK506 binding protein 
2 (13 kDa), clone MGC:5177 
IMAGE:3445148, mRNA, 
complete cds. 


640 


91 


00^7 

yZ 1 


gi337370 


Homo sapiens 


Human rapamycin- and FK506- 
binding protein, complete cds. 


640 


91 


927 


AAQ31004_ 
aal 


Homo sapiens 


hRFKBP cDNA. 


636 


90 


928 


AAB53360 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID NO:900. 


230 


95 


928 


AAG73789 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:4553. 


230 


95 
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928 


AAY86330 


Homo sapiens 


Human secreted protein 
HKGC027, SEQ ID NO:245. 


60 


66 


929 


gil91613 


Mus musculus 


acetylcholine receptor 


55 


43 


929 


AAY07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


55 


40 


930 


gil5157181 


Agrobacterium 
tumefaciens 


AGR_C_3718p 


1148 


50 


930 


gil4022240 


Mesorhizobium 
loti 


probable D-lactate dehydrogenase 


1106 


47 




gil5075160 


Sinorhizobium 
meliloti 


PUTATIVE D-LACTATE 
DEHYDROGENASE 
(CYTOCHROME) PROTEIN 


1097 


48 


QQ 1 

yj i 


AAWo4d4o 


Homo sapiens 


Human epidermoid carcinoma cell 
line KB clone HP 10 179 protein. 


231 


100 


yj i 


AAW78146 


Homo sapiens 


Human secreted protein encoded 
by gene 21 clone HWTAZ75. 


231 


100 


yj i 


gi71U677o 


Homo sapiens 


HSPC194 


151 


75 


932 


AAY24793 


Homo sapiens 


Human secreted protein yc2 1 . 


1107 


99 


932 


gi3858883 


Acanthamoeba 
castellanii 


myosin I heavy chain kinase 


143 


30 


932 


gil3751817 


Leishmania 
major 


C2 domain protein 


126 


32 


933 


gil5028816 


Homo sapiens 


mRNA for beta- 1,3- 
galactosyltransferase b3Gal-T8. 


172 


28 


933 


gi8927164 


Mus musculus 


corel UDP-galactose:N- 
acetylgalactosamine-alpha-R beta 
1 ,3-galactosyl transferase 


173 


28 


933 


gi8927166 


Rattus 
norvegicus 


corel UDP-galactose:N- 
acetylgalactosamine-alpha-R beta 
1 ,3-galactosyltransferase 


173 


26 


934 


AAY73383 


Homo sapiens 


HTRM clone 2280456 protein 
sequence. 


1571 


100 




gi 15929 192 


Homo sapiens 


clone MGC:9522 
IMAGE:3909690, mRNA, 
complete cds. 


1487 


99 


yj4 


AAG75282 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6046. 


619 


99 


935 


gil5705411 


Homo sapiens 


peptidoglycan recognition protein 
L precursor (PGLYRP) mRNA, 
complete cds. 


3041 


99 


935 


AAY72664 


Homo sapiens 


Murine peptidoglycan recognition 
protein-related liver protein. 


1484 


76 


935 


gi6651239 


Mus musculus 


TAGL-alpha 


1471 


76 


936 


AAB24450 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 14 SEQ ID 
NO:75. 


361 


97 


936 


AAB49409 


Homo sapiens 


hCRFl-TM8 construct peptide. 


39 


69 


936 


gil2619689 


Conus 
ventricosus 


conotoxin scaffold WVII 
precursor 


58 


21 


yo / 


gllzc5U43 lo 


Homo sapiens 


tyrosine 3- 

monooxygenase/tryptophan 5- 
monooxygenase activation protein, 
eta polypeptide, clone MGC:675 
IMAGE:3543571, mRNA, 
complete cds. 


1094 


94 


937 


gil711232 


Homo sapiens 


Human DNA for 14-3-3 protein eta 
chain, exon2 and complete cds. 


1094 


94 
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937 


gi531590 


Homo sapiens 


H. sapiens gene for 14-3-3 protein. 


1094 


94 


938 


gil2804681 


Homo sapiens 


SI 00 calcium-binding protein, 
beta (neural), clone MGC:1323 
IMAGE:3543825, mRNA, 
complete cds. 


479 


100 


938 


gi337730 


Homo sapiens 


Human SI 00 protein beta-subunit 
gene, exon 3. 


479 


100 


938 


gil61 18441 


Oryctolagus 
cuniculus 


S-100 calcium-binding protein beta 
subunit 


479 


100 


939 


AAW75082 


Homo sapiens 


Human secreted protein encoded 
by gene 26 clone HTLEV12. 


392 


92 


939 


AAG00461 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4542. 


72 


50 


939 


AAY86322 


Homo sapiens 


Human secreted protein 
HTLFG05, SEQ ID NO:237. 


72 


50 


941 


gi9280025 


Macaca 
fascicularis 


Nogo receptor 


804 


49 


941 


gil5080005 


Homo sapiens 


nogo receptor, clone MGC:1983 1 
IMAGE:4040540, mRNA, 
complete cds. 


792 


49 


941 


gil 2407653 


Homo sapiens 


Nogo receptor mRNA, complete 
cds. 


792 


49 


942 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID 
NO:63. 


768 


98 


942 


gil 4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene, clone 
MGC: 19636 IMAGE: 2 8223 23, 
mRNA, complete cds. 


238 


90 


942 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


238 


90 


943 


AAY85678 


Homo sapiens 


Human kidney disease associated 
protein SEQ ID 10. 


751 


98 


943 


gi3127193 


Rattus 
norvegicus 


kidney-specific protein 


686 


75 


943 


gi50 19275 


Bos taurus 


xenobiotic/medium-chain fatty 
acid:CoA ligase form XL-III 


474 


54 


944 


gil 197499 


Homo sapiens 


H. sapiens gene for CI inhibitor 
exon 2 (and joined CDS). 


2527 


100 


944 


gi29535 


Homo sapiens 


Human gene for CI -inhibitor. 


2527 


100 


944 


gil5029894 


Homo sapiens 


serine (or cysteine) proteinase 
inhibitor, clade G (CI inhibitor), 
member 1, clone MGC: 17091 ( 
IMAGE:4150091, mRNA, 
complete cds. 


2524 


99 


945 


gil5157854 


Agrobacterium 
tumefaciens 


AGR_C_4799p 


218 


38 


945 


gil 5076099 


Sinorhizobium 
meliloti 


HYPOTHETICAL PROTEIN 


233 


36 


945 


gi9951121 


Pseudomonas 
aeruginosa 


ribosomal protein LI 1 
methyltransferase 


116 


38 


946 


AAB03948 


Homo sapiens 


Human mesenchymal stem cell 
polypeptide. 


462 


98 


946 


AAB64909 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID 
NO:87. 


166 


63 


946 gil531983 


Homo sapiens 


H. sapiens mRNA for CC- 


77 


31 
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chemokine, eotaxin variant (clone 
53). 






947 


gil3958036 


Homo sapiens 


FYVE-finger protein EIP1 mRNA, 
complete cds. 


2917 


97 


947 


AAY29861 


Homo sapiens 


Human secreted protein clone 
cb98 4. 


2910 


96 


QAl 

?H 1 


gli 1 DDoU44 


Mus musculus 


TT\7"T 7T"» jy a ' • . • 

FYVE-finger containing protein 


2752 


91 


948 


AAW75000 


Homo sapiens 


Human secreted protein encoded 
by gene 146 clone HSNAK17. 


49 


29 


948 


gil0176829 


Arabidopsis 
thaliana 


gene_id:MBB18.16~ 


79 


32 


c\A n 
yny 


gi 13279266 


Homo sapiens 


clone MGC: 10946 
IMAGE:3631700, mRNA, 
complete cds. 


466 


100 


949 


AAG81435 


Homo sapiens 
. 


Human AFP protein sequence SEQ 
ID NO:388. 


466 


100 




a a undone 


Homo sapiens 


Human gene 7 encoded secreted 
protein HNTDL21, SEQ ID 
NO:58. 


466 


100 


Z2 *J \J 


A A VI £787 


Homo sapiens 


Human secreted protein (clone 
dy41 2). 


557 


100 




glOOoZo 1 o 


Sus scrofa 


1 _ 1*1 • J* IT" T^TT j 

lectm-hke oxidized LDL receptor- 
1 


235 


29 


950 


AAW52837 


Homo sapiens 


Human C-type lectin MCTL. 


206 


34 


yj 1 


AAY41720 


Homo sapiens 


Human PR0792 protein sequence. 


1140 


87 


951 


AAB44276 


Homo sapiens 


Human PRQ792 (UNQ431) 
protein sequence SEQ ID NO:231. 


1140 


87 


951 


AAB24055 


Homo sapiens 


Human PR0792 protein sequence 
SEQIDNO:31. 


1140 


87 


yjZ 


^.l 1 ^ C~l f\'~l A Z" 

gi7670746 


Homo sapiens 


UDP-glucose : glycoprotein 
glucosyltransferase 1 precursor, 
mRNA, complete cds. 


7968 


98 


yDZ 


gll3275621 


synthetic 
construct 


Rat RUGT 


7313 


91 


yDZ 


gi7677176 


Rattus 
norvegicus 


UDP-glucose 

glycoproteimglucosyltransferase 
precursor 


7313 


91 


yjj 


A A D^/nnn 

AABo439Q 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA22. 


409 


51 


953 


AAG00975 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5056. 


212 


46 


953 


gi2623224 


Mus musculus 


FK-506 binding protein homolog 


118 


30 




gll / /oUol 


Saccharomyces 
cerevisiae 


transcription/repair factor TFIIH 
subunit Tfb3 


83 


31 


954 


gi927727 


Saccharomyces 
cerevisiae 


Tfb3p: TFIIH subunit Tfb3; 
YDR460W 


83 


31 


954 


gi3372804 


Rattus 
norvep"ici]«j 


focal adhesion kinase-related 

JJ1VJLC1I1 


65 


25 


955 


AAG81399 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:316. 


1910 


100 


955 


AAB61421 


Homo sapiens 


Human TANGO 300 protein. 


1904 


99 


955 


AAB23618 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 36. 


1902 


99 


956 


gi296532 


Homo sapiens 


H. sapiens mRNA for I beta 1-6 N- 
acetylglucosaminyltransferase. 


1043 


66 
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956 


gil315909 


Homo sapiens 


Human beta- 1 ,6~N- 
acetylglucosaminyltransferase 
(IGnT) gene, exon 3, complete cds. 


1043 


66 


956 


gi307298 


Homo sapiens 


Human I beta 1-6 N- 
acetylglucosaminyltransferase 
mRNA, complete cds. 


1043 


66 


957 


gi2065165 


Homo sapiens 


H. sapiens mRNA for extracellular 
matrix protein collagen type XIV, 
N-terminus. 


776 


93 


957 


AAG00322 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4403. 


482 


98 


957 


gi288875 


Gallus gallus 


collagen XIV 


467 


68 


958 


gil5530201 


Homo sapiens 


Similar to matrix 
metalloproteinase 1 (interstitial 
collagenase), clone MGC: 10479 
IMAGE:3834572, mRNA, 
complete cds. 


2381 


93 


958 


gi30126 


Homo sapiens 


H.sapiens mRNA for type I 
interstitial collagenase. 


2381 


93 


958 


gil688258 


Homo sapiens 


Human collagenase and 
stromelysin genes, complete cds, 
and metalloelastase gene, partial 
cds. 


2381 


93 


959 


AAY25868 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 57. 


993 


99 


959 


gi4008584 


Rattus 
norvegicus 


huntingtin 


61 


66 


959 


gi6006615 


Saimiri sciureus 


Alanine Glyoxylate 
Aminotransferase 


71 


31 


960 


AAB58288 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 626. 


474 


85 


960 


gil3378304 


Homo sapiens 


dynein-associated protein HKM23 
(km23) mRNA, complete cds. 


473 


98 


960 


gil2803327 


Homo sapiens 


HSPC162 protein, clone 
MGC:773 IMAGE: 3347555, 
mRNA, complete cds. 


473 


98 


961 


gi930078 


Homo sapiens 


Human Koxl6 mRNA for zinc 
finger protein, partial. 


56 


34 


961 


AAE01364 


Homo sapiens 


Human gene 13 encoded secreted 
protein HDPIW06, SEQ ID 
NO:86. 


54 


43 


961 


AAE01398 


Homo sapiens 


Human gene 13 encoded secreted 
protein HDPIW06, SEQ ID 
NO: 120. 


54 


43 


962 


gi9956936 


Mus mus cuius 


Su(var)3-9 homolog Suv39h2 


1313 


84 


962 


gi 10440094 


Homo sapiens 


cDNA: FLJ23414 fis, clone 
HEP20704. 


1236 


100 


962 


gi 13623277 


Homo sapiens 


suppressor of variegation 3-9 
(Drosophila) homolog 1, clone 
MGC: 10376 IMAGE: 3 945 632, 
mRNA, complete cds. 


821 


55 


963 


gi 12654023 


Homo sapiens 


NADH dehydrogenase 
(ubiquinone) 1 alpha subcomplex, 
5 (13kD, B13), clone MGC:5042 
IMAGE:3453236, mRNA, 
complete cds. 


432 


90 


963 


gil373173 


Homo sapiens 


NADH: ubiquinone oxidoreductase 


432 


90 
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subunit B13 (B13) mRNA, 
complete cds. 






963 


gil698653 


Homo sapiens 


Human NADHiubiquinone 
oxidoreductase subunit B 13 
mRNA, complete cds. 


432 


90 


964 


gi37347 


Homo sapiens 


Human mRNA for T-cell 
rearranging gamma gene (TRG) 
V(g)8-J(g)2-C(g)2. 


1645 


94 


964 


gi339407 


Homo sapiens 


Human T-cell receptor Ti 
rearranged gamma-chain mRNA 
V-J-C region, complete cds. 


1617 


93 


964 


gi37018 


Homo sapiens 


Human mRNA for T-cell receptor 
gamma-chain. 


1439 


87 


965 


AAE06606 


Homo sapiens 


Human protein having 
hydrophobic domain, HP 10794. 


566 


96 


965 


gi6996628 


Triticum 
aestivum 


phenylalanine ammonia lyase 


72 


46 


965 


gi 12667442 


Rattus 
norvegicus 


synaptotagmin VIIT1 


75 


34 


966 


gil3654639 


Bos taurus 


D-glucuronyl C5 epimerase 


3165 


97 


966 


gi 13442978 


Mus musculus 


D-glucuronyl C5~epimerase 


3142 


95 


966 


gil 1935177 


Mus musculus 


heparin/heparan sulfate: glucuronic 
acid C5 epimerase 


3137 


95 


967 


AAG71875 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1556. 


709 


86 


967 


AAG71443 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 124. 


692 


79 


967 


AAG71816 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1497. 


684 


82 


968 


gil2655133 


Homo sapiens 


Similar to CGI-63 protein, clone 
MGC:708 IMAGE: 3 139846, 
mRNA, complete cds. 


1918 


100 


968 


AAB73679 


Homo sapiens 


Human oxidoreductase protein 
ORP-12. 


1918 


100 


968 


gi4929595 


Homo sapiens 


CGI-63 protein mRNA, complete 
cds. 


1874 


98 


969 


gi6062874 


Homo sapiens 


candidate tumor suppressor protein 
DICE1 mRNA, complete cds. 


1313 


81 


969 


AAY 15344 


Homo sapiens 


Tumour suppressor protein del-27. 


1313 


81 


969 


AAY28995 


Homo sapiens 


Tumour suppressor Del-27 protein 
sequence. 


1313 


81 


970 


gi387011 


Homo sapiens 


Human pyruvate dehydrogenase 
El -alpha subunit mRNA, cds. 


2187 


99 


970 


gil2803199 


Homo sapiens 


pyruvate dehydrogenase 
(lipoamide) alpha 1, clone 
MGC:8609 IMAGE:2961286, 
mRNA, complete cds. 


2049 


100 


970 


gi35379 


Homo sapiens 


Human mRNA for brain pyruvate 
dehydrogenase (EC 1.2.4.1). 


2049 


100 


971 


gi2275569 


Homo sapiens 


T cell receptor beta locus, 
TCRBV6S4A1 to TCRBV8S1 
region. 


660 


99 


971 


gi2 104755 


Homo sapiens 


T cell receptor V-beta 23 
(TCRBV) gene, partial cds. 


660 


99 


971 


g i495404 


Pan troglodytes 


T cell receptor beta chain 


657 


88 


972 


AAG89141 


Homo sapiens 


Human secreted protein, SEQ ID 


1272 


100 
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NO: 261. 






972 


gil2053273 


Homo sapiens 


mRNA; cDNA DKFZp434K1172 
(from clone DKFZp434Kl 172); 
complete cds. 


1271 


99 


972 


gi292836 


Homo sapiens 


Human trichohyalin (TRHY) gene, 
complete cds. 


146 


29 


973 


gi 12654633 


Homo sapiens 


protein inhibitor of activated 
STAT3, clone MGQ1417 
IMAGE:3528679, mRNA, 
complete cds. 


2708 


96 


973 


gi4996563 


Homo sapiens 


PIAS3 mRNA for protein inhibitor 
of activated STAT3, complete cds. 


2708 


96 


973 


AAE02937 


Homo sapiens 


Human TFRP protein. 


2708 


96 


974 


gil 5099957 


Homo sapiens 


diacylglycerol acyltransferase 2- 
like protein mRNA, complete cds. 


856 


79 


974 


gil 5099955 


Mus musculus 


diacylglycerol acyltransferase 2- 
like protein 


676 


59 


974 


AAY94889 


Homo sapiens 


Human protein clone HP02485. 


578 


56 


975 


gi7022243 


Homo sapiens 


cDNA FU 10300 fis, clone 
NT2RM2000030. 


1432 


100 


975 


AAB92669 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11033. 


1432 


100 


975 


gi3834427 


Drosophila 
melanogaster 


cytoplasmic dynein intermediate 
chain isoform DIClc 


134 


24 


976 


gil2803965 


Homo sapiens 


clone MGC:4294 
IMAGE:3636069, mRNA, 
complete cds. 


519 


100 


976 


gil5919713 


Homo sapiens 


isolate NSB11-3-K3-A10 
immunoglobulin kappa chain 
variable region gene, partial cds. 


62 


32 


976 


gil5919629 


Homo sapiens 


isolate NSB1 1-4-K3-B6 
immunoglobulin kappa chain 
variable region gene, partial cds. 


60 


33 


977 


gil 209685 


Homo sapiens 


Human salivary peroxidase 
mRNA, complete cds. 


3408 


99 


977 


gil 63307 


Bos taurus 


lactoperoxidase 


2906 


77 


977 


gil 1990122 


Camelus 
dromedarius 


peroxidase 


2882 


84 


978 


AAY01603 


Homo sapiens 


Amino acid sequence of the human 
defensin (Def-X) protein. 


501 


97 


978 


gi29735 


Homo sapiens 


H.sapiens mRNA for corticostatin 
HP-4 precursor. 


214 


46 


978 


gi665927 


Homo sapiens 


Human corticostatin/defensin HP-4 
precursor gene, complete cds. 


214 


46 


979 


AAG81415 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:348. 


848 


100 


979 


gil 1559416 


Oryctolagus 
cuniculus 


NADPH-dependent retinol 
dehydrogenase/reductase 


75 


31 


980 


AAR94422 


Homo sapiens 


Bactencidal/permeabihty 
increasing peptide (BPI.245). 


63 


JO 


980 


AAR76424 


Homo sapiens 


Bacterial permeability-increasing 
peptide BPI.245. 


63 


56 


980 


AAW63485 


Homo sapiens 


Human BPI protein derived 
peptide XMP.245. 


63 


56 


981 


gi4309953 


Homo sapiens 


BAC clone RP1 1-121A8 from 
7pl4-pl3, complete sequence. 


951 


95 
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981 


gi296680 


Homo sapiens 


Human germline TCR-gamma 
gene variable region V-gamma 5. 


570 


85 


981 


gi4309952 


Homo sapiens 


BAG clone RP1 1-121A8 from 
7pl4-pl3, complete sequence. 


544 


68 


982 


gil223888 


synthetic 
construct 


T cell receptor alpha chain 


1070 


79 


982 


gi338766 


Homo sapiens 


Human T-cell receptor rearranged 
alpha-chain V-region (V-D-J) 
mRNA, complete cds. 


994 


73 


982 


gi3089419 


Homo sapiens 


SSC1 1 rearranged T cell receptor 
alpha chain (TCRAV17) gene, 
complete cds. 


976 


71 


983 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
0610008P16 gene, clone 
MGC: 15937 IMAGE: 3 53 7224, 
mRNA, complete cds. 


309 


46 


983 


AAB73512 


Homo sapiens 


Human transferase HTFS-19, SEQ 
ID NO: 19. 


309 


46 


983 


gi2828262 


Bos taurus 


aralkyl acyl-CoA: amino acid N- 
acyltransferase 


285 


42 


984 


AAG71251 


Homo sapiens 


Human gene 9-encoded secreted 
protein HMSDL37, SEQ ID 
NO:99. 


311 


90 


984 


AAG71286 


Homo sapiens 


Human gene 9-encoded secreted 
protein HMSDL37, SEQ ID 
NO:135. 


311 


90 


984 


gil 3096922 


Mus musculus 


Similar to nadrin 


76 


44 


985 


gi45 19541 


Mus musculus 


thrombospondin type 1 domain 


1299 


88 


985 


gil3625176 


Homo sapiens 


clone 1 thrombospondin mRNA, 
complete cds. 


646 


46 


985 


AAW85607 


Homo sapiens 


Secreted protein clone da228_6. 


646 


46 


986 


AAG81417 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:352. 


287 


100 


986 


gil76558 


Aotus trivirgatus 


involucrin (large allele) 


53 


40 


986 


AAG77243 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO: 8009. 


36 


70 


987 


gi2104856 


Rattus 
norvegicus 


platelet glycoprotein V 


551 


36 


987 


gi6449037 


Mus musculus 


platelet glycoprotein V 


550 


36 


987 


gi3 12502 


Homo sapiens 


H. sapiens GPV gene encoding 
platelet glycoprotein V precursor. 


486 


34 


988 


gil2803851 


Homo sapiens 


nucleotide binding protein 2 
(E.coli MinD like), clone 
MGC-.3473 IMAGE:3633393, 
mRNA, complete cds. 


1385 


100 


988 


gil4124958 


Homo sapiens 


nucleotide binding protein 2 
(E.coli MinD like), clone 
MGC: 15834 IMAGE: 3 507768, 
mRNA, complete cds. 


1385 


100 


988 


AAB59022 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 730, 


1385 


100 


989 


gi9864185 


Drosophila 
melanogaster 


Crossveinless 2 


1031 


35 


989 


gi7768636 


Xenopus laevis 


Kielin 


911 


31 


989 


gi9887910 


Oryctolagus 
cuniculus 


zonadhesin precursor 


540 


31 
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990 


gil3161011 


Homo sapiens 


sclerostin gene, complete cds. 


417 


92 


990 


gil3161020 


Homo sapiens 


sclerostin mRNA, complete cds. 


417 


92 


990 


gil3236418 


Homo sapiens 


SOST (SOST) mRNA, complete 
cds. 


417 


92 


991 


gi335148 


Tacaribe virus 


P-ll protein 


69 


39 


991 


gi5596437 


Petunia x 
hybrida 


Stigl 


58 


30 


992 


gi4490538 


Homo sapiens 


Human U266 rearranged DNA for 
lambda-immunoglobulin light 
chain. 


933 


81 


992 


gi33746 


Homo sapiens 


Human rearranged 
immunoglobulin lambda light 
chain mRNA. 


920 


80 


992 


AAB36212 


Homo sapiens 


Human immune system associated 
protein HISAP-10. 


913 


81 


993 


AAG03466 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7547. 


104 


31 


993 


gi259589 


Cercopithecine 
herpesvirus 1 


glycoprotein J; gJ 


88 


42 


993 


gi35 14059 


Cercopithecine 
herpesvirus 1 


glycoprotein gJ 


87 


42 


994 


gi203246 


Rattus 
norvegicus 


cell adhesion-like molecule 


1739 


97 


994 


gi5 14374 


Homo sapiens 


Human (clone pHOM) opioid- 
binding cell adhesion molecule 
mRNA, complete cds. 


1666 


94 


994 


gi586 


Bos taurus 


put. pre-OPCAM (AA 1 - 345) 


1643 


93 


995 


AAB88408 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0164. 


1358 


92 


995 


AAY13392 


Homo sapiens 


Amino acid sequence of protein 
PR0328. 


1355 


100 


995 


AAB01373 


Homo sapiens 


Neuron-associated protein. 


1355 


100 


996 


AAB85144 


Homo sapiens 


Human NKCR polypeptide (clone 
ID HMSOM53). 


982 


89 


996 


gi31332 


Homo sapiens 


Human mRNA for high affinity Fc 
receptor (FcRI). 


364 


42 


996 


AAY96226 


Homo sapiens 


Human high affinity Fc receptor, 
FcgammaRL 


364 


42 


997 


gil4165486 


Homo sapiens 


clone MGC: 15041 
IMAGE:3831657, mRNA, 
complete cds. 


70 


37 


997 


gi9909361 


Homo sapiens 


sphingosine kinase (SPHK) 
mRNA, complete cds. 


70 


37 


997 


gi8 132868 


Homo sapiens 


sphingosine kinase- 1 mRNA, 
complete cds. 


70 


37 


998 


AAB56802 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1380. 


417 


100 


998 


gi2983321 


Aquifex aeolicus 


cytochrome c-type biogenesis 
protein 


61 


36 


998 


gi6941880 


Escherichia coli ! 


flagellin 


63 


35 


999 


AAB08732 ; 


Homo sapiens 


Amino acid sequence of a human 
OLD-35 polypeptide. 


1375 


87 


999 


gil4042110 


Homo sapiens 


cDNA FLJ14531 fis, clone 
NT2RM2000371, weakly similar 
to POLYRIBONUCLEOTIDE 


1265 


88 
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NUCLEOTIDYLTRANSFERASE 
(EC 2.7.7.8). 






999 


AAB92684 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11065. 


1265 


88 


1000 


gil99582 


Mus musculus 


B(2)-microglobulin 


616 


95 


1000 


gi50105 


Mus musculus 


beta2-microglobulin precursor (aa 
-20 to 99) 


614 


95 


1000 


gil99576 


Mus musculus 


B(2)~microglobulin 


609 


94 


1001 


gi7023273 


Homo sapiens 


cDNA FLJ10922 fis, clone 
OVARC1 000420. 


672 


99 


1001 


AAB93335 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12441. 


672 


99 


1001 


AAM06337 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
68. 


328 


75 


1002 


gil 5559608 


Homo sapiens 


Similar to zinc finger protein 16 
(KOX 9), clone MGC:20886 
IMAGE:4549240, mRNA, 
complete cds. 


2079 


100 


1002 


gi55471 


Mus musculus 


Zfp-29 


725 


59 


1002 


gi6409345 


Homo sapiens 


zinc finger protein ZNF1 80 
(ZNF180) mRNA, complete cds. 


730 


54 


1003 


gi488555 


Homo sapiens 


Human zinc finger protein ZNF135 
mRNA, complete cds. 


1664 


60 


1003 


gil 769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


1633 


49 


1003 


gil0436789 


Homo sapiens 


cDNA FLJ14345 fis, clone 
THYRO1001 189, weakly similar 
to ZINC FINGER PROTEIN 91. 


1619 


57 


1005 


AAB23641 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 97. 


609 


100 


1005 


AAE01594 


Homo sapiens 


Human gene 12 encoded secreted 
protein HHSGW69, SEQ ID 
NO: 144. 


93 


35 


1005 


gi5668598 


Homo sapiens 


Wiskott-Aldrich syndrome protein 
interacting protein (WASPIP) 
mRNA, partial cds. 


92 


35 


1006 


gil 1493473 


Homo sapiens 


PR02225 


163 


67 


1006 


gi9654986 


Vibrio cholerae 


autoinducer-2 production protein 


64 


33 


1006 


AAG02328 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6409. 


58 


45 


1008 


AAM00955 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 431. 


704 


100 


1008 


gi4902661 


Homo sapiens 


Novel human gene mapping to 
chomosome 22. 


635 


100 


1008 


gil2052896 


Homo sapiens 


mRNA; cDNA DKFZp564F1978 
(from clone DKFZp564F1978); 
complete cds. 


635 


100 


1009 


AAE01420 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO: 144. 


612 


100 


1009 


gi4028545 


Homo sapiens 


LIM domain only 7 (LM07) gene, 
exon T and alternative splice 
products, partial cds. 


65 


44 


1009 


gil 620061 


Paramecium 
bursaria 
Chlorella virus 1 


a389R 


66 


46 
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1010 


gi9408382 


Raja eglanteria 


eos 


79 


26 


1010 


gi2062744 


Xenopus i 
laevis/gilli 


Ikaros homolog 


63 


27 


1010 


gil 199604 


Homo sapiens 


Human zinc finger protein C2H2- 
25 mRNA, complete cds. 


97 


27 


1011 


gil3097207 


Homo sapiens 


ribosomal protein, large, PI, clone 
MGC:5215 IMAGE:2900S46, 
mRNA, complete cds. 


332 


100 


1011 


gil 4043204 


Homo sapiens 


ribosomal protein, large, PI, clone 
MGC: 15616 IMAGE:3343021, 
mRNA, complete cds. 


332 


100 


1011 


gil 90234 


Homo sapiens 


Human acidic ribosomal 
phosphoprotein PI mRNA, 
complete cds. 


332 


100 


1012 


AAW88457 


Homo sapiens 


Human lysophospholipase IHLP. 


1113 


93 


1012 


AAY64648 


Homo sapiens 


Human lysophospholipase 
homology protein. 


825 


100 


1012 


gil0303289 


Neurospora 
crass a 


related to lysophospholipase 


201 


34 


1013 


g i 140434 17 


Homo sapiens 


clone IMAGE:3953868, mRNA, 
partial cds. 


919 


98 


1013 


gil5080096 


Homo sapiens 


clone MGC:20451 
IMAGE:3830864, mRNA, 
complete cds. 


919 


98 


1013 


AAG81374 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:266. 


919 


98 


1014 


gil3543427 


Homo sapiens 


Similar to RIKEN cDNA 
5730469M10 gene, clone 
MGC:4248 IMAGE:30 10078, 
mRNA, complete cds. 


191 


36 


1014 


AAW67858 


Homo sapiens 


Human secreted protein encoded 
by gene 52 clone HAUCC47. 


191 


36 


1014 


AAY66746 


Homo sapiens 


Membrane-bound protein 
PROH98. 


191 


36 


1015 


AAB56791 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO-.1369. 


496 


98 


1015 


gi2984030 


Aquifex aeolicus 


phosphofructokinase 


70 


25 1 


1015 


gil388150 


Haemophilus 
ducreyi 


ribosomal protein L31 


51 


45 


1016 


gi29509 


Homo sapiens 


Human BTG1 mRNA. 


655 


100 


1016 


gi50188 


Mus musculus 


btgl 


655 


100 


1016 


gi293306 


Mus musculus 


B-cell translocation gene-1 protein 


655 


100 


1017 


gil 3623633 


Homo sapiens 


clone MGC:13105 
IMAGE:3957973, mRNA, 
complete cds. 


2984 


100 


1017 


AAB81188 


Homo sapiens 


Human zinc finger protein 52 
(ZFP-52). 


2389 


98 


1017 


AAB95368 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17684. 


1712 


100 


1018 


gil 2002 127 


Homo sapiens 


CCK1 protein (CCK1) mRNA, 
complete cds. 


55 


45 


1018 


gi9392591 


Homo sapiens 


CC chemokine CCL28 (SCYA28) 
mRNA, complete cds. 


55 


45 


1018 


gil0312152 


Homo sapiens 


mucosae-associated epithelial 
chemokine mRNA, complete cds. 


55 


45 
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1019 


AAB90816 


Homo sapiens 


Human shear stress-response 
protein SEQ ID NO: 140. 


365 


100 


1019 


gil0039341 


Dugesia 
dorotocephala 


Vasa-related protein P1VAS 1 


67 


45 


1019 


gil5158371 


Agrobacterium 
tumefaciens 


AGR_L_42GMp 


64 


30 


1020 


gil377897 


Homo sapiens 


heart protein (FHL-2) mRNA, 
complete cds. 


1631 


100 


1020 


gi 1160932 


Homo sapiens 


(clone 35.3) DRAL mRNA, 
complete cds. 


1625 


99 


1020 


gi7209525 


Homo sapiens 


gene for DRAL/Slim3/FHL2, exon 
5 and complete cds. 


1625 


99 


1021 


gi7770259 


Homo sapiens 


PR02975 


794 


100 


1021 


gil 142588 


Trypanosoma 
brucei 


CR3 


55 


40 


1021 


gil237130 


Escherichia coli 


O antigen polymerase 


89 


28 


1022 


AAG81348 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:214. 


530 


63 


1022 


gi4206763 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


119 


30 


1022 


gi2623666 


Canis familiaris 


dentatorubro-pallidoluysian 
atrophy protein 


101 


30 


1023 


AAG01390 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5471. 


297 


100 


1023 


AAB57280 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO: 1858. 


66 


34 


1023 


gi456681 


Pseudorabies 
virus 


helicase 


80 


26 


1024 


gil017722 


Homo sapiens 


Human repressor transcriptional 
factor (ZNF85) mRNA, complete 
cds. 


1344 


58 


1024 


gil4348591 


Homo sapiens 


KRAB zinc finger protein (KR19) 
mRNA, complete cds. 


1342 


58 


1024 


g i4454678 


Homo sapiens 


zinc finger protein 4 


1321 


59 


1025 


gil65680 


Oryctolagus 
cuniculus 


recombination activating protein 


2666 


91 


1025 


gi2576246 


Mus musculus 


RAG-2 protein 


2594 


88 


1025 


gil5809414 


Mormoops 
blainvillii 


recombination activator protein 2 


2387 


91 


1026 


AAG75278 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6042. 


1061 


95 


1026 


gi211896 


Gallus gallus 


h-caldesmon 


153 


29 


1026 


gi600255 


Gallus gallus 


caldesmon 


153 


29 


1027 


AAY87341 


Homo sapiens 


Human signal peptide containing 
protein HSPP-118 SEQ ID 
NO: 11 8. 


699 


99 


1027 


AAY12929 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


66 


51 


1027 


gi930084 


Homo sapiens 


Human Kox21 mRNA for zinc 
finger protein, partial. 


41 


31 


1028 


gil73324 


Candida glabrata 


metallothionein II 


37 


71 


1028 


gil 73326 


Candida glabrata 


metallothionein (MTII) 


37 


71 


1028 


gil73328 


Candida glabrata 


metallothionein 11 


37 


71 


1029 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


598 


44 
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1029 


gil2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


557 


46 


1029 


gil2655456 


Homo sapiens 


partial mRNA for keratin 
associated protein 4.9 (KRTAP4.9 
gene). 


540 


42 


1030 


gil0639287 


Thermoplasma 
acidophilum 


amino acid transporter related 
protein 


66 


29 


1030 


gil4325303 


Thermoplasma 
volcanium 


amino acid transporter 


64 


28 


1030 


gil5341701 


Homo sapiens 


clone MGC: 19805 
IMAGE:3939974, mRNA, 
complete cds. 


60 


34 


1031 


gil2653801 


Homo sapiens 


peptidylprolyl isomerase A 
(cyclophilin A), clone MGC:2351 
IMAGE:3349335, mRNA, 
complete cds. 


820 


93 


1031 


gil2804335 


Homo sapiens 


clone IMAGE:2823490, mRNA, 
partial cds. 


820 


93 


1031 


gil3529080 


Homo sapiens 


peptidylprolyl isomerase A 
(cyclophilin A), clone MGC: 12404 
IMAGE:3935025, mRNA, 
complete cds. 


820 


93 


1032 


gi207621 


Rattus 
norvegicus 


uromodulin 


98 


36 


1032 


gi912817 


Rattus sp. 


Tamm-Horsfall protein; THP 


98 


36 


1032 


gi602768 


Mytilus 

galloprovincialis 


adhesive plaque matrix protein 
precursor 


86 


29 


1033 


AAG03055 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7136. 


269 


100 


1033 


AAY73471 


Homo sapiens 


Human secreted protein clone 
ydl53 1 protein sequence SEQ ID 
NO: 164. 


62 


33 , 


1034 


AAB38043 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 10 clone 
HWHGP71. 


124 


38 


1034 


gi5305335 


Mycobacterium 
tuberculosis 


proline-rich mucin homolog 


114 


36 


1034 


gi22599 


Arabidopsis 
thaliana 


APG 


112 


37 


1035 


gi6467206 


Homo sapiens 


GIOT-4 mRNA for gonadotropin 
inducible transcription repressor-4, 
complete cds. 


2004 


60 


1035 


AAY58627 


Homo sapiens 


Protein regulating gene expression 
PRGE-20. 


1732 


56 


1035 


gi3953593 


Mus musculus 


Zinc finger protein si 1-6 


1720 


54 


1036 


AAB95007 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16685. 


518 


86 


1036 


gi45906 


Proteus vulgaris 


hlyC protein (AA 1-54) 


72 


45 


1036 


AAB56607 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO: 11 85. 


74 


56 


1037 


AAY27616 


Homo sapiens 


Human secreted protein encoded 
by gene No. 50. 


562 


99 


1037 


gil2957417 


Casuarius 
bennetti 


ATPase subunit 8 


62 


35 


1037 


gi332009 


Murine leukemia 
virus 


pl5-gag protein 


57 


27 
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1038 


gil5822827 


Homo sapiens 


mRNA for pendrin-like protein 1, 
complete cds. 


1154 


39 


1038 


gi 13344999 


Homo sapiens 


solute carrier family 26 member 6 
(SLC26A6) mRNA, complete cds. 


1300 


37 


1038 


AAY71067 


Homo sapiens 


Human membrane transport 
protein, MTRP-12. 


1297 


37 


1039 


AAY57945 


Homo sapiens 


Human transmembrane protein 
HTMPN-69. 


780 


100 


1039 


AAY76141 


Homo sapiens 


Human secreted protein encoded 
by gene 18. 


780 


100 


1039 


AAB24037 


Homo sapiens 


Human PRO 1555 protein sequence 
SEQ IDNO:49. 


356 


47 


1040 


AAY59672 


Homo sapiens 


Secreted protein 108-0Q6-5-0-E6- 
FL. 


553 


83 


1040 


gi 104352 14 


Homo sapiens 


cDNA FLJ13263 fis, clone 
OVARC 1000924. 


549 


82 


1040 


AAB94543 


Homo sapiens 


Human protein sequence SEQ ID 
NO.15290. 


549 


82 


1041 


AAY92710 


Homo sapiens 


Human membrane-associated 
protein Zsig24. 


704 


97 


1041 


AAY87250 


Homo sapiens 


Human signal peptide containing 
protein HSPP-27 SEQ ID NO:27. 


566 


99 


1041 


AAG00627 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4708. 


260 


100 


1042 


gil4572521 


Homo sapiens 


NEPH1 (NEPH1) mRNA, 
complete cds. 


1512 


51 


1042 


AAB37996 


Homo sapiens 


Human secreted protein encoded 
by gene 13 clone HIBEU15. 


1164 


92 


1042 


gil0434261 


Homo sapiens 


cDNA FLJ12646 fis, clone 
NT2RM4001987, weakly similar 
to NEURAL CELL ADHESION 
MOLECULE 1, LARGE 
ISOFORM PRECURSOR. 


1035 


41 


1043 


gi29806 


Homo sapiens 


Human mRNA for CD59, an LY- 
6-like protein regulating 
complement membrane attack. 


710 


100 


1043 


gi825637 


Homo sapiens 


H.sapiens gene for CD59 protein, 
exon 2. 


710 


100 


1043 


gi29815 


Homo sapiens 


Human mRNA for CD59 antigen. 


710 


100 


1044 


gi6841140 


Homo sapiens 


HSPC100 mRNA, partial cds. 


498 


100 


1044 


gi2828808 


Bacillus subtilis 


glucose transporter 


111 


25 


1044 


gi9 106658 


Xylella 

fastidiosa 9a5c 


glucose/galactose transporter 


140 


23 


1045 


AAB56632 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1210. 


3377 


99 


1045 


gil 3097708 


Homo sapiens 


ribophorin II, clone MGC:1817 
IMAGE:3546673, mRNA, 
complete cds. 


3152 


100 


1045 


g i5834424 


Homo sapiens 


RIBIIR gene (partial), exon 1 and 
joined CDS. 


3152 


100 


1046 


gil3 182757 


Homo sapiens 


HTPAP mRNA, complete cds. 


598 


100 


1046 


AAG89279 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 399. 


598 


100 


1046 


AAB70690 


Homo sapiens 


Human hDPP protein sequence 
SEQ ID NO:7. 


598 


100 
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1047 


gi2276448 


Homo sapiens 


Human MHC class I HLA-A 
(HLA-A-0302-new allele) mRNA, 
complete cds. 


1794 


93 


1047 


gi6815812 


Homo sapiens 


MHC class I antigen heavy chain 
(HLA-A) mRNA, HLA-A*0302 
allele, complete cds. 


1794 


93 


1047 


gil245460 


Homo sapiens 


Human MHC class I HLA-A allele 
(HLA-A) mRNA, complete cds. 


1786 


92 


1048 


AAB95392 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17743. 


567 


"70 


1048 


AAB29645 


Homo sapiens 


Human membrane-associated 
protein HUMAP-2. 


548 


70 


1048 


AAB95049 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16845. 


396 


HQ 

/o 


1049 


gil4017773 


Mus musculus 


Cgl0671-like 


15 17 


yo 


1049 


gil4017764 


Mus musculus 


CG10671-like 


1517 


96 


1049 


gil6198091 


Drosophila 
melanogaster 


LD30661p 


184 


30 


1050 


AAG81431 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:380. 


503 


97 


1050 


gi6707026 


Monodelphis 
domestica 


immunoglobulin light chain kappa 


108 


26 


1050 


gi6653413 


Oryctolagus 
cuniculus 


immunoglobulin light chain VJ 
kappa region 


102 


27 


1051 


gil2836893 


Gallus gallus 


IPR328-like protein 


158 


29 


1051 


gi3093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


151 


29 


1051 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma- 3 subunit mRNA, 
complete cds. 


151 


29 


1052 


gi4337100 


Homo sapiens 


MSH55 gene, partial cds; and 
CLIC1, DDAH, G6b, G6c, G5b, 
G6d, G6e, G6f, BAT5, G5b, 
CSK2B, BAT4, G4, Apo M, 
BAT3, BAT2, AIF-1, 1C7, LST-1, 
LTB, TNF, and LTA genes, 
complete cds. 


400 


100 


1052 


gi5304878 


Homo sapiens 


genes encoding RNCC protein, 
DDAH protein, Ly6-C protein, 
Ly6-D protein and 
immunoglobulin receptor. 


400 


100 


1052 


AAY27597 


Homo sapiens 


Human secreted protein encoded 
by gene No. 31. 


400 


100 


1053 


AAB88325 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0020. 


912 


99 


1053 


AAB53257 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID NO:797. 


859 


99 


1053 


gil3325409 


Homo sapiens 


clone IMAGE:3845253, mRNA, 
partial cds. 


774 


100 


1054 


gil234787 


Xenopus laevis 


up-regulated by thyroid hormone 
in tadpoles; expressed specifically 
in the tail and only at 
metamorphosis; membrane bound 
or extracellular protein; C-terminal 
basic region 


917 


61 


1054 


gil0435980 


Homo sapiens 


cDNA FLJ13840 fis, clone 


812 


62 
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THYRO 1000783, moderately 
similar to Xenopus laevis tail- 
specific thyroid hormone up- 
regulated (gene 5) mRNA. 






1054 


AAB94773 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15860. 


812 


62 ! 


1055 


gi4099139 


Homo sapiens 


Human P2X4 purinoreceptor gene, 
exons 9, 10, 11 and 12 and 
complete cds. 


2014 


100 


1055 


gi4099121 


Homo sapiens 


Human P2X4 purinoreceptor 
mRNA, complete cds. 


2014 


100 


1055 


AAW47066 


Homo sapiens 


Human brain P2X-1 receptor 
polypeptide. 


2014 


100 


1056 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1020 


99 


1056 


gil5637151 


Beta vulgaris 


glycine decarboxylase subunit P 


62 


36 


1056 


gi5 824822 


Caenorhabditis 
elegans 


Y53F4A.2 


62 


25 


1057 


gi972946 


Mus musculus 


ZP1 precursor 


2217 


67 


1057 


gill 13794 


Mus musculus 


zona pellucida 


2210 1 


67 


1057 


gi2804566 


Rattus 
norvegicus 


zona pellucida 1 glycoprotein 


2200 


67 


1058 


gil5779156 


Homo sapiens 


Similar to RIKEN cDNA 
1810073N04 gene, clone 
MGC: 15523 IMAGE: 3 02 8 844, 
mRNA, complete cds. 


1858 


100 


1058 


gil3097045 


Mus musculus 


Similar to RIKEN cDNA 
1810073N04 gene 


1719 


91 


,1058 


gi603254 


Saccharomyces 
cerevisiae 


Yel064cp 


319 


27 


1059 


AAW03516 


Homo sapiens 


Prostaglandin DP receptor. 


1467 


100 


1059 


gi940379 


Homo sapiens 


Human DP prostanoid receptor 
(PTGDR) gene, 5' region and 
partial cds. 


1467 


100 


1059 


gi4567038 


Rattus 
norvegicus 


prostaglandin D2 receptor 


1127 


77 


1060 


gi2811122 


Xenopus laevis 


NaDC-2 


1274 


56 


1060 


gil098557 


Homo sapiens 


Human renal sodium/dicarboxylate 
cotransporter (NADC1) mRNA, 
complete cds. 


1618 


55 


1060 


gi3 168585 


Rattus 
norvegicus 


sodium-dependent dicarboxylate 
transporter 


1614 


54 


1061 


gi3036840 


Homo sapiens 


mRNA for cystinosin. 


1686 


88 


1061 


gi3036851 


Homo sapiens 


CTNS gene, exon 3, flanking 
intronic regions and joined CDS. 


1686 


88 


1061 


gi7239176 


Homo sapiens 


vanilloid receptor gene, partial 
sequence; CARKL and CTNS 
genes, complete cds; TIP1 gene, 
partial cds; P2X5b and r2X5a 
genes, complete cds; and 
HUMINAE gene, partial cds. 


1686 


88 


1062 


gi41077 


Escherichia coli 


cal protein precursor (aa 1-51) 


63 


42 


1062 


gi6474978 


Schizosaccharom 
yces pombe 


Amino acid permease 


62 


27 


1062 


AAB40157 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:67. 


60 


27 
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1063 


AAY36071 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 456. 


1252 


92 


1063 


gi 15990604 


Homo sapiens 


RAE-l-iike transcript 4 mRNA, 
complete cds. 


1022 


97 


1063 


AAG00501 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4582. 


533 


95 


1064 


gil4290560 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2, clone 
MGC:9286 IMAGE:3 874367, 
mRNA, complete cds. 


1548 


98 


1064 


gil5277509 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2, clone 
MGC:17157 IMAGE:42 14662, 
mRNA, complete cds. 


1548 


97 


1064 


gi3211722 


Homo sapiens 


lamin B receptor homolog 
TM7SF2 (TM7SF2) mRNA, 
complete cds. 


1132 


77 


1066 


AAE06611 


Homo sapiens 


Human protein having 
hydrophobic domain, HP03696. 


1552 


99 


1066 


gil 3676372 


Homo sapiens 


clone MGC:4595 
IMAGE:3345729, mRNA, 
complete cds. 


469 


50 


1066 


AAY41690 | 


Homo sapiens 


Human PR0329 protein sequence. 


469 


50 


1067 


AAG72119 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1800. 


1036 


81 


1067 


gi3769616 


Rattus 
norvegicus 


olfactory receptor 


887 


81 


1067 


gil2054453 


Homo sapiens 


6M1-18*01 gene for olfactory 
receptor, cell line BM28.7. 


547 


42 


1068 


gi7 106778 


Homo sapiens 


HSPC194 


530 


95 


1068 


AAW64547 


Homo sapiens 


Human stomach cancer cell clone 
HP10175 protein. 


530 


95 


1068 


AAY35949 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 198. 


530 


95 


1069 


gi402185 


Homo sapiens 


H.sapiens ALK-2 mRNA. 


1572 


100 


1069 


gi338219 


Homo sapiens 


Human novel serine kinase 

* T"» "VTA 1 j 1 

receptor mRNA, complete cds. 


1572 


100 


1069 


AAR85206 


Homo sapiens 


Human ALK-2. 


1572 


100 


1070 


gi4128041 


Homo sapiens 


claudin-9 (CLDN9) gene. 


227 


35 


1070 


AAB64401 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA33. 


227 


35 


1070 


gi4325296 


Mus musculus 


claudin-9 


214 


34 


1071 


gil405893 


Homo sapiens 


H.sapiens MICA gene. 


1896 


93 


1071 


AAW60043 


Homo sapiens 


Human MHC class I chain-related 
gene A (MICA) polypetide. 


1896 


93 


1071 


gi508492 


Homo sapiens 


Human MHC class I-related 
protein mRNA, complete cds. 


1838 


90 


1072 


gil 5292437 


Drosophila 
melanogaster 


LP10272p 


AAA 


z>y 


1072 


gi4877582 


Homo sapiens 


lipoma HMGIC fusion partner 
(LHFP) mRNA, complete cds. 


221 


28 


1072 


AAY87336 


Homo sapiens 


Human signal peptide containing 
protein HSPP-1 13 SEQ ID 
NO: 113. 


221 


28 


1073 


1 AAB58289 


Homo sapiens 


Lung cancer associated 


1338 


100 
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polypeptide sequence SEQ ID 627. 






1073 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584_2 protein sequence. 


1338 


100 


1073 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:114. 


1247 


100 


1074 


AAB58289 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 627. 


1338 


100 


1074 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584_2 protein sequence. 


1338 


100 


1074 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 114. 


1247 


100 


1075 


AAE04780 


Homo sapiens 


Human vesicle trafficking protein- 
23 (VETRP-23) protein. 


864 


100 


1075 


AAB28629 


Homo sapiens 


Human Bl lAgl antigen splice 
isoform B11C-8. 


546 


39 


1075 


AAB28630 


Homo sapiens 


Human Bl lAgl antigen splice 
isoform B11C-9-16. 


546 


39 


1076 


gil5811373 


Mus musculus 


G protein coupled receptor 
affecting testicular descent 


1269 


83 


1076 


gil0441730 


Homo sapiens 


leucine-rich repeat-containing G 
protein-coupled receptor 7 (LGR7) 
mRNA, complete cds. 


1004 


62 


1076 


AAY42170 


Homo sapiens 


Human LGR7 long form protein 
sequence. 


1004 


62 


1077 


gil3544043 


Homo sapiens 


clone IMAGE:3627317, mRNA, 
partial cds. 


1257 


52 


1077 


gil4249892 


Homo sapiens 


spinster-like protein, clone 
MGC: 15767 IMAGE:3501826, 
mRNA, complete cds. 


1257 


52 


1077 


gil2003980 


Homo sapiens 


spinster-like protein mRNA, 
complete cds. 


1257 


52 


1078 


AAB85029 


Homo sapiens 


Protein encoded by BAP28 cDNA 
consisting of exons 1 to 45. 


1618 


68 


1078 


AAW54099 


Homo sapiens 


Homo sapiens BAP28 sequence. 


1617 


67 


1078 


gi7022341 


Homo sapiens 


cDNA FLJ10359 fis, clone 
NT2RM2001243. 


1588 


92 


1079 


gil3491841 


Rattus 
norvegicus 


gamma-glutamyltranspeptidase- 
like protein 


209 


34 


1079 


AAG75266 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6030. 


217 


100 


1079 


gi57806 


Rattus sp. 


gamma-glutamyltranspeptidase 
(AA 1-568) 


186 


33 


1080 


gi5262646 


Homo sapiens 


mRNA; cDNA DKFZp434I091 
(from clone DKFZp434I091); 
partial cds. 


2917 


100 


1080 


gi6807820 


Homo sapiens 


mRNA; cDNA DKFZp434A2372 
(from clone DKFZp434A2372); 
partial cds. 


629 


100 


1080 


gil408182 


Homo sapiens 


Human LGN protein mRNA, 
complete cds. 


282 


31 


1081 


gi4878022 


Homo sapiens 


acyl-coenzyme A: cholesterol 
acyltransferase mRNA, complete 
cds. 


930 


98 


1081 


AAR53079 


Homo sapiens 


Acetyl coenzyme A: cholesterol 
acetyltransferase (AC AT). 


925 


98 
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1081 


AAW38416 


Homo sapiens 


Human acyl-coenzyme 

A: cholesterol acyltransferase I. 


925 


98 


1082 


gi458938 


Saccharomyces 
cerevisiae 


Yhrl86cp 


1004 


58 


1082 


gi5921144 


Schizosaccharom 
yces pombe 


mipl 


2049 


52 


1082 


gi9366720 


Trypanosoma 
brucei 


possible tl6ol 1.22 protem. 


277 


45 


1083 


gi402187 


Homo sapiens 


H.sapiens ALK-3 mRNA. 


1664 


99 


1083 


AAR55368 


Homo sapiens 


Human Activin receptor-like 
kinase 3 (hALK-3). 


1664 


99 


1083 


AAR85207 


Homo sapiens 


Human ALK-3. 


1664 


99 


1084 


gi609354 


Xenopus laevis 


BMP receptor 


1485 


90 


1084 


gi2446992 


Xenopus laevis 


'BMP receptor' 


1483 


89 i 


1084 


gi3551073 


Danio rerio 


type I serin/threonine kinase 
receptor 


1451 


87 


1085 


AAW90873 


Homo sapiens 


Human brain-specific dysferlin 
protein. 


1340 


53 


1085 


gi3600028 


Homo sapiens 


dysferlin mRNA, complete cds. 


1340 


53 


1085 


AAY82643 


Homo sapiens 


Human dysferlin protein sequence 
SEQIDNO:2. 


1340 


53 


1086 


gi3600028 


Homo sapiens 


dysferlin mRNA, complete cds. 


1866 


49 


1086 


AAY82643 


Homo sapiens 


Human dysferlin protein sequence 
SEQ ID NO:2. 


1866 


49 


1086 


AAW90868 


Homo sapiens 


Human dysferlin protein. 


1866 


49 


1087 


AAY92321 


Homo sapiens 


Human alpha-2-delta-D calcium 
channel subunit. 


5881 


99 


1087 


AAB62262 


Homo sapiens 


Human calcium channel 
alpha2delta subunit. 


5745 


99 


1087 


AAY92323 


Homo sapiens 


Human alpha-2-delta-D 
polypeptide from splice variant 1 . 


4976 


99 


1088 


gi2 104689 


Mus musculus 


alpha glucosidase II, alpha subunit 


1796 


55 


1088 


gi 1890664 


Sus scrofa 


glucosidase II 


1792 


55 


1088 


gi7672977 


Homo sapiens 


glucosidase II alpha subunit 
mRNA, complete cds. 


1783 


55 


1089 


AAY01143 


Homo sapiens 


Secreted protein encoded by gene 
9 clone HSIDY06. 


238 


100 


1089 


gi6692409 


Otus longicornis 


cytochrome b 


64 


38 


1089 


gil0312185 


Otus watsonii 


cytochrome b 


61 


43 


1090 


gi 13477285 


Homo sapiens 


structure specific recognition 
protein 1, clone MGC:1608 
IMAGE:3536048, mRNA, 
complete cds. 


3683 


100 


1090 


gi 184242 


Homo sapiens 


Human high mobility group box 
(SSRP1) mRNA, complete cds. 


3683 


100 


1090 


AAR38744 


Homo sapiens 


Human SSRP. 


3683 


100 


1091 


gil77814 


Homo sapiens 


Human alpha- 1-antitryp sin-related 
protein gene, exons 3, 4 and 5. 


1925 


90 


1091 


AAP50132 


Homo sapiens 


Sequence of the predominant form 
of human alpha- 1 -antitrypsin( AT) . 


828 


59 


1091 


gil 5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A 
(alpha- 1 antiproteinase, 
antitrypsin), member 1, clone 
MGC:23330 IMAGE:4644658, 
mRNA, complete cds. 


1409 


66 
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/o xucuiiiy 


1092 


AAB56819 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1397. 


1054 


100 


1092 


gi 1598 1490 


Yersinia pestis 


protease 


137 


28 


1092 


gi9654995 


Vibrio cholerae 


protease DegS 




zy 


1093 


gil3543976 


Homo sapiens 


clone IMAGE:3603998, mRNA, 
partial cds. 


1523 


100 


1093 


gil 5930240 


Homo sapiens 


Similar to CAP-binding protein 
complex interacting protein 2, 
clone MGC:9962 
1MAGE:3878011, mRNA, 
complete cds. 


1523 


100 


1093 


AAY57946 


Homo sapiens 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


1094 


AAY53031 


Homo sapiens 


Human secreted protein clone 
dd426_l protein sequence SEQ ID 
NO:68. 


590 


93 


1094 


AAY71062 


Homo sapiens 


Human membrane transport 
protein, MTRP-7. 


158 


26 


1094 


gil 5529 155 


Arabidopsis 
thaliana 


AT3g30390/T6J22_l 6 


lor 

135 


no 
JLJL 


1095 


gi4959568 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP (NPIP) mRNA, 
complete cds. 


IojU 


y4 


1095 


gi2342743 


Homo sapiens 


Human Chromosome 16 BAC 
clone CIT987SK-A-589H1, 
complete sequence. 


1627 


y3 


1095 


AAY10915 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


760 


88 


1096 


gi7022118 


Homo sapiens 


cDNA FLJ10213 fis, clone 
HEMBA 1006474, weakly similar 
to 40 KD PROTEIN. 


1074 


99 


1096 


AAB92609 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10874. 


1 A '7/1 
1U /4 


QO 

yy 


1096 


gi456886 


Borna disease 
virus 


p40 




41 


1097 


gi38432 


Homo sapiens 


H.sapiens gene for mitochondrial 
ATP synthase c subunit (P2 form). 


612 


90 


1097 


gi285910 


Homo sapiens 


P2 mRNA for ATP synthase 
subunit c, complete cds. 


OlZ 


OA 

yu 


1097 


AAB43694 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:1139. 


OlJ. 


OA 

yu 


1098 


gi897827 


Homo sapiens 


Human iron-responsive element- 
binding protein/iron regulatory 
protein 2 (1RE-BP2/1KFZ) mJKJNA, 
partial cds. 




yy 


1098 


gi897581 


Homo sapiens 


Human iron-regulatory protein 2 
(IRP2) mRNA, partial cds. 


4yuy 


no 

yy 


1098 


gi897583 


Rattus 
norvegicus 


iron-regulatory protein 2 


4700 


93 


1099 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 
partial cds. 


75 


32 


1099 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO:79. 


61 


35 


1099 


AAY76213 


Homo sapiens 


Human secreted protein encoded 
by gene 90. 


61 


35 


1100 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 


75 


32 
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partial cds. 






1100 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO: 79. 


61 


35 


1100 


AAY76213 


Homo sapiens 


Human secreted protein encoded 
by gene 90. 


61 


35 


1101. 


gi5732908 


Homo sapiens 


BPAGln3 (BPAG1) mRNA, 
partial cds. 


75 


32 


1101 


AAY87302 


Homo sapiens 


Human signal peptide containing 
protein HSPP-79 SEQ ID NO:79. 


61 


35 


1101 


AAY76213 


Homo sapiens 


Human secreted protein encoded 
by gene 90. 


61 


35 


1102 


AAY86234 


Homo sapiens 


Human secreted protein 
HNTNC20, SEQ ID NO: 149. 


88 


31 


1102 


gi5430769 


Arabidopsis 
thaliana 


Similar to somatic embryogenesis 
receptor-like kinase 


88 


32 


1102 


AAB24074 


Homo sapiens 


Human PROl 153 protein sequence 
SEQ ID NO:49. 


79 


22 


1103 


gil3447199 


Homo sapiens 


sphingosine- 1 -phosphate 
phosphatase mRNA, complete cds. 


1931 


98 


1103 


gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrolase 


1692 


83 


1103 


gil5778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


1692 


83 


1104 


gil2052824 


Homo sapiens 


mRNA; cDNA DKFZp564H1562 
(from clone DKFZp564H1562); 
complete cds. 


1544 


100 


1104 


gi5326797 


Homo sapiens 


junctional adhesion molecule 
(JAM1) mRNA, complete cds. 


1544 


100 


1104 


gi5731339 


Homo sapiens 


junctional adhesion molecule- 1 
mRNA, complete cds. 


1544 


100 


1105 


gi296636 


Homo sapiens 


Human apoC-II gene for 
preproapolipoprotein C-IL 


506 


100 


1105 


gi757915 


Homo sapiens 


Human mRNA for lipoprotein 
apoCII. 


506 


100 


1105 


gil78836 


Homo sapiens 


APOC2 gene, complete sequence; 
and apolipoprotein C-II (APOC2) 
gene, complete cds. 


506 


100 


1106 


gil3097159 


Homo sapiens 


tumor protein, translationally- 
controlled 1, clone MGC:5308 
IMAGE:2899964, mRNA, 
complete cds. 


794 


97 


1106 


gi7573519 


Homo sapiens 


TPT1 gene for translationally 
controlled tumor protein (TCTP), 
exons 1-6. 


794 


97 


1106 


gi37496 


Homo sapiens 


Human mRNA for translationally 
controlled tumor protein. 


794 


97 


1107 


gil2082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


3523 


84 


1107 


gil2082723 


Gallus gallus 


B cell phosphoinositide 3 -kinase 
adaptor 


2821 


oy 


1107 


AAB43816 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:1261. 


1257 


98 


1108 


gil0177622 


Arabidopsis 
thaliana 


gene_jd:K6M13.11~ 


201 


39 


1108 


gi!0437414 


Homo sapiens 


cDNA: FLJ21330 fis, clone 
COL02466. 


165 


34 
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1108 


gi499199 


Schizosaccharom 
yces pombe 


uvi22 


155 


n 


1109 


gi 13436446 


Homo sapiens 


myosin regulatory light chain, 
clone MGC:4405 
IMAGE:2906108, mRNA, 
complete cds. 


OOI 

OO 1 


oo 

yy 


1109 


gi829623 


Homo sapiens 


Human myosin regulatory light 
chain mRNA, complete cds. 


881 


99 


1109 


gil5076511 


Homo sapiens 


MLC-2 mRNA for nonmuscle 
myosin light chain 2, complete cds. 


881 


99 


1110 


gi5305502 


Mus museums 


phospholemman precursor 


153 


45 


1110 


gil916012 


Rattus 
norvegicus 


phospholemman chloride channel 


1 A1 


si 
jj 


1110 


gil916010 


Homo sapiens 


Human phospholemman chloride 
channel mRNA, complete cds. 


1 


A7 


1111 


gi 13272522 


Homo sapiens 


transcription factor NYD-splO 
mRNA, complete cds. 


1344 


90 


1111 


gil4278918 


Homo sapiens 


mRNA for transcription factor 
RFX4, complete cds. 


1166 


82 


1111 


gi583352 


synthetic 
construct 


does not include the start ot stop 
codon 


162 


29 


1112 


AAB47296 


Homo sapiens 


PRO4401 polypeptide. 


1062 


58 


1112 


AAY22496 


Homo sapiens 


Human secreted protein sequence 
clone cn621 8. 


1 A/CO 


Do 


1112 


gil4042441 


Homo sapiens 


cDNA FLJ 14724 fis, clone 
NT2RP3001716. 


400 


A1 

43 


1113 


gil5341863 


Homo sapiens 


Similar to RIKEN cDNA 
290005 2H21 gene, clone 
MGC:21625 IMAGE:4214683, 
mRNA, complete cds. 


758 


98 


1113 


AAY33297 


Homo sapiens 


Human membrane spanning 
protein MSP-4. 


/JO 


no 
yo 


1113 


AAB61149 


Homo sapiens 


TT \T/M 71 O -i — ? — 

Human NO VI 8 protein. 


7^Q 

/JO 


OR 

yo 


1114 


gilll25139 


Homo sapiens 


Novel human gene mapping to 
chomosome 22. 


476 


89 


1114 


AAY94914 


Homo sapiens 


Human secreted protein clone 
pw337 6 protein sequence SEQ ID 
NO:34. 


476 


89 


1114 


gi602584 


Methanosarcina 
mazei 


cytochrome b 


75 




1115 


AAG72267 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1948. 


1 m o 1 


1 f\f\ 

1UU 


1115 


AAG72407 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2088. 


1 O Q 1 

iZo I 


1UU 


1115 


AAG72270 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1951. 


yy / 




1116 


gil2002782 


Homo sapiens 


olfactory receptor-like protein 
JCG2 (JCG2) mRNA, partial cds. 


1538 


100 


1116 


gil2002784 


Homo sapiens 


olfactory receptor-like protein 
jKsVjz. yj v^vjx j gene, complete oub. 


1538 


100 


1116 


AAE04555 


Homo sapiens 


Human G-protein coupled 
receptor- 11 (GCREC-11) protein. 


1538 


100 


1117 


gi5802817 


Homo sapiens 


endogenous retrovirus HERV- 
K104 long terminal repeat, 
complete sequence; and Gag 
protein (gag) and envelope protein 
(env) genes, complete cds. 


479 


77 
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1117 


gil469243 


Human 
endogenous 
retrovirus K 


pol/env 


466 


77 


1117 


gi3 150438 


Human 
endogenous 
retrovirus K 


pol-env 


466 


77 


1118 


AAG89341 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 461. 


501 


99 


1118 


gi6651037 


Mus musculus 
domesticus 


similar to RNA binding protein 


411 


96 


1118 


AAG02095 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6176. 


167 


55 


1119 


AAB20155 


Homo sapiens 


Secreted protein SECPL 


3983 


51 


1119 


gi3080663 


Homo sapiens 


PAC clone RP5-1 168D1 1 from 
7p21-p22, complete sequence. 


1408 


47 


1119 


gi2897863 


Homo sapiens 


BAC clone GS1-164B5 from 
7p21-p22, complete sequence. 


1340 


50 


1120 


gi32329 


Homo sapiens 


Human HMG-17 gene for non- 
histone chromosomal protein 
HMG-17. 


429 


94 


1120 


gi306864 


Homo sapiens 


Human non-histone chromosomal 
protein HMG-17 mRNA, complete 
cds. 


429 


94 


1120 


AAB28199 


Homo sapiens 


Human HMG-17 non histone 
chromosomal protein. 


429 


94 


1121 


gil3905022 


Homo sapiens 


Similar to interferon induced 
transmembrane protein 3 (1-8U), 
clone MGC:5225 
IMAGE:2986145, mRNA, 
complete cds. 


444 


69 


1121 


gil4250038 


Homo sapiens 


Similar to interferon induced 
transmembrane protein 3 (1-8U), 
clone MGC: 14565 
IMAGE:4075453, mRNA, 
complete cds. 


436 


68 


1121 


gi23398 


Homo sapiens 


Human 1-8U gene from interferon- 
inducible gene family. 


435 


67 


1122 


gi70 19933 


Homo sapiens 


cDNA FLJ20071 fis, clone 
COLO 1887. 


2163 


100 


1122 


AAB36618 


Homo sapiens 


Human FLEXHT-40 protein 
sequence SEQ ID NO:40. 


1051 


100 


1122 


AAW88957 


Homo sapiens 


Polypeptide fragment encoded by 
gene 128. 


902 


1UU 


1123 


AAB60112 


Homo sapiens 


Human transport protein TPPT-32. 


775 


1 AA 
1UU 


1123 


gil 1558029 


Homo sapiens 


boct gene for organic cation 
transporter. 


382 


48 


1123 


gi9663117 


Homo sapiens 


mRNA for organic cation 
transporter. 


382 


48 


1124 


AAR28120 


Homo sapiens 


NKG2 transmembrane protein-D. 


727 


95 


1 1 1 A 

1124 


;l AO AOif c 

gi2980865 


Homo sapiens 


jNisAjziJ gene, exons z-z> ana 
joined mRNA and CDS. 






1124 


gi35063 


Homo sapiens 


Human mRNA for NKG2-D gene. 


724 


94 


1125 


gi7767239 


Homo sapiens 


nectin-like protein 2 (NECL2) 
mRNA, complete cds. 


612 


39 


1125 


gi45 19602 


Homo sapiens 


IGSF4 gene, exon 10 and complete 
cds. 


609 


38 


1125 


AAY45092 


Homo sapiens 


Human lymphoid derived dendritic 


609 


38 
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cell adhesion molecule. 






1126 


gi7020365 


Homo sapiens 


cDNA FLJ20336 fis, clone 
HEP 11722. 


4316 


99 


1126 


gi 10435 830 


Homo sapiens 


cDNA FLJ13727 fis, clone 
PLACE3000103. 


3079 


99 


1126 


AAB94738 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15776. 


3079 


99 


1127 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO: 148. 


678 


99 


1127 


AAB75542 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:96. 


294 


100 


1127 


gi 18640 11 


Homo sapiens 


mRNA for SHPS-1, complete cds. 


zol 


A 1 


1128 


gi7020372 


Homo sapiens 


cDNA FLJ20340 fis, clone 
HEP 12374. 


1692 


99 


1128 


gi4098525 


Prochlorothrix 
hollandica 


CytM 


OA 

oO 


1 1 


1128 


gi324932 


Influenza A virus 


PA polymerase 


67 


38 


1129 


gi7023403 


Homo sapiens 


cDNA FLJ11006 fis, clone 
PLACE 1003045. 


499 


59 


1129 


AAB93412 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12616. 


499 


59 


1129 


gil3542919 


Mus musculus 


Similar to mucolipin 1 


432 


61 


1130 


gil5488920 


Homo sapiens 


Similar to RIKEN cDNA 
2010107G23 gene, clone 
MGC:9596 IMAGE:3896656, 
mRNA, complete cds. 


107 


42 


1130 


AAW74777 


Homo sapiens 


Human secreted protein encoded 
by gene 48 clone HFCAI74. 


74 


40 


1130 


gi 1304441 


Pseudorabies 
virus 


Rsp40 


69 


32 


1131 


gilOl 19918 


Homo sapiens 


brain otoferlin short isoform 
(OTOF) mRNA, complete cds. 


1315 


49 


1131 


gilOl 19916 


Homo sapiens 


brain otoferlin long isoform 
(OTOF) mRNA, complete cds. 


1315 


49 


1131 


gi4588470 


Homo sapiens 


otoferlin (OTOF) mRNA, 
complete cds. 


2214 


43 


1132 


gil006665 


Homo sapiens 


H. sapiens mRNA for transcript 
associated with monocyte to 
macrophage differentiation. 


442 


98 


1132 


gil5155898 


Agrobacterium 
tumefaciens 


AGR_C_1653p 


167 


31 


1132 


gil5023850 


Clostridium 
acetobutylicum 


Predicted membrane protein, 
hemolysin III homolog 


117 


A A 

44 


1133 


AAG71803 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1484. 


1494 


92 


1133 


AAG71805 


Homo sapiens 


Human olfactory receptor 
poiypeptiae, oc,^ ijj jnli: 14oo. 


1205 


92 


1133 


AAG71807 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1488. 


1178 


70 


1134 


AAY70455 


Homo sapiens 


Human membrane channel protein- 
5 (MECHP-5). 


609 


91 


1134 


AAV83992_ 
aal 


Homo sapiens 


Nucleic acid encoding a protein 
with water channel activity. 


608 


92 


1134 


gi23 17274 


Homo sapiens 


mRNA for aquaporin adipose, 


608 


92 
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complete cds. 






1135 


gi33 19326 


Homo sapiens 


protein associated with Myc 
mRNA, complete cds. 


111 


33 


1136 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 mRNA, 
complete cds. 


2574 


97 


1136 


gil0880482 


Mus musculus 


monocarboxylate transporter 4 


393 


39 


1136 


gi2463634 


Homo sapiens 


monocarboxylate transporter 
(MCT3) mRNA, complete cds. 


394 


40 


1137 


gil3528675 


Homo sapiens 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC:3723 
IMAGE:3618755, mRNA, 
complete cds. 


705 


94 


1137 


gil3938484 


Homo sapiens 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC: 16271 
IMAGE:3831016, mRNA, 
complete cds. 


705 


94 


1137 


gil4043553 


Homo sapiens 


ATPase, H+ transporting, 
lysosomal (vacuolar proton pump) 
16kD, clone MGC: 12873 
IMAGE:4 127653, mRNA, 
complete cds. 


705 


94 


1138 


gil5080314 


Homo sapiens 


Similar to RIKEN cDNA 
0610010D20 gene, clone 
MGC:20590 IMAGE :43 10241, 
mRNA, complete cds. 


514 


100 


1138 


gil0580053 


Halobacterium 
sp. NRC-1 


dihydrodipicolinate synthase; 
DapA 


379 


33 


1138 


gil590977 


Methanococcus 
jannaschii 


dihydrodipicolinate synthase 
(dapA) 


336 


29 


1139 


AAE06614 


Homo sapiens 


Human protein having 
hydrophobic domain, HP03974. 


1394 


100 


1139 


gi520469 


Oryctolagus 
cuniculus 


597 aa protein related to 
Na/glucose cotransporters 


1231 


85 


1139 


gi338055 


Homo sapiens 


Human Na+/glucose cotransporter 
1 mRNA, complete cds. 


705 


57 


1140 


gi6708478 


Mus musculus 


formin-like protein 


1571 


66 


1140 


gi4101720 


Mus musculus 


lymphocyte specific formin related 
protein 


1543 


65 


1140 


gil914849 


Mus musculus 


WW domain binding protein 3; 
WBP3 


299 


54 


1142 


gil2052738 


Homo sapiens 


mRNA; cDNA DKFZp564H1322 
(from clone DKFZp564H1322); 
complete cds. 


1755 


96 


1142 


g i 10434632 


Homo sapiens 


cDNA FLJ12886 fis, clone 
NT2RP2004041, weakly similar to 
SYNAPSINS IA AND IB. 


1755 


96 


1 1 /ii 
1 142 


AAB94358 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14883. 


1755 


96 


1143 


AAW54370 


Homo sapiens 


G-protein coupled receptor 
HLTEX11. 


1815 


100 


1143 


AAB64854 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 36 SEQ ID 
NO: 140. 


1792 


100 


1143 


AAW70504 


Homo sapiens 


Leukocyte seven times membrane- 


821 


46 



WO 03/054152 PCT/US02/39555 



Table 2A 
152 



SEQ ID 


Hit ID 


Species 


Description 


S score 


% Identity 








penetrating type receptor protein 
JEG18. 






1144 


gil5278128 


Mus musculus 


chemokine-like factor 2 variant 2 


412 


49 


1144 


AAB51648 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 29 SEQ ID 
NO:88. 


410 


100 


1144 


AAE03929 


Homo sapiens 


Human gene 32 encoded secreted 
protein HTLIF12, SEQ ID NO: 92. 


410 


100 


1146 


gil3477335 


Homo sapiens 


vitamin A responsive; 
cytoskeleton related, clone 
MGC:1917 IMAGE:35 10436, 
mRNA, complete cds. 


111 


95 


1146 


gi3746652 


Homo sapiens 


JWA protein mRNA, complete 
cds. 


111 


95 


1146 


gi6563260 


Homo sapiens 


jmx protein mRNA, complete cds. 


111 


95 


1147 


gi2970431 


Florometra 
serratissima 


NADH dehydrogenase subunit 4 


94 


31 


1147 


gil5042530 


Chilo iridescent 
virus 


450L 


70 


24 


1147 


AAY87197 


Homo sapiens 


Human secreted protein sequence 
SEQIDNO:236. 


90 


27 


1148 


AAB93562 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12957. 


2402 


100 


1148 


gi7023538 


Homo sapiens 


cDNA FLJ11091 fis, clone 
PLACE1005313. 


860 


100 


1148 


AAB93489 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12790. 


860 


100 


1150 


gil 0438431 


Homo sapiens 


cDNA: FLJ221 55 fis, clone 
HRC00205. 


1995 


100 


1150 


gil0437336 


Homo sapiens 


cDNA: FLJ21267 fis, clone 
COL01717. 


1776 


99 


1150 


gi7020065 


Homo sapiens 


cDNA FLJ20152 fis, clone 
COL08515. 


705 


100 


1151 


gil2654159 


Homo sapiens 


interferon induced transmembrane 
protein 1 (9-27), clone MGC: 5 195 
IMAGE:3464598, mRNA, 
complete cds. 


569 


93 


1151 


gil 177476 


Homo sapiens 


H.sapiens mRNA for interferon- 
induced 17kDa membrane protein. 


569 


93 


1151 


gil 77802 


Homo sapiens 


Human interferon-inducible 
protein 9-27 mRNA, complete cds. 


563 


92 


1152 


AAG72230 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1911. 


1615 


100 


1152 


AAG72382 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2063. 


1615 


100 


1152 


gil5293613 


Homo sapiens 


clone OR5C1 olfactory receptor 
gene, partial cds. 


1097 


100 


1153 


gi784997 


Homo sapiens 


H.sapiens mRNA for tumour 
suppressor protein, HUGL. 


5025 


95 


1153 


gil 944491 


Homo sapiens 


Human LLGL mRNA, complete 
cds. 


4797 


91 


1153 


gi854124 


Homo sapiens 


H.sapiens mRNA for human giant 
larvae homolog. 


2837 


58 


1154 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18850. 


219 


72 


1154 


gi7959889 


Homo sapiens 


PR02221 


137 


49 
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1154 


gi2072969 


Homo sapiens 


Human LI element L1.24 p40 
gene, complete cds. 


133 


48 


1155 


gi 15277644 


Homo sapiens 


amino acid transporter (SLC7A10) 
gene, exon 1 1 and complete cds. 


2487 


1 A A 

100 


1155 


gi9309293 


Homo sapiens 


hasc-1 mRNA for asc-type amino 
acid transporter 1, complete cds. 


2487 


100 


1155 


gi7415938 


Mus musculus 


ascl 


2329 


91 


1156 


gi6760373 


Homo sapiens 


ODZ3 (ODZ3) mRNA, partial cds. 


2323 


100 


1156 


gi4760780 


Mus musculus 


Ten-m3 


2248 


96 


1156 


gi6010049 


Gallus gallus 


teneurin-2 protein 


878 


62 


1157 


gil4286298 


Homo sapiens 


clone MGC:3593 
IMAGE:2963628, mRNA, 
complete cds. 


630 


99 


1157 


gi4877285 


Homo sapiens 


mRNA for prenylated Rab 
acceptor 1. 


630 


99 


1157 


gi6563192 


Homo sapiens 


prenylated rab acceptor 1 mRNA, 
complete cds. 


630 


99 


1158 


gi 1780976 


Human 
endogenous 
retrovirus K 


protease 


915 


58 


1158 


gi5802824 


Homo sapiens 


endogenous retrovirus HERV- 
K109, complete sequence. 


909 


59 


1158 


gi9558703 


Homo sapiens 


tandemly repeated human 
endogenous retrovirus HERV-K 
(HML-2.HOM), complete 
sequence. 


905 


59 


1159 


gil3111941 


Homo sapiens 


vesicle-associated soluble NSF 
attachment protein receptor (v- 
SNARE; homolog of S. cerevisiae 
VTI1), clone MGC:3767 
IMAGE: 295 8320, mRNA, 
complete cds. 


804 


91 


1159 


gi3861488 


Homo sapiens 


vesicle soluble NSF attachment 
protein receptor VTI2 mRNA, 
complete cds. 


804 


91 


1159 


AAY73339 


Homo sapiens 


HTRM clone 2056042 protein 
sequence. 


804 


91 


1160 


gi 1922891 


Mus musculus 


alpha 3B chain of laminin-5 


10355 


75 


1160 


gi5777581 


Homo sapiens 


H. sapiens mRNA for laminin-5, 
alpha3b chain. 


9398 


99 


1160 


gi551597 


Homo sapiens 


laminin-related protein (LamA3) 
mRNA, complete cds. 


8690 


100 


1161 


AAD05056_ 
aal 


Homo sapiens 


HUMA- Human secreted protein- 
encoding gene 4 cDNA clone 
HKAAV61, SEQ ID NO:14. 


1524 


83 


1161 


AAE01167 


Homo sapiens 


HUMA- Human gene 4 encoded 
secreted protein HKAAV61, SEQ 
IDNO:68. 


1523 


82 


1161 


AAG67516 


Homo sapiens 


SMIK Amino acid sequence of a 
human secreted polypeptide. 


1523 


82 


1162 


AAM42034 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6965. 


900 


99 


1162 


AAM40248 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3393. 


821 


100 


1162 


gi491284 


synthetic 
construct 


IFN-pseudo-omega 2 


800 


98 


1163 


AAI70234 


Homo sapiens 


AMGE- Human interleukin-1 


819 


100 
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aal 




receptor antagonist related protein 
cDNA. 






1163 


AAD11148_ 
aal 


Homo sapiens 


IMMV Human FIL-1 theta full- 
length DNA. 


819 


100 


1163 


AAA89175_ 
aal 


Homo sapiens 


HYSE- Human interleukin-1 Hy2 
cDNA. 


819 


100 


1164 


gi340215 


Homo sapiens 


Human vascular endothelial 
growth factor gene, exon 8. 


1056 


97 


1164 


gi340301 


Homo sapiens 


Human vascular permeability 
factor mRNA, complete cds. 


1056 


97 


1164 


AAR91077 


Homo sapiens 


PRIZ- Human vascular endothelial 
growth factor-189, VEGF-189. 


1056 


97 


1166 


gil321816 


Gorilla gorilla 


interleukin-8 receptor type B 


602 


90 


1166 


gil 109691 


Homo sapiens 


Human interleukin-8 receptor type 
B (IL8RB) mRNA, complete cds. 


599 


88 


1166 


gil 86378 


Homo sapiens 


Human interleuken 8 receptor B 
mRNA, complete cds. 


599 


88 


1167 


gil 160967 


Homo sapiens 


palmitoyl-protein thioesterase 
gene, complete cds. 


1285 


100 


1167 


gil3 14355 


Homo sapiens 


Human palmitoyl protein 
thioesterase mRNA, complete cds. 


1285 


100 


1167 


gil4250054 


Homo sapiens 


palmitoyl-protein thioesterase 1 
(ceroid-lipofuscinosis, neuronal 1, 
infantile), clone MGC: 14590 
IMAGE:4249991, mRNA, 
complete cds. 


1285 


100 


1168 


gil77814 


Homo sapiens 


Human alpha- 1-antitryps in-related 
protein gene, exons 3, 4 and 5. 


1956 


90 


1168 


AAP50132 


Homo sapiens 


ZYMO- Sequence of the 
predominant form of human alpha- 
1 -antitrypsin(AT) . 


1009 


72 


1168 


gil 5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A 
(alpha- 1 antiproteinase, 
antitrypsin), member 1, clone 
MGC:23330 IMAGE:4644658, 
mRNA, complete cds. 


1450 


69 


1169 


AA012931 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26823. 


346 


100 


1169 


AAO02697 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16589. 


143 


66 


1169 


AAO08307 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 22199. 


137 


80 


1170 


AAR15222 


Homo sapiens 


TEXA Chronic myelogenous 
leukaemia-derived myeloid-related 
protein. 


635 


100 


1170 


gil 8 1527 j 


Homo sapiens 


Human neutrophil peptide 
(defensin) 1 mRNA, complete cds. 


493 


100 


1170 


gil81529 


Homo sapiens 


Human defensin 1 protein mRNA, 
complete cds. 


493 


100 


1171 


gil001697 


Synechocystis 
sp. PCC 6803 


sensory transduction histidine 
kinase 


67 


37 


1171 


gi2353225 


Acromyrmex 
octospinosus 


cytochrome oxidase I 


54 


40 


1171 \ 


AAG02950 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7031. 


54 


32 


1172 


gi4884282 


Homo sapiens 


mRNA; cDNA DKFZp566K0524 


1998 


99 
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(from clone DKFZp566K0524); 
partial cds. 






1172 


gi2665458 


Mus musculus 


protein-tyrosine-phosphatase 


1363 


63 


1172 


gi452194 


Homo sapiens 


Human mRNA for protein tyrosine 
phosphatase (PTP-BAS, type 3), 
complete cds. 


618 


48 


1173 


AAY48226 


Homo sapiens 


META- Human prostate cancer- 
associated protein 12. 


956 


96 


1173 


AAM25683 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1198. 


956 


96 


1173 


AAY99342 


Homo sapiens 


GETH Human PRO1018 
(UNQ501) amino acid sequence 
SEQ ID NO: 8. 


950 


94 


1174 


gil78018 


Homo sapiens 


Human activation (Act-2) mRNA, 
complete cds. 


125 


84 


1174 


gi34218 


Homo sapiens 


Human LAG-1 mRNA. 


125 


84 


1174 


gi533213 


Homo sapiens 


secreted T cell protein (H400; SIS- 
gamma) mRNA, complete cds. 


125 


84 


1175 


gil78018 


Homo sapiens 


Human activation (Act-2) mRNA, 
complete cds. 


123 


92 


1175 


gi34218 


Homo sapiens 


Human LAG-1 mRNA. 


123 


92 


1175 


gi533213 


Homo sapiens 


secreted T cell protein (H400; SIS- 
gamma) mRNA, complete cds. 


123 


92 


1176 


AAG03315 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7396. 


314 


100 


1176 


gi965069 


Serratia 
marcescens 


phage-holin analog protein 


71 


40 


1176 


gil6415877 


Octopus salutii 


cytochrome oxidase subunit III 


74 


28 


1177 


gil78836 


Homo sapiens 


APOC2 gene, complete sequence; 
and apolipoprotein C-II (APOC2) 
gene, complete cds. 


453 


89 


1177 


gi296636 


Homo sapiens 


Human apoC-II gene for 
preproapolipoprotein C-II. 


453 


89 


1177 


gi757915 


Homo sapiens 


Human mRNA for lipoprotein 
apoCIL 


453 


89 


1178 


AAO07986 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 21878. 


77 


34 


1179 


AAB60502 


Homo sapiens 


INCY- Human cell cycle and 
proliferation protein CCYPR-50, 
SEQ IDNO:50. 


1205 


100 


1179 


AAB12144 


Homo sapiens 


PROT- Hydrophobic domain 
protein isolated from WERI-RB 
cells. 


1205 


100 


1179 


AAG81333 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 184. 


687 


99 


1180 


AAW67880 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 74 clone 
HNTAC73. 


378 


97 


1180 


gi9949887 


Pseudomonas 
aeruginosa 


probable transcriptional regulator 


65 


41 


1180 


gi3 130050 


Schizosaccharom 
yces pombe 


haloacid dehalogenase-like 
hydrolase 


62 


32 


1181 


AAG01183 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5264. 


278 


94 


1181 


AAO00621 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 14513. 


84 


47 
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1181 


AAO02188 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16080, 


75 


A C\ 

4U 


1182 


ABB12063 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2433. 


326 


100 


1182 


AAE06730 


Homo sapiens 


SMIK Human CASB765 protein. 


200 


1 r\r\ 

100 


1182 


AAW75098 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 42 clone 
HSXBI25. 


143 


81 


1183 


gil3278924 


Homo sapiens 


neural proliferation, differentiation 
and control, 1, clone MGC:4597 
IMAGE:3347743, mRNA, 
complete cds. 


748 


98 


1183 


gi8515886 


Homo sapiens 


NPDCl-like protein (NPDC1) 
mRNA, complete cds. 


748 


98 


1183 


AAB43904 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1349. 


748 


98 


1184 


gil3128925 


Homo sapiens 


ULBP2 protein mRNA, complete 
cds. 


1025 


C\f\ 

90 


1184 


gil4530663 


Homo sapiens 


TIXT A J? AT A XT -~ 1 1 

mRNA for ALCAN-alpna, 
complete cds. 


1025 


90 


1184 


AAY15238 


Homo sapiens 


IMMV ULBP-2 amino acid 
sequence. 


1025 


90 


1185 


gi4164134 


Homo sapiens 


cosmid clone U169D2 from 
Xp22. 1-22.2, complete sequence. 


7o 


DO 


1185 


AAU22866 


Homo sapiens 


HUMA- Human prostate cancer 

. * n TT-V \T *1 ft IT 

antigen, Seq ID No 385. 


60 


35 


1185 


AAM96178 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 4836. 


60 


35 


1186 


gi7770223 


Homo sapiens 


PR02714 


282 




1186 


ABB15615 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
4272. 


63 


48 


1186 


AAO07531 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 21 423. 


67 


46 


1187 


AAE05962 


Homo sapiens < 


HYSE- Human phospholipase- 
related protein. 


2521 


99 


1187 


gi38 11347 


Homo sapiens 


cytosolic phospholipase A2 beta 
(cPLA2 beta) mRNA, complete 
cds. 


1209 


44 


1187 


gi4886978 


Homo sapiens 


cytosolic phospholipase A2 beta 
mRNA, complete cds. 


1209 


44 


1188 


AAO01938 


Homo sapiens 


HYSE- Human polypeptide SbQ 
ID NO 15830. 


yj 


jO 


1188 


gi2992470 


Mus sp. 


mitochondrial capsule 
selenoprotein; MCS 




O 1 


1188 


gil4717800 


Mus musculus 


seleno-protein 


95 


31 


1189 


gil 86600 


Homo sapiens 


Human inter-alpha-trypsin 
inhibitor light chain (ITI) gene, 
exons 7-10. 


1461 


93 


1189 


gi24479 


Homo sapiens 


Human mRNA for alpha- 1- 
microglobulin and HI-30. 


1461 


93 


1189 


gi32047 


Homo sapiens 


Human mRNA for protein HC 
(alpha- 1 -microgl obulin) . 


1461 


93 


1190 


gi673422 


Homo sapiens 


H.sapiens mRNA fragment for T- 
cell receptor alpha chain. 


1322 


93 
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1190 


AAU69943 


Homo sapiens 


CORI- Human T cell receptor Va 
chain of clone 4E5 for prostate 
protein P501S. 


1097 


77 


1190 


AAM01298 


Homo sapiens 


CORI- P501S-specific T cell clone 
4E5 Va chain T cell receptor 
amino acid. 


1097 


77 


1191 


gi673422 


Homo sapiens 


H.sapiens mRNA fragment for T- 
cell receptor alpha chain. 


859 


95 


1191 


gi623119 


Macaca mulatta 


T-cell receptor alpha 


605 


86 


1191 


AAU69943 


Homo sapiens 


CORI- Human T cell receptor Va 
chain of clone 4E5 for prostate 
protein P501S. 


594 


65 


1192 


gil3097588 


Homo sapiens 


clone MGC: 10745 
IMAGE:2820343, mRNA, 
complete cds. 


201 


100 


1192 


gi53861 


Mus musculus 


Q300 protein (AA 1-77) 


72 


38 


1192 


AAO02105 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15997. 


50 


56 


1193 


AAB08894 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 SEQ 
IDNO:51. 


208 


57 


1193 


gil5088679 


Homo sapiens 


cysteine and tyrosine-rich protein 1 
(CYYR1) mRNA, complete cds. 


59 


28 


1193 


AAY87233 


Homo sapiens 


INCY- Human signal peptide 
containing protein HSPP-10 SEQ 
ID NO: 10. 


59 


28 


1194 


AAG03963 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8044. 


417 


80 


1194 


ABB 10168 


Homo sapiens 


HUMA- Human cDNA SEQ ID 
NO: 476. 


289 


100 


1194 


ABB 10412 


Homo sapiens 


HUMA- Human cDNA SEQ ID 
NO: 720. 


289 


100 


1195 


gi9758052 


Arabidopsis 
thaliana 




64 


46 


1195 


gi6815537 


Human 

immunodeficienc 
y virus type 1 


gag protein 


47 


62 


1195 


gil4269033 


Sus scrofa 


interferon beta- 1 


42 


47 


1196 


gi7582276 


Homo sapiens 


BM-002 


328 


100 


1196 


AAM78626 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1288. 


328 


100 


1196 


AAM79610 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3256. 


328 


100 


1197 


gill81885 


Mus musculus 


patched 


209 


62 


1197 


AAV21590_ 
aal 


Homo sapiens 


STRD Human patched (ptc) 
protein encoding cDNA. 


215 


63 


1197 


gil335864 


Homo sapiens 


Human patched homolog (PTC) 
mRNA, complete cds. 


214 


63 


1198 


AAB25674 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 10 SEQ 
IDNO:63. 


646 


84 


1198 


gi 14603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene, clone 
MGC: 19636 IMAGE:2822323, 
mRNA, complete cds. 


420 


94 


1198 


AAB36613 


Homo sapiens 


INCY- Human FLEXHT-35 
protein sequence SEQ ID NO: 35. 


420 


94 
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1200 


AAU12292 


Homo sapiens 


GETH Human PRO6027 
polypeptide sequence. 


990 


98 


1200 


AAU27673 


Homo sapiens 


ZYMO Human protein 
AFP235412. 


987 


99 


1200 


gil3477121 


Homo sapiens 


clone IMAGE: 3636082, mRNA, 
partial cds. 


291 


95 


1201 


AAB43928 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO:1373. 


216 


58 


1201 


gil3325337 


Homo sapiens 


clone MGC: 10520 
IMAGE:3938462, mRNA, 
complete cds. 


219 


A O 

48 


1201 


AAB21040 


Homo sapiens 


INCY- Human nucleic acid- 
binding protein, NuABP-44. 


219 


48 


1202 


AAB43928 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1373. 


223 


55 


1202 


gil 6550327 


Homo sapiens 


cDNA FLJ31005 lis, clone 
HLUNG2000068, weakly similar 
to ZINC FINGER PROTEIN 157. 


215 


67 


1202 


gil6552980 


Homo sapiens 


cDNA FLJ32768 lis, clone 

1 ha 1 12UUIo /y, weakly similar to 

r 7T"N.T/^ T?T7vT/'" , "E?'D T>T> HTCIM 1 KH 

ZilJNC rlJNijrJbK JrKUlrllfN ID/. 


215 


67 


1203 


gi432293o 


. 

Homo sapiens 


rlrK/OClv-llKe Kinase rivjriv 
mRNA, complete cds. 






1203 


gi4262362 


Caenorhabditis 
elegans 


alternatively spliced 
serine/threonine protein kinase 

MlLr- 1 J 


119 


81 


1203 


AAB50058 


Homo sapiens 


IMMV SS-4694. 


117 


81 


1204 


gil 754714 


Xenopus laevis 


oviductin 


378 


40 


1204 


gil 5277254 


Bufo japonicus 


oviductin 


net 


DO 


1204 


gi2981641 


Xenopus laevis 


polyprotein 


339 


46 


1205 


gil5214578 


Homo sapiens 


clone MGC: 9 135 
IMAGE:3865141, mRNA, 
complete cds. 


451 


76 


1205 


AAW67901 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 36 clone 
HODCL36. 


451 


76 


1205 


gi 1946205 


Homo sapiens 


H.sapiens mRNA tor CHD5 
protein. 


A A C 

44 D 


/D 


1206 


AAG01971 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6052. 


314 


100 


1206 


gi4200340 


Lycopersicon 
esculentum 


P69D protein 


oi 


A *3 

4J 


1206 


gi3 183989 


Lycopersicon 
esculentum 


P69E protein 


82 


43 


1207 


gil4043211 


Homo sapiens 


Similar to RIKEN cDNA 
4931428F04 gene, clone 
IMAGE:3346497, mRNA, partial 
cds. 


878 


83 


1207 


gi97 11595 


Xanthomonas 
oryzae pv. 
oryzae 


HpaA 


71 


24 


1207 


AAO10768 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 24660. 


72 


34 


1208 


AAY91512 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 62 SEQ 


606 


100 
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ID NO: 185. 






1208 


AAY91653 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 62 SEQ 
IDNO:326. 


606 


100 


1208 


AAY71277 


Homo sapiens 


ZYMO Human Zlipo3 protein. 


606 


100 


1209 


AAY27648 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 82, 


322 


98 


1209 


gi7959897 


Homo sapiens 


PR02379 


72 


39 


1209 


AAO03791 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17683. 


61 


68 


1210 


gil7431247 


Ralstonia 
solanacearum 


HYPOTHETICAL PROTEIN 


70 


38 


1211 


AAB08765 


Homo sapiens 


INCY- A human leukocyte and 
blood related protein (LBAP). 


339 


62 


1211 


AAB74718 


Homo sapiens 


INCY- Human membrane 
associated protein MEMAP-24. 


314 


66 


1211 


gi2587024 


Homo sapiens 


mRNA containing human 
endogenous retrovirus H and 
human endogenous retrovirus E 
sequences. 


259 


67 


1212 


gil0440139 


Homo sapiens 


cDNA: FLJ23447 fis, clone 
HSI03346. 


1339 


95 


1212 


AAY13396 


Homo sapiens 


GETH Amino acid sequence of 
protein PR0332. 


872 


48 


1212 


AAB33425 


Homo sapiens 


GETH Human PR0332 protein 
UNQ293 SEQ ID NO:57. 


872 


48 


1213 


AAG66547 


Homo sapiens 


HYSE- Human secreted 

metallocarboxypeptidase-like 

polypeptide. 


1551 


99 


1213 


AAG66565 


Homo sapiens 


HYSE- Human secreted 
metallocarboxypeptidase-like 
variant polypeptide. 


1548 


98 


1213 


AAB74682 


Homo sapiens 


INCY- Human protease and 
protease inhibitor PPIM-15. 


1482 


98 


1214 


gil5528833 


Homo sapiens 


Fc receptor-like protein 2 (FCRH2) 
mRNA, complete cds. 


528 


100 


1214 


AAB85464 


Homo sapiens 


HYSE- Human immunoglobulin 
domain-containing polypeptide. 


528 


100 


1214 


AAB82317 


Homo sapiens 


UYCO Human immunoglobulin 
receptor IRTA4 protein. 


528 


100 


1215 


AAU27663 


Homo sapiens 


ZYMO Human protein 
AFP285042. 


555 


100 


1215 


AAE06599 


Homo sapiens 


SAGA Human protein having 
hydrophobic domain, HP 10782. 


510 


100 


1215 


gil5558917 


Magnaporthe 
grisea 


similarity to Lec35 protein 


169 


30 


1216 


gil0439008 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


682 


99 


1216 


AAM87876 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15469. 


169 


70 


1216 


gil616747 


Tupaia glis 
belangeri 


GnRH preprohormone second 
form 


68 


33 


1217 


gi 10439008 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


529 


92 


1217 


AAM87876 


Homo sapiens 


HUMA- Human 


109 


71 
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immune/haematopoietic antigen 
SEQ ID NO: 15469. 






1217 


AAM87620 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQIDNO:15213. 


70 


35 


1218 


AAM6095 1 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 33056. 


58 


36 


1218 


AAM73644 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 33950. 


58 


36 


1 n 1 o 

1218 


AAO00109 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 14001. 


58 


45 


1219 


gil6950537 


Hop mosaic 
virus 


cysteine-rich nucleic acid binding 
protein 


41 


47 


1219 


AAY19474 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


43 


43 


1219 


AAO03071 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16963. 


45 


31 


1220 


gil 171589 


Plasmodium 
falciparum 


frameshift 


76 


37 


1220 


gi4512010 


Escherichia coli 


OrfY 


66 


50 


1220 


gil 870470 


Mus musculus 


anti-DNA immunoglobulin light 
chain IgG 


46 


37 


1221 


gil 3274524 


Homo sapiens 


complement-clq tumor necrosis 
factor-related protein (CTRP7) 
mRNA, complete cds. 


1451 


94 


1221 


AAB50371 


Homo sapiens 


ZYMO Human ZACRP7. 


1451 


94 


1221 


gil 32745 18 


Homo sapiens 


complement-clq tumor necrosis 
factor-related protein (CTRP2) 
mRNA, complete cds. 


831 


61 


1222 


AAO03899 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17791. 


67 


50 


1222 


AAG73465 


Homo sapiens 


HUMA- Human gene 12-encoded 
secreted protein fragment, SEQ ID 
NO:240. 


75 


31 


1222 


AAM85406 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 12999. 


60 


40 


1223 


gi8850245 


Homo sapiens 


activated p21cdc42Hs kinase 
(ACK1) mRNA, complete cds. 


5605 


100 


1223 


gi2921447 


Mus musculus 



non-receptor protein tyrosine 
kinase Ack 


4238 


79 




gl/U / WOO 


Bos taurus 


Cdc42-associated tyrosine kinase 
ACK-2 


3493 


90 


1224 


AArSo4oyo 


Homo sapiens 


ZYMO Amino acid sequence of a 
human zkunlO polypeptide. 


358 


35 


1224 


gl211622 


Gallus gallus 


alpha-3 collagen type VI 


308 


33 




.rt-TYi VJ.*t £t \JQ-7 


nomo sapiens 


Jri i ori- riuman polypeptide oxiv^ 
ID NO 7020. 


z/o 


5Z 


1225 


AAB66065 


Homo sapiens 


MILL- Human TANGO 294. 


2113 


99 


1225 


AAB66067 


Homo sapiens 


MILL- Human TANGO 294 
mature protein. 


2015 


99 


1225 


gi434306 


Homo sapiens 


H.sapiens mRNA for lysosomal 
acid lipase. 


1290 


60 


1226 


AAM06483 


Homo sapiens 


HYSE- Human foetal protein, SEQ 


282 


98 
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ID NO: 214. 






1226 


gi3694984 


Pimpinella 
brachycarpa 


metallothionein-l-like protein 


57 


30 


1226 


AAU22415 


Homo sapiens 


HUMA- Human cardiovascular 
system antigen polypeptide SEQ 
ID No 1189. 


56 


27 


1227 


gil5029634 


Homo sapiens 


Similar to tetranectin 
(plasminogen-binding protein), 
clone MGC: 13592 
IMAGE:4042921, mRNA, 
complete cds. 


725 


100 


1227 


gi37409 


Homo sapiens 


H. sapiens mRNA for tetranectin. 


725 


100 


1227 


gi825722 


Homo sapiens 


H.sapiens tetranectin gene, exon 1 . 


725 


100 


1228 


gi5790207 


Taenia saginata 


ATPase subunit 6 


—IC\ 

70 


32 


1228 


gi3927873 


Cyanidioschyzon 
merolae 


NADH-ubiquinone oxidoreductase 
chain 3 


44 


19 


1228 


gi4378776 


Pedinomonas 
minor 


*V T A TAT T lit "1 * i 

NADH dehydrogenase subunit 3 


47 


30 


1229 


AAE01790 


Homo sapiens 


HUMA- Human gene 21 encoded 
secreted protein HDPTW65, SEQ 
ID NO: 111. 


142 


59 


1229 


AAE01838 


Homo sapiens 


HUMA- Human gene 21 encoded 
secreted protein HDPTW65, SEQ 
ID NO: 159. 


140 


57 


1229 


ABB 11479 


Homo sapiens 


HYSE- Human reverse 
transcriptase homologue, SEQ ID 

XT/"*. 1 O A C\ 


92 


55 


1230 


AAE04775 


Homo sapiens 


INCY- Human vesicle trafficking 
protein- 1 8 ( VE I RP- 1 8) protein. 


327 


100 


1230 


gill 120502 


Homo sapiens 


ERGL mRNA, complete cds. 


327 


100 


1230 


AAW88699 


Homo sapiens 


HUMA- Secreted protein encoded 
by gene 166 clone HCEQA68. 


333 


100 


1231 


AAG00381 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4462. 


266 


91 


1231 


AAU19357 


Homo sapiens 


PHAA Human G protein-coupled 
receptor nGPCR-2290. 


125 


50 


1231 


AAO09238 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23130. 


109 


75 


1232 


A A "\ KC\/Z C C O 

AAM06558 


Homo sapiens 


t ta. rffn tt — „ j i i * or?A 

HYSE- Human foetal protein, SEQ 
ID NO: 289. 


301 


9c 


1232 


gi2648055 


Xenopus laevis 


corticotropin releasing factor 
receptor, type 2 


48 


29 


1232 


AAU21458 


Homo sapiens 


HUMA- Human novel foetal 
antigen, SEQ ID NO 1702. 


45 


36 


1233 


AAM06562 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 293. 


383 


100 


1233 


AAG72602 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, SEQ 
TH >JO* 99P.^ 

XXJ 1NVJ, ZZOJ, 


65 


42 


1233 


gi7547265 


Canis familiaris 


5-Hydroxytryptamine ID receptor 
subtype beta; 5-HT1D beta 


67 


39 


1234 


AAM92283 


Homo sapiens 


HUMA- Human digestive system 
antigen SEQ ID NO: 1632. 


76 


36 


1234 


AAO09955 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23847. 


70 


35 


1234 


gi8778849 


Arabidopsis 


T7N9.1 


69 


42 
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thaliana 








1235 


AAM63797 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35902. 


48 


34 


1235 


AAM76610 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 36916. 


48 


34 


1235 


ABB12222 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2592. 


52 


42 


1236 


gil60822 


Sarcocystis 
muris 


31-kDa major surface antigen 


55 


37 


1236 


gi5305067 


Mus musculus 


immunoglobulin light chain 
variable region 


61 


34 


1236 


AAM60441 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32546. 


61 


40 


1237 


gi4929790 


Homo sapiens 


angiopoietin-related protein 3 
(ANGPTL3) mRNA, complete 
cds. 


1489 


98 


1237 


AAY05395 


Homo sapiens 


GETH Human TIE ligand NL6 
protein sequence. 


1489 


98 


1237 


AAB12135 


Homo sapiens 


PROT- Hydrophobic domain 
protein from clone HP 10622 
isolated from Liver cells. 


1489 


98 


1238 


AAM06568 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 299. 


142 


57 


1238 


gil334599 


Magnaporthe 
grisea 


ubiquinol: cytochrome c 
oxireductase 


48 


42 


1238 


gil3487283 


Mycosphaerella 
fijiensis 


cytochrome b 


46 


42 


1239 


gi 15930235 


Homo sapiens 


clone IMAGE: 3 846772, mRNA, 
partial cds. 


46 


40 


1239 


gil334235 


Rattus rattus 


MIP protein (261 AA; AA 3 - 263) 


65 


45 


1239 


gill85419 


Mus musculus 


major intrinsic protein 


65 


45 


1240 


AAM60668 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32773. 


62 


31 


1240 


AAM73340 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 33646. 


62 


31 


1240 


gi975678 


Albinaria 
caerulea 


start codon=CAT, termination 
codon=TAA 


65 


27 


1241 


AAG03454 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7535. 


40 


26 


1241 


gil245208 


Danio rerio 


Zgl3 


57 


47 


1241 


AAY19486 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


33 


41 


1242 


gi3 157920 


Arabidopsis 
thaliana 


F12F1.6 


46 


37 


1242 


AAY48414 


Homo sapiens 


META- Human prostate cancer- 
associated protein 111. 


36 


46 


1242 


gi927722 


Saccharomyces 
cerevisiae 


Erdlp; YDR414C; CAI: 0.11 


61 


44 


1243 


gi9657469 


Vibrio cholerae 


soxR protein 


39 


46 


1243 


gi3493510 


Mus musculus 


Ymp 


47 


43 


1243 


gi2358254 [ 


Mus musculus 


HNMP-1 


47 


43 
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1244 


AA012129 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26021. 


64 


90 


1244 


gi3874749 


Caenorhabditis 
elegans 


C34E7.3 


50 


56 


1244 


AA012895 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26787. 


64 


44 


1245 


AAY99386 


Homo sapiens 


GETH Human PRO1305 
(UNQ671) amino acid sequence 
SEQ IDNO:153. 


71 


39 


1245 


AAO02040 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 15932. 


63 


33 


1245 


gi 15073483 


Sinorhizobium 
meliloti 


PUTATIVE SENSORY 
TRANSDUCTION HISTIDINE 
KINASE TRANSMEMBRANE 
PROTEIN 


77 


39 


1246 


AAG75420 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:6l84. 


49 


40 


1246 


gi4099021 


Helicobacter 
pylori 


amino acid permease 


47 


39 


1246 


gi23 14328 


Helicobacter 
pylori 26695 


glutamine ABC transporter, 
permease protein (glnP) 


47 


39 


1248 


gi2959352 


Brugia pahangi 


cuticle collagen 2(F) 


68 


37 


1248 


AAO03627 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 17519. 


59 


40 


1248 


gil3959819 


Myxococcus 
xanthus 


USC6-lp 


67 


43 


1249 


gi8249629 


Homo sapiens 


partial IGHV gene for 
immunoglobulin heavy chain 
variable region, clone B3 1 . 


62 


44 


1249 


gi6646882 


Paragonimus 
westermani 


NADH dehydrogenase subunit l 


63 


40 


1249 


AAR39641 


Homo sapiens 


CIB A Transforming Growth 
Factor-betal(44/45)beta2 hybrid. 


44 


37 


1250 


AAU04613 


Homo sapiens 


UNIW Gonadotropin analogue, 
beta subunit. 


40 


58 


1250 


gi3242155 


Drosophila 
melanogaster 


l53C9.b 


60 


43 


1250 


AAM63639 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35744. 


59 


42 


1251 


AA011677 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 25569. 


74 


54 ■ 


1251 


gi77 16782 


Drosophila 
simulans 


helicase pitchoune 


77 


44 


1251 


gi3342758 


Drosophila 
melanogaster 


helicase pitchoune 


77 


44 


1252 


gi482846 


Torgos 
tracheliotus 


cytochrome b 


51 


40 


1252 


'TIT) >~l 

gi22737 


Hordeum 
vulgare 


beta-hordothionin 


42 


38 


1252 


AAM79945 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 359 1. 


45 


40 


1253 


gi424891 


Human 

immunodeficienc 
y virus type 1 


envelope glycoprotein 


37 


33 


1253 


gi9654985 


Vibrio cholerae 


glutamate-cysteine ligase 


62 


28 
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Hit ID 
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Description 


S score 


% Identity 


1254 


gi3 805960 


Populus 
balsamifera 
subsp. 
trichocarpa 


laccase 


4/ 


79 
/Z 


1254 


gil5074016 


Sinorhizobium 
meliloti 


PUTATIVE TRANSCRIPTION 
REGULATOR PRO 1 blJN 


D / 




1254 


gi 12652993 


Homo sapiens 


clone IMAGE: 3357862, mRNA, 
partial cds. 


56 


47 


1255 


gil655739 


Peromyscus 
maniculatus 


NADH dehydrogenase subunit 4 


44 


24 


1255 


gil6551105 


Crotalus 
adamanteus 


NADH dehydrogenase subunit 5 


66 


28 


1255 


gil6551107 


Crotalus atrox 


NADH dehydrogenase subunit 5 


CO 


9S 
Zo 


1256 


AAO06799 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 20691. 


43 


17 


1256 


AAO00659 


Homo sapiens 


HYSE- Human polypeptide SEQ 

TT*V VTA 1 A C C 1 

ID NO 14551. 


A 1 

41 


^9 


1256 


AAB51937 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene y obv^ 
ID NO:69. 


37 


60 


1257 


gi6449037 


Mus musculus 


platelet glycoprotein V 


738 


38 


1257 


gi2104856 


Rattus 
norvegicus 


platelet glycoprotein V 




17 


1257 


gi2104845 


Mus musculus 


platelet glycoprotein V 


722 


37 


1258 


AAO 11326 


Homo sapiens 


HYSE- Human polypeptide bbQ 
ID NO 25218. 


01 




1258 


gi7576909 


Danio rerio 


her7-protein 


04 


17 


1258 


AAG81428 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 3 74. 


47 


38 


1259 


gil8349 


Daucus carota 


glycine rich protein (AA 1 - 96) 


65 


45 


1259 


gi336034 


Vesicular 
stomatitis virus 


M-protein 


/O 


zo 


1259 


gi335876 


Vesicular 
stomatitis virus 


matrix (M) protein 


70 


ZD 


1260 


AAO09307 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23199. 


A 1 

41 


17 


1260 


gil5042581 


Echinococcus 
granulosus 


NADH dehydrogenase subunit 2 


59 


A 1 
41 


1261 


gil0436454 


Homo sapiens 


cDNA FLJ14082 fis, clone 
HEMBB 1002300. 


983 


99 


1261 


AAB95686 


Homo sapiens 


HELL- Human protein sequence 
SEQ ID NO: 18490. 


yoj 


QQ 

yy 


1261 


AAY20668 


Homo sapiens 


UYRO- Human neurofilament-M 
wild type protein fragment 10. 


44 


50 


1263 


gi965014 


Mus musculus 


ADAM 4 protein precursor 


1303 


51 j 


1263 


gil061159 


Macaca 
fascicularis 


testicular Metalloprotease-like, 
Disintegrin-like, Cysteine-rich 
protein IVa 


1277 


z>y 


1263 


gilOollol 


Macaca 
fascicularis 


Disintegrin-like, Cysteine-rich 
protein IVb 


1249 


38 


1264 


AAM79049 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1711. 


1895 


98 


1264 


AAM80033 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3679. 


1895 


98 


1264 


AAM53458 


Homo sapiens 


MOLE- Human brain expressed 


1074 


100 
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SEQ ID 


Hit ID 


Species 


Description 


S score 


/o i a entity 








single exon probe encoded protein 
SEO ID NO: 25563. 






1265 


AAB44605 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 10 SEQ 
IDNO:70. 


yo 


/u 


1265 


AAG71789 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1470. 


42 


63 


1265 


AAG72517 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, SEQ 
ID NO: 2198. 


42 


63 


1266 


gil4714741 


Homo sapiens 


clusterin (complement lysis 
inhibitor, SP-40,40, sulfated 
glycoprotein 2, testosterone- 
repressed prostate message 2, 
apolipoprotein J), clone 
MGC: 18080 IMAGE :4 15 0452, 
mRNA, complete cds. 


1629 


99 


1266 


gi292843 


Homo sapiens 


Human TRPM-2 protein gene, 
exons 7,8,9 and complete cds. 


lozy 


OQ 

yy 


1266 


gi30251 


Homo sapiens 


Human SP-40,40 mRNA for 
complement- associated protein SP- 
40,40 alpha-1 and beta-1 chain. 


1 /Ton 

1629 


CiQ 

yy 


1267 


gi 11493504 


Homo sapiens 


PRO0309 


1 1 yz 


yo 


1267 


gi412723 


synthetic 
construct 


synthetic antithrombin III 


1192 


98 


1267 


gi583741 


synthetic 
construct 


Antithrombin III 


1192 


98 


1268 


gil 1493504 


Homo sapiens 


PRO0309 


1439 


98 


1268 


gi412723 


synthetic 
construct 


,i • • i * j t "f * TTT 

synthetic antithrombin III 


1439 


QQ 

yo 


1268 


gi583741 


synthetic 
construct 


Antithrombin III 


1439 


98 


1269 


gi203710 


Rattus 
norvegicus 


cytochrome c oxidase subunit VIc 


250 




1269 


gil200057 


Homo sapiens 


Human mRNA for cytochrome c 
oxidase subunit VIc. 


229 


61 


1269 


gil2652867 


Homo sapiens 


cytochrome c oxidase subunit VIc, 
clone MGC:1520 
IMAGE:3350637, mRNA, 
complete cds. 


229 


61 


1270 


AAM96033 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 4691. 


465 


98 


1270 


AAU18881 


Homo sapiens 


HUMA- Novel prostate gland 
antigen, Seq ID No 180. 




no 

yo 


1270 


gi9622236 


Homo sapiens 


cadherin-like protein VR20 
mRNA, partial cds. 


272 


1UU 


1271 


gi552137 


Drosophila 
melanogaster 


tropomyosin isoform 9E 


118 


27 


1271 


gii jooyj 


Drosophila 
melanogaster 


tropomyosin isoform 9A 


118 


27 


1271 


gil58696 


Drosophila 
melanogaster 


tropomyosin isoform 9D 


118 


27 


1272 


gi6689249 


Streptococcus 

dysgalactiae 

subsp. 

dysgalactiae 


M-like protein 


122 


24 
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SEQ ID 


Hit ID 


Species 


Description" 


S score 


% Identity 


1272 


gi6692690 


Arabidopsis 
thaliana 


F12K11.14 


114 


28 


1272 


gi4778 


Saccharomyces 
cerevisiae 


Usol protein 


122 


29 


1273 


gil3097573 


Homo sapiens 


Similar to thiosulfate 
sulfurtransferase (rhodanese), 
clone MGC: 10492 
IMAGE:361 1253, mRNA, 
complete cds. 


1469 


94 


1273 


gil6876913 


Homo sapiens 


mercaptopyruvate 
sulfurtransferase, clone 
MGC:24539 IMAGE :4 105 5 09, 
mRNA, complete cds. 


1469 


94 


1273 


gil75 11726 


Homo sapiens 


mercaptopyruvate 
sulfurtransferase, clone 
MGC:31798 IMAGE:4131927, 
mRNA, complete cds. 


1469 


94 


1274 


AAB85039 


Homo sapiens 


CURA- Human SER5 protein 
sequence. 


767 


48 


1274 


gi6137097 


Homo sapiens, 


serine protease DESC1 (DESC1) 
mRNA, complete cds. 


749 


48 


1274 


AAY99414 


Homo sapiens 


GETH Human PR01461 
(UNQ742) amino acid sequence 
SEQ ID NO:269. 


749 


48 


1275 


gil2584839 


Homo sapiens 


HT036-ISO (HT036-ISO) mRNA, 
complete cds. 


997 


94 


1275 


gil2584841 


Homo sapiens 


HT036 (HT036) mRNA, complete 
cds. 


820 


93 


1275 


gi 17427028 


Ralstonia 
solanacearum 


CONSERVED HYPOTHETICAL 
PROTEIN 


502 


42 


1276 


gi3 10691 


Simian virus 40 


small T antigen 


48 


47 


1276 


gi8886685 


Centris inermis 


cytochrome b 


53 


40 


1276 


gi625084 


Oncorhynchus 
tshawytscha 


heat-shock protein 30 


37 


44 


1277 


gi7106820 


Homo sapiens 


HSPC215 


261 


100 


1277 


AAU16225 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1178. 


261 


100 


1277 


AAG81441 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:400. 


261 


100 


1278 


AAM25840 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1355. 


208 


88 


1278 


AAM74914 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 35220. 


63 


68 


1278 


AAM06639 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 370. 


36 


70 


1279 


gi7023943 


Homo sapiens 


GC36 mRNA, complete cds. 


360 


35 


1279 


gi2502077 


Homo sapiens 


digestive tract-specific calpain 
(nCL-4) mRNA, complete cds. 


360 


35 


1279 


gi2358262 


Rattus 
norvegicus 


calpain large subunit 


351 


35 


1280 


gi4153951 


Homo sapiens 


H.sapiens gene from PACs 295C6 
and313L4. 


259 


37 


1280 


AAY32437 


Homo sapiens 


TEXA Absorptive hypercalciuria 
associated gene protein prodict. 


259 


37 


1280 


gil5383934 


Homo sapiens 


testicular soluble adenylyl cyclase 
mRNA, complete cds. 


259 


37 
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1281 


AAM89651 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 17244. 


66 


31 


1281 


gi408591 


Influenza A virus 


nonstructural protein 


62 


28 


1281 


AAM82524 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 101 17. 


62 


55 


1282 


gi4079820 


Mus musculus 


HERC2 


67 


40 


1282 


gi459017 


Allomyces 
macrogynus 


subunit 6 of the ATPase complex 


71 


44 


1282 


gil236414 


Allomyces 
macrogynus 


H(+)-transporting ATPase, F0 
subunit 6 


71 


44 


1283 


AAM63001 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35106. 


153 


67 


1283 


AAM75812 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 36118. 


153 


67 


1283 


AAE10197 


Homo sapiens 


HYSE- Human bone marrow 
derived peptide, SEQ ID NO: 41. 


60 


36 


1284 


AAG81367 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:252. 


816 


98 


1284 


gi7582286 


Homo sapiens 


BM-007 


530 


98 


1284 


AAG02907 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6988. 


408 


95 


1285 


AAG81367 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO:252. 


906 


98 


1285 


gi7582286 


Homo sapiens 


BM-007 


538 


99 


1285 


AAG02907 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6988. 


.416 


96 


1286 


AAW49716 


Homo sapiens 


PROT- Protein polymer adhesive 
substrate PPAS1-C monomer. 


54 


31 


1286 


AAW49721 


Homo sapiens 


PROT- Protein polymer adhesive 
substrate PPAS1-D monomer. 


54 


31 


1286 


gi683735 


Macaca 
fascicularis 


endothelin 3 


50 


62 


1287 


gi5689766 


Homo sapiens 


mRNA for zinc finger 2 (ZNF2 
gene). 


2092 


99 


1287 


gil4602980 


Homo sapiens 


clone MGC: 16594 
IMAGE:41 10322, mRNA, 
complete cds. 


1609 


100 


1287 


gi 13477207 


Homo sapiens 


clone MGC: 12980 
IMAGE:3350363, mRNA, 
complete cds. 


1604 


99 


1288 


AAB44228 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1673. 


284 


86 


1288 


AAM90208 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 17801. 


43 


50 


1288 


gi733438 


Cepaea 
nemoralis 


NADH dehydrogenase subunit 4L 


62 


31 


1289 


gi 10764264 


synthetic 
construct 


mutated NS5A 


67 


30 


1289 


AAO02625 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16517. 


58 


26 
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1289 


gil 0644 188 


Hepatitis C virus 
type la 


polyprotein 


67 


o a. 


1290 


AAG03150 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7231. 


30 / 


QQ 

yo 


1290 


AAW48931 


Homo sapiens 


CEDA- Schwannomin-binding 
protein C-terminal fragment. 


ZOO 


i f\r\ 
1UU 


1290 


AAO04324 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 18216. 


63 


33 


1291 


AAB60098 


Homo sapiens 


INCY- Human transport protein 
TPPT-18. 


1822 


92 


1291 


gil 537070 


Rattus 
norvegicus 


nucleoporin p54 


1767 


92 


1291 


gil5214835 


Homo sapiens 


clone MGC: 13407 
IMAGE:3931652, mRNA, 
complete cds. 


1822 


92 


1292 


AAY94621 


Homo sapiens 


MILL- Epidermal growth factor- 
like variant in skin-2 amino acid 
sequence. 


385 


100 


1292 


AAE06697 


Homo sapiens 


HYSE- Human TGF alpha-like 
protein. 


385 


100 


1292 


AAE06698 


Homo sapiens 


HYSE- Human TGF alpha-like 
splice variant protein. 


385 


100 


1293 


AAW78245 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by gene 
19. 


1018 


98 


1293 


ABB 11835 


Homo sapiens 


HYSE- Human secreted protein 
homologue, SEQ ID NO:2205. 


1018 


98 


1293 


AAM79352 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 2998. 


1018 


98 


1294 


AAM99920 


Homo sapiens 


HUMA- Human polypeptide SEQ 
ID NO 36. 


667 


97 


1294 


gil6552010 


Homo sapiens 


cDNA FLJ32009 fis, clone 
NT2RP7009498, weakly similar to 
FIBULIN-1, ISOFORM A 
PRECURSOR. 


667 


97 


1294 


AAM99933 


Homo sapiens 


HUMA- Human polypeptide SEQ 
ID NO 49. 


627 




1295 


gi2598167 


Homo sapiens 


zinc finger protein (HZF6) mRNA, 
5' UTR and partial cds. 


2772 


99 


1295 


gi5640019 


Mus musculus 


zinc linger protein ZFP235 


1565 


68 


1295 


gil 184371 


Mus musculus 


zinc finger protein; Method: 
conceptual translation supplied by 
author 


1278 


55 


1296 


gil5679947 


Homo sapiens 


endothelial zinc finger protein 
induced by tumor necrosis factor 
alpha, clone MGC: 11153 
IMAGE:3840512, mRNA, 
complete cds. 


2734 


100 


1296 


gi9502202 


Homo sapiens 


endothelial zinc finger protein 
induced by tumor necrosis factor 
alpha (EZFIT) mRNA, complete 
cds. 


2734 


100 


1296 


gil0437767 


Homo sapiens 


cDNA: FLJ21628 fis, clone 
COL08076. 


1713 


77 


1297 


AAM56742 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 28847. 


99 


55 
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Description 
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1297 


AAO09197 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23089. 


66 


45 


1297 


gil2543402 


Corynebacterium 
glutamicum 


FRXA00675 


79 


26 


1298 


AAM79176 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1838. 


601 


100 


1298 


ABB 11626 


Homo sapiens 


HYSE- Human Fas-associated 
phosphatase homologue, SEQ ID 
NO: 1996. 


559 


94 


1298 


AAM80160 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3806. 


559 


94 


1299 


gil2698338 


Homo sapiens 


matrix metalloproteinase-28 
precursor, mRNA, complete cds. 


2424 


96 


1299 


gil2698852 


Homo sapiens 


matrix metalloprotease MMP25 
mRNA, complete cds. 


2424 


96 


1299 


AAU12243 


Homo sapiens 


GETH Human PR04339 
polypeptide sequence. 


2424 


96 


1300 


gil4210477 


Homo sapiens 


interleukin 18 precursor, mRNA, 
complete cds. 


138 


92 


1300 


AAW31757 


Homo sapiens 


INCY- Interferon gamma inducing 
factor-2 (IGIF-2) R140I variant. 


138 


92 


1300 


gil0799833 


Ovis aries 


interleukin- 18 (IGIF) 


122 


78 


1301 


AAE05302 


Homo sapiens 


MILL- Human TANGO 457 
protein. 


623 


97 


1301 


AAE05303 


Homo sapiens 


MILL- Human mature TANGO 
457 protein. 


611 


100 


1301 


AAE05305 


Homo sapiens 


MILL- Human TANGO 457 
protein cytoplasmic domain. 


605 


100 


1302 


AAM55396 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 27501. 


64 


38 


1302 


AAM57742 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 29847. 


64 


38 


1302 


AAM67792 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 28098. 


64 


38 


1303 


AAM88370 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15963. 


38 

* 


35 


1303 


gi7330034 


Macaca mulatta 
rhadinovirus 26- 
95 


helicase-primase 


56 


30 


1303 


gi4494949 


Macaca mulatta 

rhadinovirus 

17577 


helicase/primase 


56 


30 


1304 


gi 190870 


Homo sapiens 


Human retinoic acid receptor 
gamma 2 mRNA, 5' end. 


274 


100 


1304 


gi297146 


Homo sapiens 


H.sapiens gene for retinoic acid 
receptor gamma-2. 


274 


100 


1304 


gi200660 


Mus musculus 


retinoic acid receptor gamma 2 


252 


92 


1305 


AAM39737 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2882. 


992 


99 


1305 


AAM39736 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2881. 


875 


100 


1305 


AAM41522 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6453. 


875 


100 
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1306 


ABB17891 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
6548. 


54 


38 


1306 


AAM88996 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 16589. 


57 


AC\ 

4U 


1306 


AAM65093 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 37198. 


55 


50 


1307 


A AY 19551 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


133 


AO 
4Z 


1307 


AAY75972 


Homo sapiens 


GENE- Human skin cell protein, 
SEQ ID 150. 


1 

133 


4Z 


1307 


AAB55911 


Homo sapiens 


GENE- Skin cell protein, SEQ ID 
NO: 150. 


133 


AO 
4Z 


1308 


AAU27671 


Homo sapiens 


ZYMO Human protein 
AFP355471. 


486 


100 


1308 


AA012566 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 26458. 


61 


28 


1308 


gi 167933 


Dictyostelium 
discoideum 


thioredoxin 


67 


34 


1309 


AAR15222 


Homo sapiens 


TEXA Chronic myelogenous 
leukaemia-derived myeloid-related 
protein. 


620 


100 


1309 


gil81527 


Homo sapiens 


Human neutrophil peptide 
(defensin) 1 mRNA, complete cds. 


493 


100 


1309 


gil81529 


Homo sapiens 


Human defensin 1 protein mRNA, 
complete cds. 


493 


1 A A 
1UU 


1310 


gi29 11559 


Human 

papillomavirus 
type 77 


E6 protein 


66 


27 


1310 


gi9800324 


rat 

cytomegalovirus 
Maastricht 


prl09 


62 


25 


1310 


gi397007 


Human 

papillomavirus 
type 3 


envelope protein 


60 


"5 1 

31 


1311 


AAU19632 


Homo sapiens 


TTT TH r A T T 1 

HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 282. 


one 

205 


AC\ 

4U 


1311 


gi3 127926 


Homo sapiens 


H.sapiens KIN A tor type VI 
collagen alpha3 chain. 


loo 


AO 


1311 


gi57960 


Mus musculus 


collagen alpha 3 chain type VI 


I/O 


AO 


1312 


gi 16508 176 


Homo sapiens 


small Cj l r-Dinamg tumor 
suppressor 1 mRNA, complete cds. 


1U 1Z 


i no. 

L\J\J 


1312 


gil 6555334 


Homo sapiens 


Rig protein mRNA, complete cds. 


1 C\ 1 o 
1U 1/ 




1312 


gil6508174 


Mus musculus 


small GTP-binding tumor 
suppressor 1 


963 


93 


1313 


AAG73984 


Homo sapiens 


T TT T"» r A T T 1 

HUMA- Human colon cancer 


55 


A A 

40 


1313 


gi3041771 


Homo sapiens 


mRNA for perilipin, complete cds. 


83 


32 


1313 


AAY22157 


Homo sapiens 


ABBO Human BS135 protein 
sequence. 


83 


32 


1314 


AAM71801 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32107. 


1872 


100 


1314 


gi!6549907 


Homo sapiens 


cDNA FLJ30663 fis, clone 


1203 


58 
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FCBBF1 000598, moderately 
similar to ZINC FINGER 
PROTEIN 84. 






1314 


AAM78565 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1227. 


1151 


55 


1315 


gi32472 


Homo sapiens 


H.sapiens mRNA for high-sulphur 
keratin. 


785 


76 


1315 


gi3228239 


Homo sapiens 


UHS KerA gene. 


774 


76 


1315 


gi34079 


Homo sapiens 


Human gene for ultra high-sulphur 
keratin protein. 


774 


76 


1316 


gil2655446 


Homo sapiens 


mRNA for keratin associated 
protein 4.4 (KRTAP4.4 gene). 


755 


80 


1316 


gil2655460 


Homo sapiens 


mRNA for keratin associated 
protein 4.12 (KRTAP4.12 gene). 


726 


75 


1316 


gil3278825 


Homo sapiens 


Similar to RIKEN cDNA 
1110054P19gene, clone 
MGC:2782 IMAGE:2959821, 
mRNA, complete cds. 


726 


75 


1317 


gil2655462 


Homo sapiens 


mRNA for keratin associated 
protein 4.14 (KRTAP4.14 gene). 


1102 


88 


1317 


gil2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


1081 


84 


1317 


gil2655460 


Homo sapiens 


mRNA for keratin associated 
protein 4.12 (KRTAP4.12 gene). 


997 


79 


1318 


AAM79404 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3050. 


850 


74 


1318 


AAM39466 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2611. 


844 


77 


1318 


AAM41252 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6183. 


836 


77 


1319 


gil6552172 


Homo sapiens 


cDNA FLJ32133 fis, clone 
PEBLM2000308, moderately 
similar to ZINC FINGER 
PROTEIN 135. 


800 


65 


1319 


gi6467200 


Homo sapiens 


GIOT-1 mRNA for gonadotropin 
inducible transcription repressor- 1, 
partial cds. 


775 


60 


1319 


gi498721 


Homo sapiens 


H.sapiens HZF10 mRNA for zinc 
finger protein. 


770 


63 


1320 


gi3036963 


Ciona savignyi 


CsCDC42 


163 


60 


1320 


gil 5072535 


Schizophyllum 
commune 


small GTPase CDC42 


162 


60 


1320 


gi520533 


Drosophila 
melanogaster 


Dcdc42 


161 


60 


1321 


AAE02058 


Homo sapiens 


HUMA- Human four disulfide core 
domain (FDCD)-containing 
protein. 


517 


43 


1321 


gil2655452 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4.7 gene). 


509 


44 


1321 


gi200964 


Mus museums 


serine 2 ultra high sulfur protein 


494 


42 


1322 


ABB 12490 


Homo sapiens 


HYSE- Human bone marrow 
expressed protein SEQ ID NO: 
329. 


169 


72 


1322 


gil4647047 


Puntius titteya 


ATP synthase 8 


56 


37 


1322 


gil4646929 


Barbus 

cyclolepis 

cyclolepis 


ATP synthase 8 


54 


44 
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1323 


gi5921470 


Homo sapiens 


mRNA for G8 protein (G8 gene, 
located in the class III region of the 
major histocompatibility complex). 


405 


89 


1323 


gi5921473 


Homo sapiens 


mRNA for G8 protein (G8 gene, 
located in the class III region of the 
major histocompatibility complex), 
alternative splice variant lacking 
exon 2. 


381 


92 


1323 


AAM39144 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2289. 


381 


92 


1324 


AAM87150 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 14743. 


74 


38 


1324 


AAB59115 


Homo sapiens 


HUMA- Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 823. 


62 


36 


1324 


gil5158712 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_L_725p 


46 


52 


1325 


AAY48404 


Homo sapiens 


META- Human prostate cancer- 
associated protein 101. 


55 


50 


1325 


AAM59935 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 32040. 


55 


3o 


1325 


AAM72530 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32836. 


55 


38 


1326 


gi466912 


Mycobacterium 
leprae 


tp2; B 1 549_C2_206 


oU 


4D 


1326 


gil220377 


Avian infectious 
bronchitis virus 


nucleocapsid protein 


fZA 

04 


DO 


1326 


gil3 177409 


Ectocarpus 
siliculosus virus 


EsV-1-135 


03 


34 


1327 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


zU / 




1327 


gi200962 


Mus musculus 


serine 1 ultra high sulfur protein 


202 


32 


1327 


gi32472 


Homo sapiens 


H.sapiens mRNA for high-sulphur 
keratin. 


196 


32 


1328 


AAR23732 


Homo sapiens 


MINU Gene 519 cDNA derived 
peptide. 


316 


oo 


1328 


gi35065 


Homo sapiens 


Human NKG5 mRNA, expressed 
in natural killer cells and T-cells. 


O 1 A 

314 


oo 


1328 


AAW59874 


Homo sapiens 


HUMA- Ammo acid sequence of 
the cDNA clone CAT-1 
(HTXET53). 


1 1 A 

314 


oo 


1329 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


359 


49 


1329 


AAM39466 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2611. 


341 


51 


1329 


AAM78420 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1082. 


337 


53 


1330 


gi 13937769 


Homo sapiens 


oimnar to j\j.jsjhn cl/in/v 
1200013F24 gene, clone 
MGC:12197 IMAGE: 3 997 840, 
mRNA, complete cds. 






1330 


gi7582294 


Homo sapiens 


BM-011 


781 


98 


1330 


AAM79664 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3310. 


255 


31 


1331 


1 gil4718451 


Homo sapiens 


sialic acid-binding lectin 1 1 


796 


71 



WO 03/054152 PCT/US02/39555 



Table 2A 
173 



SEQ ID 


Hit ID 


Species 


Description 
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(SIGLEC11) mRNA, complete 
cds. 






1331 


AAY41724 


Homo sapiens 


GETH Human PRO940 protein 
sequence. 


535 


50 


1331 


AAB44280 


Homo sapiens 


GETH Human PRO940 (UNQ477) 
protein sequence SEQ ID NO:259. 


535 


50 


1332 


AAT90731_ 
aal 


Homo sapiens 


FARB Human placental bikunin 
cDNA. 


849 


98 


1332 


gil2804515 


Homo sapiens 


serine protease inhibitor, Kunitz 
type, 2, clone MGC:2021 
IMAGE:2959462, mRNA, 
complete cds. 


848 


98 


1332 


gi2065529 


Homo sapiens 


placental bikunin mRNA, complete 
cds. 


848 


98 


1333 


gil4042550 


Homo sapiens 


cDNA FLJ14779 fis, clone 
NT2RP4000398, moderately 
similar to ZINC FINGER 
PROTEIN 140. 


2165 


98 


1333 


AAB93164 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 12091. 


2165 


98 


1333 


AAM93693 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3604. 


2159 


100 


1334 


gil2804907 


Homo sapiens 


Similar to metaxin 1, clone 
MGC:2518 IMAGE:3546178, 
mRNA, complete cds. 


1512 


100 


1334 


gil326108 


Homo sapiens 


Human metaxin (MTX) gene, 
complete cds. 


1098 


100 


1334 


gi2564913 


Homo sapiens 


clk2 kinase (CLK2), propinl, 
cotel, glucocerebrosidase (GBA), 
and metaxin genes, complete cds; 
metaxin pseudogene and 
glucocerebrosidase pseudogene; 
and thrombospondin3 (THBS3) 
gene, partial cds. 


1098 


100 


1335 


AAW85614 


Homo sapiens 


GEMY Secreted protein clone 
fr473 2. 


381 


83 


1335 


AAY94865 


Homo sapiens 


PROT- Human protein clone 
HP 10540. 


381 


83 


1335 


AAY36022 j 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
407. 


365 


80 


1336 


AAB 18447 


Homo sapiens 


MILL- Amino acid sequence of 
human TANGO 216 polypeptide. 


2257 


99 


1336 


AAB 18455 


Homo sapiens 


MILL- A human TANGO 216 
polypeptide clone. 


2257 


99 


1336 


AAU 19662 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 312. 


1876 


96 


1337 


AAB 18447 


Homo sapiens 


MILL- Amino acid sequence of 
human TANGO 216 polypeptide. 


2257 


99 


1337 


AAB 18455 


Homo sapiens 


MILL- A human TANGO 216 
polypeptide clone. 


2257 


99 


1337 


AAU19662 


Homo sapiens 


HUMA- Human novel 
extracellular matrix protein, Seq 
ID No 312. 


1876 


96 


1338 


AAY86303 


Homo sapiens 


HUMA- Human secreted protein 
HOGCK20, SEQ IDNO:218. 


2133 


94 
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1338 


gil4456615 


Homo sapiens 


PIG-T mRNA for phosphatidyl 
inositol glycan class T, complete 
cds. 


2120 


96 


1338 


gil5929132 


Homo sapiens 


clone MGC:8909 
IMAGE: 392 1680, mRNA, 
complete cds. 


2120 


96 


1339 


gil2836893 


Gallus gallus 


IPR328-like protein 


160 


29 


1339 


gi3093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


153 


29 


1339 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma-3 subunit mRNA, 
complete cds. 


153 


29 


1340 


gil2836893 


Gallus gallus 


IPR328-like protein 


158 


29 


1340 


gi3 093433 


Homo sapiens 


Chromosome 16 BAC clone 
CIT987SK-625P11, complete 
sequence. 


151 


29 


1340 


gi4558766 


Homo sapiens 


neuronal voltage gated calcium 
channel gamma-3 subunit mRNA, 
complete cds. 


151 


29 


1341 


AAW85737 


Homo sapiens 


SAGA Polypeptide with 
transmembrane domain. 


692 


100 


1341 


ABB 11882 


Homo sapiens 


HYSE- Human transmembrane 
protein homologue, SEQ ID 
NO:2252. 


692 


100 


1341 


AAG89353 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 473. 


692 


100 


1342 


ABB 12032 


Homo sapiens 


HYSE- Human SIGP 2328134 
homologue, SEQ ID NO:2402. 


866 


97 


1342 


AAY21851 


Homo sapiens 


INCY- Human signal peptide- 
contianing protein (SIGP) (clone 
ID 2328134). 


866 


97 


1342 


gi4101574 


Homo sapiens 


54TMp (54tm) mRNA, complete 
cds. 


860 


96 


1343 


gi3002925 


Homo sapiens 


T cell receptor beta chain 
(TCRB V 1 3 S 1 -TCRB J2 SI) 
mRNA, complete cds. 


1658 


100 


1343 


gi2982508 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


1527 


94 


1343 


gi3002933 


Homo sapiens 


T cell receptor beta chain 
(TCRBV3S 1-TCRBJ2S3) mRNA, 
complete cds. 


1251 


76 


1344 


gil4973269 


Streptococcus 

pneumoniae 

TIGR4 


cell wall surface anchor family 
protein 


481 


19 


1344 


gil5991793 


Streptococcus 
gordonii 


platelet binding protein GspB 


303 


17 


1344 


gi8885520 


Streptococcus 
gordonii 


streptococcal hemagglutinin 


293 


16 


1.343 


AAY0775 1 


Homo sapiens 


T TT TK X A T T j "I i 

HUMA- Human secreted protein 
fragment encoded from gene 8. 


293 


100 


1345 


gil 142588 


Trypanosoma 
brucei 


CR3 


85 


42 


1345 


gi3037018 


Bodo saltans 


NADH dehydrogenase subunit 5 


80 


35 


1346 


AAG78000 


Homo sapiens 


BIOW- Human actin 14. 


663 


100 


1346 


ABB17913 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 


644 


98 
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6570. 






1346 


AAY94954 


Homo sapiens 


GEMY Human secreted protein 
clone iw66_l protein sequence 
SEQ ID NO: 114. 


596 


68 


1347 


gi9837433 


Homo sapiens 


sialic acid binding 
immunoglobulin-hke lectin 8 long 
splice variant (Siglec8) gene, 
complete cds. 


2206 


88 


1347 


gi6746556 


Homo sapiens 


sialic acid-binding 
immunoglobulin-like lectin-8 
(SIGLEC8) mRNA, complete cds. 


2031 


93 


1347 


gi6980022 


Homo sapiens 


si glee SAF2 (SAF2) mRNA, 
complete cds. 


2031 


93 


1348 


gil5451469 


Homo sapiens 


siglec-like protein (SLG2) gene 
and alternatively spliced variants, 
complete cds. 


2689 


99 


1348 


gil5217166 


Homo sapiens 


sialic acid-binding Ig-like lectin 10 
(SIGLEC10) mRNA, complete 
cds. 


2682 


99 


1348 


gil4164613 


Homo sapiens 


sialic acid binding 
immunoglobulin-like lectin 10 
(SIGLEC10) mRNA, complete 
cds. 


2356 


98 


1349 


AAB60112 


Homo sapiens 


INCY- Human transport protein 
TPPT-32. 


775 


100 


1349 


gi9663117 


Homo sapiens 


mRNA for organic cation 
transporter. 


TOO 




1349 


AAB47000 


Homo sapiens 


BOEN/ Human BOCT protein. 


382 


A O 

4o 


1350 


AAY76219 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 96. 


336 


94 


1350 


gi2906006 


Homo sapiens 


WASP interacting protein (WIP) 
mRNA, partial cds. 


134 


30 


1350 


gi22269 j 


Zea mays ! 


cell wall protein (108 AA) 


105 


34 


1351 


AAB08767 


Homo sapiens 


INCY- A human leukocyte and 

11 1 -1.1 j • /■•¥■ -r-» A T»\ 

blood related protein (LBAP). 


92 


37 


1351 


gi5 76631 


Torpedo 
marmorata 


14 kDa transmembrane protein 


87 


32 


1351 


AAM78542 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1204. 


72 


31 


1352 


gi5817194 


Homo sapiens 


mRNA; cDNA DKFZp434F01 1 
(from clone DKFZp434F011); 
partial cds. 


221 


95 


1352 


gi7576452 


Homo sapiens 


hBOIT mRNA for potent brain 
type organic ion transporter, 
complete cds. 


160 


36 


1352 


AAY06116 


Homo sapiens 


MILL- Human organic cation 
transporter OCT-3. 


154 


35 


1353 


gil6552104 


Homo sapiens 


cDNA FLJ32082 fis, clone 
OCBBF2000231, weakly similar 
to rHUbrrlL>JLlrAc>±i AZ 
INHIBITOR SUBUNIT B 
PRECURSOR. 


2566 


99 


1353 


AAY66713 


Homo sapiens 


GETH Membrane-bound protein 
PRO1309. 


2566 


99 


1353 


AAU12396 


Homo sapiens 


GETH Human PRO1309 
polypeptide sequence. 


2566 


99 


1354 


gi 15559274 


Homo sapiens 


clone MGC:20205 


502 


40 
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IMAGE:3627858, mRNA, 
complete cds. 






1354 


AAB 15549 


Homo sapiens 


INCY- Human immune system 
molecule from Incyte clone 
2774913. 


502 


40 


1354 


AAB19729 


Homo sapiens 


CURA- Human SECX Clone 
4339264-2 encoded protein. 


502 


40 


1355 


AAY99399 


Homo sapiens 


GETH Human PR01268 
(UNQ638) amino acid sequence 
SEQ IDNO:214. 


603 


100 


1355 


AAY78808 


Homo sapiens 


PROT- Hydrophobic domain 
containing protein clone HP 10537 
protein sequence. 


603 


100 


1355 


AAB87570 


Homo sapiens 


GETH Human PR01268. 


603 


100 


1356 


AAM78418 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1080. 


1902 


97 


1356 


ABB 11897 


Homo sapiens 


HYSE- Human F22162_l 
homologue, SEQ ID NO:2267. 


1827 


93 


1356 


AAM79402 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3048. 


1820 


93 


1357 


gi397607 


Homo sapiens 


H.sapiens encoding CLA-1 
mRNA. 


2331 


99 


1357 


AAY49573 


Homo sapiens 


WHED Human CLA-1 protein 
sequence. 


2331 


99 


1357 


AAW97900 


Homo sapiens 


MILL- Human SR-BI class B 
scavenger. 


2318 


98 


1358 


gi854065 


Human 
herpesvirus 6 


U88 


348 


31 


1358 


gil0434098 


Homo sapiens 


cDNA FLJ 12547 fis, clone 
NT2RM4000634. 


273 


32 


1358 


AAB95124 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:17122. 


273 


32 


1359 


AAE05302 


Homo sapiens 


MILL- Human TANGO 457 
protein. 


1521 


96 


1359 


AAE05303 


Homo sapiens 


MILL- Human mature TANGO 
457 protein. 


1397 


96 


1359 


AAE05305 


Homo sapiens 


MILL- Human TANGO 457 
protein cytoplasmic domain. 


1260 


100 


1360 


gil 0129690 


Homo sapiens 


mRNA for mucolipidin (ML4 
gene). 


804 


53 


1360 


gil0438844 


Homo sapiens 


cDNA: FLJ22449 fis, clone 
HRC09609. 


804 


53 


1360 


gil 3477347 


Homo sapiens 


mucolipin 1, clone MGC:3287 
IMAGE:3507836, mRNA, 
complete cds. 


804 


53 


1361 


AAE04122 


Homo sapiens 


HUMA- Human gene 23 encoded 
secreted protein HESOK73, SEQ 
ID NO: 108. 


214 


61 


1361 


AAE04169 


Homo sapiens 


HUMA- Human gene 23 encoded 
secreted protein HE80K73, SEQ 
ID NO:158. 


207 


60 


1361 


AAG00392 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4473. 


117 


43 


1362 


AAY27853 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 101. 


274 


94 


1362 


gi904289 


Phaseolus 
vulgaris 


fungine endopolygalacturonase 
inhibitor 


66 


41 
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1362 


AAB27594 


Homo sapiens 


HUMA- Human secreted protein 
SEQ ID NO: 95. 


60 


58 


1363 


gil7016967 


Homo sapiens 


NUANCE (NUA) mRNA, 
complete cds; alternatively spliced. 


3404 


71 


1363 


gil7861384 


Homo sapiens 


nesprin-2 gamma mRNA, 
complete cds. 


3404 


71 


1363 


gi5262574 


Homo sapiens 


mRNA; cDNA DKFZp434G173 
(from clone DKFZp434G173); 
complete cds. 


3404 


71 


1364 


AAB95854 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18912. 


72 


37 


1364 


gi9621943 


Pelargonium 
senecioides 


NADH dehydrogenase 


58 


42 


1364 


gi9621945 


Pelargonium 
trifidum 


NADH dehydrogenase 


58 


A O 

42 


1365 


AAW29654 


Homo sapiens 


GEMY Human secreted protein 
DM406 1. 


140 


A O 


1365 


gil0187870 


Rhodococcus sp. 


ohpA transport 


75 


ZD 


1365 


gil5559671 


Homo sapiens 


clone MGC:20633 
IMAGE:4761663, mRNA, 
complete cds. 


72 


33 


1366 


gil0566471 


Mus musculus 


Gliacolin 


OCA 

850 


68 


1366 


gil4278927 


Mus musculus 


gliacolin 


850 


68 


1366 


gi3747099 


Mus musculus 


Clq-related factor 


724 


66 


1367 


gi2745756 


Aotus trivirgatus 


ribonuclease k6 precursor 


431 


91 


1367 


gi5730384 


Eulemur fulvus 
collaris 


ribonuclease k6 precursor 


305 


86 


1367 


gi5730382 


Nycticebus 
coucang 


ribonuclease k6 precursor 


279 


80 


1368 


AAE09651 


Homo sapiens 


HUMA- Human gene 13 encoded 
lipid metabolism protein HTJNI73, 
SEQ ID NO:45. 


484 


98 


1368 


AAG64355 


Homo sapiens 


UYFU- Human lambda crystallin. 


400 


97 


1368 


AAY92506 


Homo sapiens 


INCY- Human OXRE-3 with 
identity to lambda crystallin. 


381 


no 

98 


1369 


AAM25241 


Homo sapiens 


HYSE- Human protein sequence 
SEQ IDNO:756. 


484 


95 


1369 


AAE09651 


Homo sapiens 


HUMA- Human gene 13 encoded 
lipid metabolism protein HTJNI73, 
SEQ ID NO:45. 


352 


100 


1369 


AAG64355 


Homo sapiens 


UYFU- Human lambda crystallin. 


268 


98 


1370 


AAM79626 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3272. 


214 


100 


1370 


AAM79368 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3014. 


70 


44 


1370 


AAO02702 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 16594. 


65 


36 


1371 


gi6653659 


Oryctolagus 
cuniculus 


chloride channel CLC-6 


3919 


96 


1371 


ABB 11 826 


Homo sapiens 


HYSE- Human CI channel 
homologue, SEQ ID NO:2196. 


3865 


96 


1371 


gil770376 


Homo sapiens 


H. sapiens mRNA for chloride 
channel, ClC-6a. 


1620 


100 


1372 


AAG71967 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1648. 


725 


97 


1372 


AAG71962 


Homo sapiens 


YEDA Human olfactory receptor 


714 


95 



WO 03/054152 PCT/US02/39555 



Table 2A 
178 



SEQ ID 


Hit ID 


Species 


Description 


S score 


% Identity 








polypeptide, SEQ ID NO: 1643. 






1372 


gil2007416 


Mus musculus 


m51 olfactory receptor 


553 


72 


1373 


AAM63071 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 35176. 


278 


100 


1373 


AAM75882 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 36188. 


278 


100 


1373 


AAM67333 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 27639. 


234 


100 


1374 


AAH26531_ 
aal 


Homo sapiens 


REGC Human 

proton/oligonucleotide transporter 
hPHTl cDNA. 


920 


100 


1374 


AAB82821 


Homo sapiens 


REGC Human 

proton/oligonucleotide transporter 
hPHTl polypeptide. 


815 


100 


1374 


gi2208839 


Rattus 
norvegicus 


peptide/histidine transporter 


721 


82 


1375 


gil90418 


Homo sapiens 


Human cathepsin L gene, complete 
cds. 


1597 


87 


1375 


gi29715 


Homo sapiens 


Human mRNA for pro-cathepsin L 
(major excreted protein MEP). 


1597 


87 


1375 


AAW47031 


Homo sapiens 


USSH Human procathepsin L. 


1597 


87 


1376 


gi 16566341 


Homo sapiens 


G protein-coupled receptor 
(GPR101) gene, complete cds. 


2554 


98 


1376 


AAB86428 


Homo sapiens 


BOEN/ Human brain SERALPHA 
protein. 


2554 


98 


1376 


AAU04369 


Homo sapiens 


AREN- Human G-protein coupled 
receptor, hRUP15. 


2554 


98 


1377 


AAY53605 


Homo sapiens 


METR- Peptide of human KChAP 
that binds to KValpha and Kvbeta 
subunits. 


79 


35 


1377 


AAY53608 


Homo sapiens 


METR- KChAP domain that binds 
to KValpha and Kvbeta subunits. 


78 j 


35 


1377 


gi6 102853 


Homo sapiens 


mRNA; cDNA DKFZp727A051 
(from clone DKFZp727A051); 
partial cds. 


82 


38 


1378 


AAB61616 


Homo sapiens 


PROT- Human protein HP 10678. 


2416 


94 


1378 


AAG68126 


Homo sapiens 


FARB Human 7TM-GPCR protein 
sequence SEQ ID NO:6. 


966 


40 


1378 


AAE12023 


Homo sapiens i 


INCY- Human G-protein coupled 
receptor, GCREC-2. 


951 


40 


1379 


AAY30735 


Homo sapiens 


HUMA- Amino acid sequence of a 
human secreted protein. 


280 


100 


1379 


gi333947 


Human 
respiratory 
syncytial virus 


membrane glycoprotein 


48 


39 


1379 


gi222567 


Human 
respiratory 
syncytial virus 


SH protein 


48 


39 


1380 


gi2459682 


Homo sapiens 


MAGE-B2 (MAGE-B2), MAGE- 
B3 (MAGE-B3), MAGE-B4 
(MAGE-B4), and MAGE-B1 
(MAGE-B1) genes, complete cds. 


884 


55 


1380 


gi3687199 


Homo sapiens 


Xp22 bins 169-171 BAC GSHB- 
383H3 (Genome Systems Human 


884 


55 
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BAG Library) complete sequence. 






1380 


gi4033512 


Homo sapiens 


DAM 10 exon 3, partial sequence; 
and DAM10=DSS-AHC critical 
interval MAGE superfamily 
protein (DAM 10) gene, complete 
cds. 


867 


54 


1381 


gil0198115 


Homo sapiens 


2P domain potassium channel 
TREK2 (KCNK10) mRNA, 
complete cds. 


2697 


100 


1381 


gi8452900 


Rattus 
norvegicus 


potassium channel TREK-2 


2556 


95 


1381 


gi4584799 


Mus musculus 


TREK-1 K+ channel subunit 


1244 


65 


1382 


ABB 11297 


Homo sapiens 


HYSE- Human Coxsackie 
adenovirus receptor homologue, 
SEQ ID NO: 1667. 


699 


97 


1382 


gil4279421 


Danio rerio 


coxsackievirus and adenovirus 
receptor-like protein 


312 


40 


1382 


gi6013133 


Rattus 
norvegicus 


coxsackie-adeno virus-receptor 
homolog 


306 


39 


1383 


gil7016394 


Homo sapiens 


cervical cancer 1 proto-oncogene- 
binding protein KG19 (KG19) 
mRNA, complete cds. 


753 


100 


1383 


AAM58441 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 30546. 


99 


28 


1383 


AAB86126 


Homo sapiens 


HUMA- Human MIT-like protein 
fragment encoded by cDNA clone 
HMSMI80. 


99 


28 


1384 


AAM06866 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 1074. 


1133 


95 


1384 


gi 15099951 


Mus musculus 


diacylglycerol acyltransferase 2 


959 


51 


1384 


gi 15099953 


Homo sapiens 


diacylglycerol acyltransferase 2 
mRNA, complete cds. 


951 


50 


1386 


gi338506 


Homo sapiens 


Human salivary statherin gene, 
exons 2-6. 


254 


83 


1386 


gi338508 


Homo sapiens 


Human statherin mRNA, complete 
cds. 


254 


83 


1386 


AAY94527 


Homo sapiens 


INCY- Human statherin protein. 


254 


83 


1387 


gi 10435784 


Homo sapiens 


cDNA FLJ13693 fis, clone 
PLACE2000111. 


1011 


100 


1387 


AAB94721 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15739. 


1011 


100 


1387 


gil3592427 


Caenorhabditis 
elegans 


similar to glycoproteins 


124 


34 


1388 


gil2654579 


Homo sapiens 


peptidylprolyl isomerase B 
(cyclophilin B), clone MGC:2224 
IMAGE:2966791, mRNA, 
complete cds. 


918 


98 


1388 


gi 14250758 


Homo sapiens 


peptidylprolyl isomerase B 
(cyclophilin B), clone MGC: 14109 
IMAGE:3502055, mRNA, 
complete cds. 


918 


98 


1388 


gi337999 


Homo sapiens 


Human secreted cyclophilin-like 
protein (SCYLP) mRNA, complete 
cds. 


918 


98 


1389 


AAE07112 


Homo sapiens 


HUMA- Human gene 6 encoded 
secreted protein fragment, SEQ ID 


2473 


99 
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NO: 129. 






1389 


gi9368530 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 363668. 


2457 


99 


1389 


gil2053163 


Homo sapiens 


mRNA; cDNA DKFZp434D0727 
(from clone DKFZp434D0727); 
complete cds. 


2378 


99 


1390 


gil6589056 


Homo sapiens 


type II gonadotropin-releasing 
hormone receptor gene, partial cds. 


1021 


99 


1390 


gil7048804 


Homo sapiens 


The CDS shown includes some 
apparent amino acids (from Gly 10 
onw ards) which would be deleted 
in a short intron 


1008 


98 


1390 


gil4029600 


Cercopithecus 
aethiops 


GnRH receptor II 


944 


92 


1391 


gil6359249 


Mus musculus 


RIKENcDNA 130001 0M03 gene 


2226 


91 


1391 


AAM93450 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3100. 


575 


37 


1391 


gil0438431 


Homo sapiens 


cDNA: FLJ22155 fis, clone 
HRC00205. 


596 


34 


1392 


AAE04896 


Homo sapiens 


INCY- Human transporter and ion 
channel-9 (TRICH-9) protein. 


825 


100 


1392 


gil2003980 


Homo sapiens 


spinster-like protein mRNA, 
complete cds. 


695 


52 


1392 


gil4249892 


Homo sapiens 


spinster-like protein, clone 
MGC: 15767 IMAGE:3501826, 
mRNA, complete cds. 


695 


52 


1393 


AAG71515 


Homo sapiens 


YEDA Human olfactory receptor 
polypeptide, SEQ ID NO: 1 196. 


1051 


94 


1393 


AAG72603 


Homo sapiens 


YEDA Human OR-like 
polypeptide query sequence, SEQ 
ID NO: 2284. 


1051 


94 


1393 


AAU24762 


Homo sapiens 


SENO- Human olfactory receptor 
AOLFR130B. 


482 


47 


1394 


AAB08894 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 4 SEQ 
IDNO:51. 


165 


59 


1394 


gil5626257 


Buffalopox virus 


p8 protein homologue 


69 


40 


1394 


AAO06451 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 20343. 


69 


35 


1395 


gi2792525 


Equus caballus 


connexin 43 


64 


35 


1395 


gi 15 148992 


Human 

immunodeficienc 
y virus type 1 


vpu protein 


64 


30 


1395 


gi5738572 


Human 

immunodeficienc 
y virus type 1 


VPU protein 


60 


33 


1396 


AAM83617 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 11210. 


61 


41 


1396 


gi4467773 


Helicobacter 
pylori 


cytotoxin associated protein A 


60 


34 


1396 


gi7248699 


Helicobacter 
pylori 


cytotoxin associated protein CagA 


60 


34 


1397 


gil 1862939 


Mus musculus 


DDM36 


5233 


88 


1397 


gil 1862941 


Mus musculus 


DDM36E 


5224 


88 


1397 


gi7650186 


Mus musculus 


neighbor of Punc ell protein 


5196 


87 
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1398 


gi5596705 


Homo sapiens 


Novel human mRNA similar to C. 
elegans gene WP:CE18674, 
TR:Q19985. 


1305 


71 


1398 


gil5292481 


Drosophila 
melanogaster 


SD03655p 


1174 


59 


1398 


AAB88372 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0108. 


886 


71 


1399 


gil335598 


Simian sarcoma 
virus 


coding sequence of pl5E 


64 


33 


1399 


gil4039584 


Casuarius 
casuarius 


ATPase 8 


55 


50 


1399 


gil7427567 


Ralstonia 
solanacearum 


HYPOTHETICAL 
TRANSMEMBRANE PROTEIN 


72 


42 


1400 


AAM88491 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 16084. 


38 


41 


1400 


AAO09674 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23566. 


38 


44 


1400 


gi3845106 


Plasmodium 
falciparum 


metal binding protein (DHHC 
domain) 


55 


36 


1401 


gil0434098 


Homo sapiens 


cDNA FLJ12547 fis, clone 
NT2RM4000634. 


149 


34 


1401 


AAB95124 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17122. 


149 


34 


1401 


AAO09309 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23201. 


139 


26 


1402 


AAE01249 


Homo sapiens 


HUMA- Human gene 18 encoded 
secreted protein HFIIN69, SEQ ID 
NO:lll. 


222 


100 


1402 


AAE01299 


Homo sapiens 


HUMA- Human gene 18 encoded 
secreted protein HFIIN69, SEQ ID 
NO: 162. 


222 


100 


1402 


AAE01332 


Homo sapiens 


HUMA- Human gene 18 encoded 
secreted protein fragment, SEQ ID 
NO: 197. 


222 


100 


1403 


AAM06589 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 320. 


237 


100 


1403 


gil0732779 


Mus musculus 


APRIL 


56 


43 


1403 


AAM87662 


Homo sapiens ■ 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15255. 


38 


58 


1404 


AAO09486 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 23378. 


486 


98 


1404 


gi9955912 


Homo sapiens 


GP VI mRNA for platelet 
glycoprotein VT-2, complete cds. 


288 


37 


1404 


AAB40232 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 46 SEQ 
ID NO: 142. 


326 


40 


1405 


AAM06606 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 337. 


50 


34 


1405 


gi495989 


Homo sapiens 


Human rearranged IgH chain gene, 
VJ6 region, partial cds. 


58 


27 


1405 


AAM85487 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 13080. 


56 


32 


1406 


gil3377867 


Gallus gallus 


claudin-3 


116 


24 


1406 


gil5553371 


Danio rerio 


claudin c 


112 


26 
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1406 


gil5553375 


Danio rerio 


claudin h 


110 


23 


1407 


AAY78801 


Homo sapiens 


PROT- Hydrophobic domain 
containing protein clone HP00631 
amino acid sequence. 


701 


100 


1407 


AAY32204 


Homo sapiens 


INCY- Human receptor molecule 
(REC) encoded by Incyte clone 
2132179. 


701 


100 


1407 


gi5231135 


Homo sapiens 


androgen induced protein (AIG-1) 
mRNA, complete cds. 


695 


99 


1408 


gi 13543940 


Homo sapiens 


Similar to RIKEN cDNA 
2610017G09 gene, clone 
MGC: 12975 IMAGE: 3 3473 12, 
mRNA, complete cds. 


2232 


96 


1408 


AAB12138 


Homo sapiens 


PROT- Hydrophobic domain 
protein isolated from HT-1080 
cells. 


2221 


96 


1408 


AAG81335 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 188. 


2218 


96 


1410 


gil0719608 


Homo sapiens 


IL-22 receptor (IL22R) mRNA, 
complete cds. 


1699 


100 


1410 


AAW97861 


Homo sapiens 


ZYMO Human cytokine receptor 
11 (Zcytorll). 


1699 


100 


1410 


AAY97045 


Homo sapiens 


MILL- Human TANGO 241. 


1699 


100 


1411 


gil4090278 


Rattus 
norvegicus 


TAT1 


668 


84 


1411 


gi458247 


Homo sapiens 


Human X-linked PEST-containing 
transporter (XPCT) mRNA, partial 
cds. 


414 


51 


1411 


gi458255 


Homo sapiens 


Human X-l inked PEST-containing 
transporter (XPCT) gene, exon 6. 


414 


51 


1412 


gi4378057 


Homo sapiens 


organic anion transporter 1 
(OAT1) mRNA, complete cds. 


317 


51 


1412 


gi4579725 


Homo sapiens 


mRNA for hOATl-2, complete 
cds. 


317 


51 


1412 


gi5901645 


Homo sapiens 


organic anion transporter 1 
(SLC22A6) mRNA, complete cds. 


317 


51 


1413 


gi3881524 


Caenorhabditis 
elegans 


ZK1067.4 


714 


41 


1413 


gi 19322 


Lycopersicon 
esculentum 


glycine-rich protein 


63 


50 


1413 


gi2204081 


Pinctada fucata 


insoluble protein 


93 


52 


1414 


AAB43682 


Homo sapiens 


HUMA- Human cancer associated 
protein sequence SEQ ID 
NO: 1127. 


1522 


100 


1414 


gil2654351 


Homo sapiens 


solute carrier family 25 
(mitochondrial carrier; phosphate 
carrier), member 3, clone 
MGC:5280 IMAGE:2984830, 
mRNA, complete cds. 


1522 


100 


1414 


gil 2654961 


Homo sapiens 


solute carrier family 25 
(mitochondrial carrier; phosphate 
carrier), member 3, clone 
MGC:5556 IMAGE:3457151, 
mRNA, complete cds. 


1522 


100 


1415 


gil764015 


Ciona intestinalis 


COS41.5 


314 


42 


1415 


AAM42167 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 7098. 


284 


30 
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1415 


AAE03438 


Homo sapiens 


HUMA- Human gene 12 encoded 
secreted protein HETHW90, SEQ 
ID NO: 121. 


258 


30 


1416 


gil3591714 


Homo sapiens 


immunoglobulin superfamily 
receptor translocation associated 
protein 2c (IRTA2) mRNA, 
complete cds, alternatively spliced. 


1128 


88 


1416 


gil5277746 


Homo sapiens 


Fc receptor-like protein 5 (FCRH5) 
mRNA, complete cds. 


1128 


88 


1416 


AAB82315 


Homo sapiens 


UYCO Human immunoglobulin 
receptor isoform IRTA2c. 


1128 


88 


1417 


AAV40386_ 
aal 


Homo sapiens 


INCY- Human zinc binding 
protein ZB-2 encoding cDNA. 


525 


80 


1417 


gil3278762 


Homo sapiens 


ring finger protein 5, clone 
MGQ2407 IMAGE:2822537, 

n\T A 1 i 3_ 

mRNA, complete cds. 


525 


80 


1417 


gil3366064 


Homo sapiens 


mRNA for HsRmal, complete cds. 


525 


80 


1418 


gi3077703 


Oryctolagus 
cuniculus 


mitsugumin29 


1336 


93 


1418 


gi3461888 


Mus musculus 


mitsugumin29 


1314 


91 


1418 


AAU25436 


Homo sapiens 


INCY- Human mddt protein from 
clone LG:171377.1:2000MAY19. 


1050 


96 


1419 


gil3452508 


Mus musculus 


claudin 14 


371 


40 


1419 


gi 12597447 


Homo sapiens 


claudin 14 (CLDN14) mRNA, 
complete cds. 


370 


40 


1419 


gi 15082421 


Homo sapiens 


Similar to claudin 14, clone 
MGC:20195 IMAGE :46 84949, 
mRNA, complete cds. 


370 


40 


1420 


AAG00539 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4620. 


173 


49 


1420 


AAU20426 


Homo sapiens 


HUMA- Human secreted protein, 
Seq ID No 418. 


163 


47 


1420 


AAG75413 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:6177, 


159 


50 


1421 


gil4486155 


Bos taurus 


Rh type B glycoprotein 


1703 


86 


1421 


gil5718471 


Homo sapiens 


Rh type B glycoprotein (RHBG) 
gene, exons 9, and 10 and 
complete cds. 


1448 


80 


1421 


gi9858562 


Homo sapiens 


Rh type B glycoprotein (RHBG) 
mRNA, complete cds. 


1448 


80 


1422 


AAM00949 


Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 425. 


215 


46 


1422 


AAE01852 


Homo sapiens 


HUMA- Human gene 1 1 encoded 
secreted protein fragment, SEQ ID 
NO:175. 


174 


40 


1422 


gil4209834 


Mus musculus 


ATP-binding cassette transporter 
sub-family A member 7 


178 


38 


1423 


gi 12053628 


Homo sapiens 


mRNA for ribonuclease 7. 


718 


99 


1423 


A A A T A A 1 PkO 

AAY44192 


Homo sapiens 


INNO- Human keratmocyte- 
derived RNase-like protein. 


718 


99 


1423 


AAB10601 


Homo sapiens 


SCHD Human SAP-2 pre-protein. 


718 


99 


1424 


AAM83996 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 11589. 


1153 


99 


1424 


AAG02219 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6300. 


483 


99 
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1424 


gi222902 


Oncorhynchus 
keta 


pituitary-specific transcription 
factor, Pit- 1/GHF-l 


98 


32 


1425 


AAB82485 


Homo sapiens 


ZYMO Human secretin-like 
receptor Zgprl. 


499 


92 


1425 


AAB82487 


Homo sapiens 


ZYMO Human secretin-like 
receptor Zgprl splice variant. 


499 


92 


1425 


AAE03382 


Homo sapiens 


HUMA- Human gene 5 encoded 
secreted protein HEOMX53, SEQ 
ID NO:40. 


499 


92 


1426 


gi6808374 


Homo sapiens 


mRNA; cDNA DKFZp434G0812 
(from clone DKFZp434G0812); 
partial cds. 


3211 


100 


1426 


gi433383 


Tripneustes 
gratilla 


dynein heavy chain isotype 5A 


241 


26 


1426 


gi6706264 


Leishmania 
major 


dynein heavy chain 


271 


26 


1428 


gi 135403 00 


Mus musculus 


nucleolar protein C7B 


427 


34 


1428 


gil3561516 


Mus musculus 


nucleolar protein C7 


426 


34 


1428 


AAM25939 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1454. 


156 


90 


1429 


AAU27632 


Homo sapiens 


ZYMO Human protein 
AFP674535. 


117 


49 


1429 


AAM40391 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3536. 


117 


49 


1429 


gi7022187 


Homo sapiens 


cDNA FLJ10261 fis 5 clone 
HEMBB1000975. 


85 


42 


1430 


AAY07751 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 8. 


293 


100 


1430 


gil 142588 


Trypanosoma 
brucei 


CR3 


84 


43 


1430 


AAO05990 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 19882. 


72 


33 


1431 


AAM93525 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3259. 


246 


36 


1431 


AAY66693 


Homo sapiens 


GETH Membrane-bound protein 
PRO 1004. 


191 


38 


1431 


AAB65216 


Homo sapiens 


GETH Human PRO 1004 
(UNQ488) protein sequence SEQ 
ID NO:227. 


191 


38 


1432 


AAB88388 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0131. 


316 


44 


1432 


AAB25719 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 6 SEQ 
IDNO:108. 


84 


73 


1432 


AAM62047 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 34152. 


45 


39 


1433 


AAB88388 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone PSEC0131. 


318 


44 


1433 


AAB25719 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 6 SEQ 
ID NO: 108. 


86 


66 


1433 


gi5629917 


Homo sapiens 


partial FLN2 gene for ABP-L, 
gamma filamin, exons 1 to 3. 


77 


34 


1434 


AAG75991 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:6755. 


235 


77 
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1434 


AAM78941 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1603. 


220 


86 


1434 


ABB14802 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
3459. 


58 


40 


1435 


gi9621664 


Homo sapiens 


RHBDL gene for rhomboid-related 
protein. 


1129 


56 


1435 


gil4336709 


Homo sapiens 


16pl3.3 sequence section 3 of 8. 


1123 


56 


1435 


gi3287191 


Homo sapiens 


T1 "VTA/' 1 1 " 1 1 J. J 

mRNA for rhomboid-related 
protein, complete CDS. 


1123 


56 


1436 


gi7 106866 


Homo sapiens 


HSPC238 


238 


56 


1436 


AAU15887 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 840. 


238 


56 


1436 


AAU16342 


Homo sapiens 


HUMA- Human novel secreted 
protein, Seq ID 1295. 


238 


56 


1437 


gil418942 


Mus musculus 


samaphorin G 


5660 


93 


1437 


AAY94990 


Homo sapiens 


ALPH- Human secreted protein 
vb21_l, SEQ ID NO: 20. 


5403 


99 


1437 


gi2772584 


Homo sapiens 


semaphorin F homolog mRNA, 
complete cds. 


3560 


59 


1439 


gil 1055322 


Homo sapiens 


vanilloid receptor-related 
osmotically activated channel 
(VROAC) mRNA, complete cds. 


3324 


100 


1439 


AAI66972_ 
aal 


Homo sapiens 


MILL- Human ion channel VR-5 
cDNA sequence. 


3319 


99 


1439 


AAG65787 


Homo sapiens 


MILL- Human ion channel VR-5 
protein sequence. 


3318 


99 


1440 


gi4155033 


Helicobacter 
pylori J99 


cag island protein 


72 


25 


1440 


AAU69567 


Homo sapiens 


PHAA Human G protein-coupled 
receptor from cDNA Seq-2643. 


77 


23 


1440 


gil3171062 


turkey 
coronavirus 


M protein 


56 


41 


1441 


gil6554186 


Homo sapiens 


cDNA FLJ25409 fis, clone 
TST03074. 


644 


100 


1441 


gi4235228 


Mus musculus 


leucine zipper-EF-hand containing 
transmembrane protein 1 


497 


59 


1441 


gi6599194 


Homo sapiens 


mRNA; cDNA DKFZp434C229 
(from clone DKFZp434C229); 
partial cds. 


493 


63 


1442 


gil6359165 


Homo sapiens 


clone IMAGE:4645529, mRNA, 
partial cds. 


1270 


100 


1442 


ABB 11242 


Homo sapiens 


HYSE- Human SLIT-2 
homologue, SEQ ID NO: 1612. 


653 


99 


1442 


AAB07469 


Homo sapiens 


ZYMO A human leucine-rich 
repeat protein designated Zlrr3. 


451 


36 


1443 


AAE03245 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein fragment, SEQ ID 
NO:95. 


804 


100 


1443 


AAE03244 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein fragment, SEQ ID 
NO:94. 


799 


100 


1443 


AAE03204 


Homo sapiens 


HUMA- Human gene 3 encoded 
secreted protein HNGNN78, SEQ 
IDNO:54. 


617 


100 


1444 


gi4062463 


Escherichia coli 


ABC transporter probable ATP- 


697 


91 
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binding subunit homolog 






1444 


gi 15487341 


Escherichia coli 


macrolide-specific ABC-type 
efflux carrier 


697 


91 


1444 


gil7743610 


Agrobacterium 
tumefaciens str. 
C58 (Dupont) 


ABC transporter, nucleotide 
binding/ATPase protein 


366 


51 


1446 


gi3 168602 


Homo sapiens 


pi 60 mRNA, partial cds. 


3755 


89 


1446 


AAW3 11 85 


Homo sapiens 


DAND Human pi 60 polypeptide 
160.1. 


3169 


87 


1446 


gi3 168604 


Homo sapiens 


proline and glutamic acid rich 
nuclear protein isoform mRNA, 
partial cds. 


1641 


99 


1447 


gil4042515 


Homo sapiens 


cDNA FLJ14761 fis, clone 
NT2RP3003302. 


67 


36 


1447 


AAG67254 


Homo sapiens 


HELI- Amino acid sequence of a 
human liver-associated gene. 


67 


36 


1447 


AAB94495 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15 188. 


67 


36 


1448 


AAB24058 


Homo sapiens 


GETH Human PRO290 protein 
sequence SEQ ID NO:7. 


1972 


100 v 


1448 


AAY66639 


Homo sapiens 


GETH Membrane-bound protein 
PRO290. 


1972 


100 


1448 


AAB65162 


Homo sapiens 


GETH Human PRO290 (UNQ253) 
protein sequence SEQ ID NO: 33. 


1972 


100 


1449 


gil4043409 


Homo sapiens 


Similar to procollagen, type IV, 
alpha 3, clone MGC: 11337 
IMAGE:3953131, mRNA, 
complete cds. 


89 


75 


1449 


gi5420387 


Leishmania 
major 


proteophosphoglycan 


86 


24 


1449 


gil61 17372 


Macropodid 
herpesvirus 1 


ICP4 


102 


27 


1450 


AAU27660 


Homo sapiens 


ZYMO Human protein 
AFP671052. 


889 


100 


1450 


AAG74151 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:4915. 


748 


100 


1450 


gi5670326 


Homo sapiens 


ICERE-1 mRNA, complete cds. 


110 


31 


1451 


AAY86519 


Homo sapiens 


HUMA- Human gene 71 -encoded 
protein fragment, SEQ ID NO:434. 


46 


60 


1451 


AAM59183 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 31288. 


61 


39 


1451 


AAM71718 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 32024. 


61 


39 


1452 


gil2053219 


Homo sapiens 


mRNA; cDNA DKFZp434N1235 
(from clone DKFZp434N1235); 
complete cds. 


1031 


93 


1452 


gil 5559050 


Ethmostigmus 
rubripes 


ADP-ATP translocator 


738 


68 


1452 


gi339723 


Homo sapiens 


Human ADP/ATP translocase 
mRNA, 3' end, clone pHAT8. 


693 


68 


1453 


gil5025781 


Clostridium 
acetobutylicum 


Predicted membrane protein 


136 


32 


1453 


AAM95190 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 3848. 


42 


23 
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1453 


AAM55650 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded protein 
SEQ ID NO: 27755. 


61 


31 


1454 


gi4929597 


Homo sapiens 


CGI-64 protein mRNA, complete 
cds. 


1879 


96 


1454 


gi6995987 


Homo sapiens 


mitochondrial carrier homolog 1 
isoform a mRNA, complete cds; 
nuclear gene for mitochondrial 
product. 


1818 


99 


1454 


gi6995989 


Homo sapiens 


mitochondrial carrier homolog 1 
isoform b (MTCH1) mRNA, 
partial cds; nuclear gene for 
mitochondrial product. 


1483 


99 


1455 


gil7131893 


Nostoc sp. PCC 
7120 


WD-repeat protein 


248 


26 


1455 


gi886024 


Thermomonospo 
ra curvata 


PkwA 


248 


30 


1455 


gil7225210 


Podospora 
anserina 


beta transducin-like protein HET- 
D2Y 


250 


25 


1456 


AAB36840 


Homo sapiens 


ZYMO Human insulin receptor- 
related receptor protein with signal 
peptide. 


6733 


98 


1456 


gil 86555 


Homo sapiens 


Human insulin receptor-related 
receptor (IRR) mRNA, 3 ' end. 


6728 


99 


1456 


AAB36836 


Homo sapiens 


ZYMO Human insulin receptor- 
related receptor protein. 


6728 


99 


1457 


gi6453436 


Homo sapiens 


mRNA; cDNA DKFZp586E041 
(from clone DKFZp586E041); 
partial cds. 


10115 


100 


1457 


ABB 11803 


Homo sapiens 


HYSE- Human GPI-122 
homologue, SEQ IDNO:2173. 


6423 


99 


1457 


AAY50125 


Homo sapiens 


GEMY Human 
glycophosphatidylinositol- 
anchored protein GPI-122. 


6323 


100 


1458 


AAU00023 


Homo sapiens 


BIOJ Human activated T- 
lymphocyte associated sequence 2, 
ATLAS-2. 


3623 


99 


1458 


AAE04546 


Homo sapiens 


INCY- Human G-protein coupled 
receptor-2 (GCREC-2) protein. 


2570 


79 


1458 


ABB 11735 


Homo sapiens 


HYSE- Human vasopressin 
receptor homologue, SEQ ID 
NO:2105. 


2546 


100 


1459 


gi7021924 


Homo sapiens 


cDNA FLJ10081 fis, clone 
HEMBA1002018. 


2742 


100 


1459 


AAB92508 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 10631. 


2742 


100 


1459 


gil0435862 


Homo sapiens 


cDNA FLJ13751 fis, clone 
PLACE3000339, weakly similar to 
GLUCOAMYLASE S1/S2 
PRECURSOR (EC 3.2.1.3). 


2687 


99 


1460 


AAM95163 


Homo sapiens 


HUMA- Human reproductive 
system related antigen SEQ ID 
NO: 3821. 


75 


33 


1460 


AAM06875 


Homo sapiens 


HYSE- Human foetal protein, SEQ 
ID NO: 1083. 


68 


45 


1460 


AAG76978 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID NO:7742. 


65 


39 
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1401 


gll353365 


Macaca 
fascicularis 


MHC-G 


84 


30 


1461 


gi2655072 


Pan troglodytes 


MHC class I antigen HLA-H 
ortholog 


72 


28 


1461 


gi2655078 


Pan paniscus 


MHC class I antigen HLA-H 
ortholog 


72 


28 


1462 


gi 11066090 


Homo sapiens 


matrix metal loprotease MMP-27 
mRNA, complete cds. 


1191 


85 


1462 


AAE10410 


Homo sapiens 


SCHA/ Human full length matrix 
metalloproteinase-251 (MMP-251) 
protein. 


1188 


85 


1 A /CO 

14oz 


AAY90293 


Homo sapiens 


INCY- Human peptidase, HPEP- 
10 protein sequence. 


1188 


85 


1463 


gi601948 


Drosophila 
melanogaster 


Inscuteable 


126 


25 


1 /i <i 

1403 


gllo57962 


Drosophila 
melanogaster 


Nem 


115 


26 


1463 


gil5281684 


Bacteriophage 
Mx8 


p4 


66 


40 


1464 


AAB45378 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 38 SEQ 
ID NO:l30. 


434 


87 


1464 


gi406058 


Mus musculus 


protein kinase 


219 


54 


1 A CLA 

14o4 


gi 135 3 7204 


Homo sapiens 


mRNA for MAST205, complete 
cds. 


216 


54 


1466 


AAY97293 


Homo sapiens 


INCY- Lipid associated protein 
(LIPAP) 3335404CD1. 


1894 


77 


1466 


AAB2423 1 


Homo sapiens 


INCY- Human vesicle associated 
protein 10 SEQ ID NO: 10. 


1116 


48 


1466 


AAM39997 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3142. 


1116 


48 


1467 


gil3278855 


Homo sapiens 


calcium binding atopy-related 
autoantigen 1, clone MGC:2891 
IMAGE:3009677, mRNA, 
complete cds. 


919 


96 


1467 


gil327892l 


Homo sapiens 


calcium binding atopy-related 
autoantigen 1, clone MGC:4521 
IMAGE:3009677, mRNA, 
complete cds. 


919 


96 


1467 


AAB58329 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 667. 


919 


96 


1468 


gil 2654401 


Homo sapiens 


Similar to RNA cyclase homolog, 
clone MGC: 1390 
IMAGE:3343468, mRNA, 
complete cds. 


690 


78 


1468 


gil0434565 


Homo sapiens 


cDNA FLJ12842 fis, clone 
NT2RP2003286, weakly similar to 
PROBABLE RNA 3 -TERMINAL 
PHOSPHATE CYCLASE (EC 
6.5.1.4). 


690 


78 


1468 


AAB97250 


Homo sapiens 


HOMO RNA cyclase 41 protein. 


690 


78 


1469 


gil2053215 


Homo sapiens 


mRNA; cDNA DKFZp434K2435 
(from clone DKFZp434K2435); 
complete cds. 


273 


98 


1469 


gi2633333 


Bacillus subtilis 


yhaJ 


57 


25 


1469 


gil5023682 


Clostridium 
acetobutylicum 


Glycosyltransferase involved in 
cell wall biogenesis 


73 


34 
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1470 


gil4290599 


XUJ111U octpiCIlb 


clone Mut. i /oZ4 
IMAGE:3 855543, mRNA, 
uurnpiete cus. 


347 


100 


1470 


gi65265 


-/vt/iiw[JLio laCVla 


a xenopus upstream binding factor 


131 


29 


1470 


gi65201 


Xenopus laevis 


RNA polymerase I transcription 
factor 


128 


29 


1471 


gil3 182747 


Homo s aniens 


iiiiL/ioi>oindi signal peptiaase 
subunit mRNA, complete cds. 


1 A O 

148 


96 


1471 


AAW29660 


Hnmn csjirvipnc 


numo sapiens v^riz/ i clone 
secreted protein. 


148 


96 


1471 


gi 164084 


Oanis familiariQ 


signal pepnaase zi lcua suoumt 


141 


90 


1472 


AAG03600 


Homo sapiens 


Human secreted protein, SEQ ID 

INw. /OOl. 


245 


86 


1472 


gi7770239 


Homo ^flnipnc 


rXvUZoj 1 


139 


71 


1472 


gi 14026000 


Mesorhizobium 
loti 


transposase 


64 


46 


1473 


gi3065951 


Homo sapiens 


Notch3 (NOTCH3) gene, exon 33 


99 


58 


1473 


gi2668592 


Homo sapiens 


Notch3 (NOTCH3) mRNA, 
c-ompieie cas. 


99 


58 


1473 


AAW49698 


Homo sapiens 


Human Notch3 protein. 


99 


58 


1474 


gi458938 


Saccharomyces 


Yhrl86cp 


156 


58 


1474 


gi5921144 


Schizosaccharom 
yces pombe 


mipl 


151 


83 


1474 




j-/cinococcus 
radiodurans 


serine/threonine protein kinase- 
related protein 


96 


31 


1475 


glVJ^-' UZr 1 / J 


Homo sapiens 


mRNA; cDNA DKFZp566H033 
(^rrom clone DivFZp566H033); 
paniai cas. 


295 


91 


1475 


AAB38280 


Rnmn fjanifnc 


Human secreted protein sequence 
NO: 136. 


289 


96 


1475 


AAB94892 


Homo sapiens 


NO: 16234. 


Zo4 


89 


1476 


gi7 160973 


Homo saoiens 


iiijvi>/-v ioi viMiNj proiem. 




93 


1476 


gi6102996 


Mus musculus 


Vanin-3 


1661 


73 


1476 


gi6649540 


Canis familiaris 


TIFF66 


1370 


67 


1477 


gi7959741 


Homo sapiens 


PRO1051 


134 


76 


1477 


ei 1 7290^ 

gi 1 / £*Zs\JJ 


Oaccnaromyces 
cerevisiae 


transcription factor III A 


113 


26 


1477 


gi786305 


Saccharomyces 
cerevisiae . 


Transcription factor TFIIIA (PIR 
accession number S20050) 


113 


26 


1478 


cri 1 9^^952?^ 


Homo sapiens 


voltage-dependent anion channel 
2, clone MGC:5237 
IMAGE:2901 130, mRNA, 
complete cds. 


483 


74 


1478 


gi 15277577 


Homo sapiens 


voltage-dependent anion channel 
2, clone MGC:21498 
IMAGE:3875077, mRNA, 
complete cds. 


483 


74 


1478 


gi5 114261 


Homo sapiens 


voltage-dependent anion channel 
isoform 2 (VDAC2) gene, exon 10 
and complete cds. 


483 


74 


1479 


gil0435380 


Homo sapiens 


cDNA FLJ13381 fis, clone 
PLACE1001010. 


133 


58 
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1479 


AAB94622 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15476. 


133 


58 


1479 


AAB85361 


Homo sapiens 


Human phosphatase (PP) (clone ID 
7472032CD1). 


118 


50 


1480 


gil2803725 


Homo sapiens 


APG5 (autophagy 5, S. 
cerevisiae)-like, clone MGQ3622 
IMAGE:3609927, mRNA, 
complete cds. 


204 


100 


1480 


gi2995198 


Homo sapiens 


H.sapiens mRNA for apoptosis 
specific protein. 


204 


100 


1480 


gil2006864 


Homo sapiens 


apoptosis-related protein (APG5L) 
mRNA, complete cds, alternatively 
spliced. 


204 


100 


1481 


gil26 19679 


Conus arenatus 


conotoxin scaffold VI/VTI 
precursor 


56 


25 


1481 


gi2661493 


Drosophila 
melanogaster 


56F3.e 


72 


34 


1481 


gi2648663 


Archaeoglobus 
fulgidus 


DNA-directed RNA polymerase, 
subunit H (rpoH) 


55 


27 


1482 


gil2654623 


Homo sapiens 


RAB31, member RAS oncogene 
family, clone MGG1258 
IMAGE:3534853, mRNA, 
complete cds. 


194 


58 


1482 


gil457954 


Homo sapiens 


Human small GTP-binding protein 
rab22b mRNA, complete cds. 


194 


58 


1482 


gil388195 


Homo sapiens 


Human low-Mr GTP-binding 
protein (RAB31) mRNA, complete 
cds. 


194 


58 


1483 


gi7959778 


Homo sapiens 


PRQ1546 


163 


65 


1483 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6834. 


154 


68 


1483 


AAB93922 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 13907. 


137 


59 


1484 


gi6979921 


Drosophila 
melanogaster 


RhoGTPase ) 


160 


32 


1484 


gi7263024 


Drosophila 
melanogaster 


G protein RhoBTB 


160 


32 


1484 


gil5291731 


Drosophila 
melanogaster 


LD24835p 


160 


32 


1485 


gil4150450 


Rattus 
norvegicus 


UDP-GalNAc:polypeptide N- 
acetylgalactosaminyltransferase T9 


246 


93 


1485 


gi3047203 


Caenorhabditis 
elegans 


GLY7 


128 


54 


1485 


gi304259 


Bos taurus 


UDP-GalNAc:polypeptide, N- 
acetylgalactosaminyltransferase 


109 


45 


1486 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18850. 


202 


76 


1486 


AAG03710 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7791. 


192 


73 


1486 


AAG02922 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7003. 


182 


69 


1487 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


1244 


84 


1487 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:121. 


1249 


85 


1487 


gi339771 


Homo sapiens 


Human transposon Ll.l with a 


1243 


84 
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S score 
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base deletion relative to L1.2B 
resulting in a premature stop codon 
in the coding region. 






1488 


gil3560707 


Homo sapiens 


group XIII secreted phospholipase 
A2 mRNA, complete cds. 


383 


98 


1488 


gil5824793 


Homo sapiens 


group XIII secreted phospholipase 
A2 


383 


98 


1488 


AAY27572 


Homo sapiens 


Human secreted protein encoded 
by gene No. 6. 


383 


98 


1489 


gi2995442 


Homo sapiens 


mRNA for UDPGahGlcNAc bl,4 
galactosyltransferase. 


996 


98 


1489 


gi4520136 


Homo sapiens 


mRNA for beta- 1,4- 
galactosyltransferase II, complete 
cds. 


996 


98 


1489 


gi3869131 


Mus musculus 


beta-l,4-galactosyltransferase II 


946 


94 


1490 


AAM06551 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
282. 


140 


49 


1490 


AAB51718 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 45 SEQ ID 
NO:158. 


129 


58 


1490 


gil 196431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


124 


52 


1491 


gil2052884 


Homo sapiens 


mRNA; cDNA DKFZp564C2478 
(from clone DKFZp564C2478); 
complete cds. 


487 


98 


1491 


gi7023332 


Homo sapiens 


cDNA FLJ10961 fis, clone 
PLACE 10005 8 8, highly similar to 
INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1. 


487 


98 


1491 


AAB93371 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12521. 


487 


98 


1492 


AAG00392 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4473. 


150 


81 


1492 


AAB54106 


Homo sapiens 


Human pancreatic cancer antigen 
protein sequence SEQ ID NO: 55 8. 


136 


71 


1492 


gi914110 


Streptococcus 
pyogenes 


EmmL15 


107 


20 


1493 


gi6855513 


Gallus gallus 


syndesmos 


554 


60 


1493 


gil 3623247 


Homo sapiens 


Similar to RIKEN cDNA 
1110001K21 gene, clone 
MGC: 11275 IMAGE: 3 9443 55, 
mRNA, complete cds. 


534 


58 


1493 


gil2544542 


Corynebacterium 
glutamicum 


RXA02115 


99 


28 


1494 


gil2082725 


Mus musculus 


B cell phosphoinositide 3 -kinase 
adaptor 


335 


62 


1494 


gil2082811 


Gallus gallus 


B cell phosphoinositide 3 -kinase 
adaptor 


211 


48 


1494 


gi330842 


Equine 
herpesvirus 1 


myristylated virion protein 


61 


33 


1495 


AAB43811 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1256. 


273 


100 


1495 


AAB64482 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 13 SEQ ID 
NO:120. 


273 


100 


1495 


gi5 106795 


Homo sapiens 


sec61 homolog mRNA, complete 


273 


100 



WO 03/054152 PCT/US02/39555 



Table 2A 
192 



SEQ ID 
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Species 


Description 
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cds. 






1496 


AAE01786 


Homo sapiens 


Human gene 17 encoded secreted 
protein HWBEM18, SEQ ID 
NO: 107. 


4730 


97 


1496 


gi6650678 


Mus musculus 


nuclear pore membrane 
glycoprotein POM210 


4025 


76 


1496 


gi56463 


Rattus 
norvegicus 


gp210(AA 1-1886) 


4000 


79 


1497 


gil 85996 


Homo sapiens 


Human Ig germline kappa L-chain 
V-region gene (HK1 66), V-kappa- 
1. 


607 


100 


1497 


AAR38651 


Homo sapiens 


Human V- kappa fragment encoded 
by clone vk65.15. 


607 


100 


1497 


AAR62931 


Homo sapiens 


Human V-kappa vk65.15 region. 


607 


100 


1498 


gi431857 


Homo sapiens 


H. sapiens mRNA for delta 4-3- 
oxosteroid 5 beta-reductase. 


460 


76 


1498 


gil 1640835 


Homo sapiens 


5-beta steroid reductase (SRD5B1) 
gene, exon 9 and complete cds. 


460 


76 


1498 


gi5689216 


Oryctolagus 
cuniculus 


delta4-3-oxosteroid 5beta- 
reductase 


442 


70 


1499 


gil752736 


Saccharomyces 
cerevisiae 


gene required for phosphoylation 
of oligosaccharides/ has high 
homology with YJR061w 


210 


47 


1499 


AAB53977 


Homo sapiens 


Human colon cancer antigen 
protein sequence SEQ ID 
NO:1517. 


178 


71 


1499 


AAB27956 


Homo sapiens 


Human secreted protein SEQ ID 
NO: 110. 


177 


65 


1500 


AAB93159 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12081. 


2209 


64 


1500 


AAB58796 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 504. 


1455 


65 


1500 


gil3377567 


Vibrio cholerae 


accessory colonization factor AcfD 


105 


26 


1501 


AAB95655 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18417. 


186 


78 


1501 


AAB95596 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18279. 


172 


80 


1501 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


182 


78 


1502 


AAB58202 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 540. 


427 


100 


1502 


gi4572328 


Homo sapiens 


caveolin-1 gene, exon 3 and 
complete cds. 


345 


100 


1502 


gi6599075 


Homo sapiens 


caveolin-l/-2 locus, Contigl, 
D7S522, genes CAV2 (exons 1, 
2a, and 2b), CAV1 (exons 1 and 
2). 


345 


100 


1503 


gil99584 


Mus musculus 


MHox 


405 


96 


1503 


gi51362 


Mus musculus 


DNA-binding protein 


405 


96 


1503 


gil 836044 


Rattus sp. 


rHox protein 


405 


96 


1504 


gi8 163762 


Homo sapiens 


membrane cofactor protein CD46 
variant (MCP) mRNA, partial cds. 


639 


83 


1504 


AAB58394 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 732. 


635 


82 


1504 


AAG75528 


Homo sapiens 


Human colon cancer antigen 


635 


82 
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% Identity 








protein SEQ ID NO: 6292. 






1505 


gil5990400 


Homo sapiens 


clone IMAGE:3954SS4, mRNA, 
partial cds. 


872 


83 


1505 


gi3523113 


Homo sapiens 


prostate-specific transglutaminase 
(TGM4) gene, alternative spliced 
variant, exon 2 and partial cds. 


92 


46 


1505 


AAB54389 


Homo sapiens 


Human pancreatic cancer antigen 
protein sequence SEQ ID NO: 841. 


84 


56 


1506 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16231. 


214 


65 


1506 


gil 196431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


197 


66 


1506 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO: 136. 


196 


69 


1507 


gil 184173 


Homo sapiens 


Human nucleoporin 98 (NUP98) 
mRNA, complete cds. 


1058 


100 


1507 


gil 1414896 


Homo sapiens 


NUP98 mRNA for nucleoporin, 
complete cds. 


1058 


100 


1507 


gi4545101 


Homo sapiens 


cell-line HeLa Nup98-Nup96 
precursor splice variant 1 mRNA, 
complete cds. 


1060 


88 


1508 


gi5 106521 


Homo sapiens 


K-Cl cotransporter KCC4 mRNA, 
complete cds. 


1062 


96 


1508 


gil 0440500 


Homo sapiens 


mRNA for FLJ00098 protein, 
partial cds. 


1062 


96 


1508 


gil 04405 14 


Homo sapiens 


mRNA for FLJ00105 protein, 
partial cds. 


1062 


96 


1509 


gi6691968 


Homo sapiens 


Human DNA sequence from clone 
RP1-148M19 on chromosome 
Xpl 1.22-11.3 Contains ZNF741 
(zinc finger protein), a ribosomal 
protein L23a pseudogene, STSs 
and GSSs, complete sequence. 


293 


81 


1509 


gi4096339 


Homo sapiens 


Human zinc finger protein 
(ZNF741) mRNA, complete cds. 


293 


81 


1509 


AAB21033 


Homo sapiens 


Human nucleic acid-binding 
protein, NuABP-37. 


293 


81 


1510 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


426 


57 


1510 


gi5 052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


425 


57 


1510 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO:12l. 


426 


57 


1511 


gil6041769 


Homo sapiens 


clone MGC:23189 
IMAGE:4854518, mRNA, 
complete cds. 


313 


55 


1511 


g i4454678 


Homo sapiens 


zinc finger protein 4 


308 


55 


1511 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) mRNA, complete 
cds. 


321 


55 


1512 


gil4027838 


Mesorhizobium 
loti 


transcriptional regulator 


91 


26 


1513 


gi48 86463 


Homo sapiens 

V 


mRNA; cDNA DKFZp586G1219 
(from clone DKFZp586G1219); 
partial cds. 


1039 


100 
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1513 


gi4337460 


Homo sapiens 


neuroblastoma-amplified protein 
mRNA, complete cds. 


1039 


100 


1513 


AAB38417 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 5 clone 
HCGMF16. 


1039 


100 


1514 


gil621611 


Homo sapiens 


Human TRAF family member- 
associated NF-kB activator TANK 
mRNA, complete cds. 


1667 


76 


1514 


gil518018 


Homo sapiens 


Human TRAF-interacting protein 
I-TRAF mRNA, complete cds. 


1655 


77 


1514 


AAW27163 


Homo sapiens 


Human TRAF inhibitor protein I- 
TRAF. 


1655 


77 


1515 


gil2957169 


Mus musculus 


synaptotagmin-like protein 3-b 


142 


60 


1515 


gil 3647085 


Mus musculus 


synaptotagmin-like protein 3-a 
delta 3S-II 


142 


60 


1515 


gil3647079 


Mus musculus 


synaptotagmin-like protein 3-a + 
3S-I 


142 


60 


1516 


gi6688199 


Homo sapiens 


mRNA for AMP-activated protein 
kinase gamma2 subunit (AMPK 
gamma2 gene). 


1055 


93 


1516 


gi5931569 


Homo sapiens 


mRNA for H91620p, complete 
cds. 


1055 


93 


1516 


gil2642942 


Homo sapiens 


AMP-activated protein kinase 
gamma subunit (PRKAG2) 
mRNA, complete cds. 


1055 


93 


1517 


gi6807587 


Homo sapiens 


Novel human gene mapping to 
chomosome 1. 


2360 


100 


1517 


gil 769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


1135 


49 


1517 


gil 86774 


Homo sapiens 


Human Kruppel related zinc finger 
protein (HTF10) mRNA, complete 
cds. 


904 


41 


1518 


gi9956065 


Homo sapiens 


clone CDABP0092 mRNA 
sequence. 


602 


100 


1518 


gi4038733 


Homo sapiens 


mRNA for beta 2-microglobulin, 
complete cds. 


602 


100 


1518 


gi5725512 


Homo sapiens 


beta-2 microglobulin gene, 
complete cds. 


602 


100 


1519 


gil87177 


Homo sapiens 


Human lamin-like protein in 
Hindlll repetitive element derived 
DNA, 3' end. 


216 


57 


1519 


gi339771 


Homo sapiens 


Human transposon Ll.l with a 
base deletion relative to L1.2B 
resulting in a premature stop codon 
in the coding region. 


223 


57 


1519 


gi5070622 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


223 


57 


1520 


AAE03963 


Homo sapiens 


Human gene 17 encoded secreted 
protein fragment, SEQ ID NO: 142. 


110 


47 


1520 


gi4097459 


Elephantulus 
edwardii 


reverse transcriptase 


109 


47 


1520 


AAB94930 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16405. 


109 


46 


1521 


gil0934047 


Mus musculus 


Scot-tl 


1035 


77 


1521 


gil0934052 


Mus musculus 


Scot-t2 


1035 | 77 
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1521 


gi 164423 


Sus scrofa 


succinyl-CoA: alpha-ketoacid 
coenzyme A transferase 


1024 


81 


1522 


gi 12003 128 


Eremothecium 
gossypii 


Yerl54p 


108 


31 


1522 


gi5052482 


Drosophila 
melanogaster 


BcDNA.GH02220 


102 


29 


1522 


gi6227006 


Arabidopsis 
thaliana 


F16G16.8 


94 


50 


1523 


gi3419880 


Homo sapiens 


mRNA for MDC/ADAMl l, 
complete cds. 


2989 


1 r\r\ 
100 


1523 


gi836683 


Homo sapiens 


Human 

metalloprotease/disintegrin-like 
(MDC) gene, partial cds. 


2989 


100 


1523 


AAR75352 


Homo sapiens 


Human fetal brain MDC protein. 


2984 


99 


1524 


gi 1109782 


Homo sapiens 


Human protein-tyrosine 
phosphatase mRNA, complete cds. 




9/ 


1524 


gi 178 1037 


Mus musculus 


neuronal tyrosine threonine 
phosphatase 1 


1996 


5/ 


1524 


AAB66436 


Homo sapiens 


Human MAP-kinase phosphatase 
hVH5. 


883 


99 


1525 


gi757911 


Homo sapiens 


H.sapiens mRNA for A2b 
adenosine receptor. 


442 


100 


1525 


gil78l50 


Homo sapiens 


Human adenosine A2b receptor 
(ADORA2) mRNA, complete cds. 


442 


100 


1525 


AAR41526 


Homo sapiens 


Human A2b adenosine receptor. 


442 


100 


1526 


gil3540160 


Homo sapiens 


TCF12-TEC fusion protein 
mRNA, partial cds. 


335 


79 


1526 


gi 183930 


Homo sapiens 


Human HEB helix-loop-helix 
protein (HEB) mRNA, complete 
cds. 


337 


75 


1526 


gi 184448 


Homo sapiens 


transcription factor (HTF4) 
mRNA, complete cds. 


337 


75 


1527 


AAB43940 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:1385. 


741 


95 


1527 


gil4602778 


Homo sapiens 


spermine synthase, clone 
MGC:2071 IMAGE:3506022, 
mRNA, complete cds. 


690 


95 


1527 


gi2198557 


Homo sapiens 


spermidine aminopropyltransferase 
mRNA, complete cds. 


690 


nr 

yj 


1528 


gi 13436 152 


Homo sapiens 


reticulocalbin 2, EF-hand calcium 
binding domain, clone MGO1650 
IMAGE:3505241, mRNA, 
complete cds. 


674 


O 1 

51 


1528 


gi469885 


Homo sapiens 


H.sapiens ERC-55 mRNA. 


674 


81 


1528 


AAW21949 


Homo sapiens 


E6-binding protein E6-BPSD7. 


674 


O 1 

51 


1529 


gil0440331 


Homo sapiens 


cDNA: FLJ23591 fis, clone 
LNG14729. 


606 


100 


1529 


AAB84327 


Homo sapiens 


Amino acid sequence of a human 
lyase and associated protein 
HLYAP-2. 


606 


100 


1529 


gil5150358 


Mus musculus 


UDP-glucuronic acid 
decarboxylase 


605 


99 


1530 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


3094 


92 


1530 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 


3094 


92 
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NO:121. 






1530 


gi5052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


3083 


91 


1531 


gi5596433 


Homo sapiens 


candidate tumor suppressor protein 
NOC2 (NOC2) mRNA, complete 
cds. 


330 


87 


1531 


gi7020579 


Homo sapiens 


cDNA FLJ20462 fis, clone 
KAT06107. 


330 


87 


1531 


gi 13477353 


Homo sapiens 


Similar to rabphilin 3A-like 
(without C2 domains), clone 
MGC:3453 IMAGE:3529317, 
mRNA, complete cds. 


329 


87 


1532 


gil531645 


Rattus 
norvegicus 


C2-HC type zinc finger protein r- 
MyT3 


1914 


0/ 


1532 


gi29 14751 


Rattus 
norvegicus 


neural zinc finger factor 3; NZF-3 


1909 


66 


1532 


gil531653 


Xenopus laevis 


C2-HC type zinc finger protein X- 
MyTl 


300 


42 


1533 


gil2805043 


Homo sapiens 


clone IMAGE:3461487, mRNA, 
partial cds. 


364 


41 


1533 


AAG81328 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:174. 


356 


41 


1533 


gi2226004 


Homo sapiens 


Human Tiggerl transposable 
element, complete consensus 
sequence. 


441 


71 


1534 


gil0436783 


Homo sapiens 


cDNA FLJ14341 fis, clone 
THYRO 1000343, wealdy similar 
toATROPHIN-1. 


3184 


97 


1534 


AAB95860 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18924. 


3184 


97 


1534 


gil2802159 


Homo sapiens 


SH3-SAM adaptor protein 
(HACS1) mRNA, complete cds. 


515 


46 


1535 


gil5928572 


Mus musculus 


Similar to leucine rich repeat (in 
FLU) interacting protein 2 


103 


66 


1535 


gil4091821 


Oryza sativa 


Putative protein with region similar 
to cyclin-dependent kinase like 
proteins 


98 


38 


1535 


gi7020214 


Homo sapiens 


cDNA FLJ20248 fis, clone 
COLF6543. 


97 


63 


1536 


AAG02639 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6720. 


160 


71 


1536 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6834. 


141 


60 


1536 


gi7959778 


Homo sapiens 


PR01546 


140 


60 


1537 


gi7022610 


Homo sapiens 


cDNA FLJ 10521 fis, clone 
NT2RP2000841. 


184 


39 


1537 


AAB92909 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11539. 


184 


39 


1537 


gil 1527193 


Drosophila 
melanogaster 


Sunday driver 


112 


41 


1538 


gil2654055 


Homo sapiens 


clone IMAGE:3455871, mRNA, 
partial cds. 


849 


85 


1538 


gi3133291 


Homo sapiens 


mitogen activated protein kinase 
activated protein kinase gene, 
complete cds. 


844 


85 


1538 


gi2911813 


Mus musculus 


mitogen-activated protein kinase- 
activated protein kinase 


841 


84 
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1539 


gil 1761810 


Arabidopsis 
thaliana 


glutathione dependent 
dehydroascorbate reductase 
precursor 


60 


32 


1540 


gil3162677 


Homo sapiens 


GLUT4 enhancer factor mRNA, 
complete cds. 


1399 


99 


1540 


gil2655101 


Homo sapiens 


clone IMAGE:3 140406, mRNA, 
partial cds. 


1399 


99 


1540 


AAB58934 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 642. 


1395 


99 


1541 


gi30058 


Homo sapiens 


Human mRNA for pro-alpha- 1 
type 3 collagen. 


1606 


98 


1541 


gil6197601 


Homo sapiens 


type III preprocollagen alpha 1 
chain (COL3A1) gene, exon 1 and 
complete cds. 


1606 


98 


1541 


AAW12842 


Homo sapiens 


Truncated pro-alphal(III) chain. 


1606 


98 


1542 


AAE01436 


Homo sapiens 


Human gene 1 encoded secreted 
protein HWLFJ10, SEQ ID 
NO:91. 


1622 


99 


1542 


AAE01464 


Homo sapiens 


Human gene 1 encoded secreted 
protein HWLFJ10, SEQ ID 
NO: 119. 


1618 


98 


1542 


AAE01515 


Homo sapiens 


Human gene 1 encoded secreted 
protein fragment, SEQ ID NO: 172. 


1618 


98 


1543 


gil 86043 


Homo sapiens 


immunoglobulin light chain 
variable region (IGL@) mRNA, 
partial cds. 


547 


86 


1543 


gi2 19886 


Homo sapiens 


Human Ig kappa light chain gene, 
V- and J-region. 


543 


89 


1543 


gi33248 


Homo sapiens 


H.sapiens gene for Ig kappa light 
chain variable region '012'. 


540 


89 


1544 


gi7673618 


Mus musculus 


ubiquitin specific protease 


1972 


73 


1544 


gi7328168 


Homo sapiens 


mRNA; cDNA DKFZp434K1822 
(from clone DKFZp434Kl822); 
partial cds. 


1004 


74 


1544 


gi5823525 


Drosophila 
melanogaster 


ubiquitin-specific protease nonstop 


957 


41 


1545 


gi6693836 


Rattus 
norvegicus 


SNIP-b 


3975 


85 


1545 


gi6693834 


Rattus 
norvegicus 


SNIP-a 


3975 


85 


1545 


gi3098418 


Mus musculus 


P140 


3699 


75 


1546 


gil 79433 


Homo sapiens 


Human biglycan (BGN) gene, 
exon 8. 


2032 


95 


1546 


gil2803217 


Homo sapiens 


biglycan, clone MGC:2298 
IMAGE:3 162633, mRNA, 
complete cds. 


1512 


96 


1546 


gil3279002 


Homo sapiens 


biglycan, clone MGC:10461 
complete cds. 


1512 


96 


1547 


gil4718648 


Homo sapiens 


allantoicase mRNA, partial cds. 


777 


95 


1547 


gi9255889 


Mus musculus 


allantoicase 


1039 


58 


1547 


gi4929823 


Xenopus laevis 


allantoicase 


681 


41 


1548 


gil2653161 


Homo sapiens 


ribosomal protein L35, clone 
MGC:8582 IMAGE:2960987, 
mRNA, complete cds. 


203 


78 
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1548 


gil5012043 


Homo sapiens 


Similar to ribosomal protein L35, 
clone MGC: 13488 
IMAGE:4251487, mRNA, 
complete cds. 


203 


78 


1548 


gi562074 


Homo sapiens 


Human ribosomal protein L35 
mRNA, complete cds. 


203 


78 


1549 


gil903236 


Mus musculus 


capping protein beta 3 subunit 


156 


100 


1549 


gi595257 


Homo sapiens 


Human F-actin capping protein 
beta subunit mRNA, complete cds. 


156 


100 


1549 


gi500749 


Mus musculus 


capping protein beta subunit, 
isoform 2 


156 


100 


1550 


gil5278186 


Homo sapiens 


MAGI- 1 A mRNA, complete cds, 
alternatively spliced. 


526 


91 


1550 


gi3370998 


Homo sapiens 


mRNA for BAI1 -associated 
protein 1, complete cds. 


526 


91 


1550 


gil5278182 


Homo sapiens 


MAGI- IB alpha beta mRNA, 
complete cds, alternatively spliced. 


526 


y i 


1551 


gi 12654299 


Homo sapiens 


clone IMAGE:3447394, mRNA, 
partial cds. 


1945 


c\r\ 

99 


1551 


gil 1095188 


Homo sapiens 


dipeptidyl peptidase 8 (DPP 8) 
mRNA, complete cds. 


1357 


67 


1551 


AAB47187 


Homo sapiens 


Human DPP 8. 


1357 


67 


1552 


gil91012 


Cricetus cricetus 


ornithine decarboxylase 


153 




1552 


gi49440 


Cricetus cricetus 


ornithine decarboxylase (AA 1- 
455) 


153 


a © 
io 


1552 


gi9858179 


Danio rerio 


ornithine decarboxylase 


149 


44 


1553 


gil2053087 


Homo sapiens 


mRNA; cDNA DKFZp434B0819 
(from clone DKFZp434B0819); 
complete cds. 


168 


100 


1553 


gil6118555 


Homo sapiens 


ELMOl mRNA, complete cds. 


168 


100 


1553 


gil6118551 


Mus musculus 


ELMOl 


168 


100 


1554 


AAY91640 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID 
NO:313. 


238 


81 


1554 


AAB93352 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12476. 


238 


81 


1554 


AAY91484 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 34 SEQ ID 
NO:157. 


233 


100 


1555 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


726 


87 


1555 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 121. 


726 


87 


1555 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


727 


87 


1556 


gi4530437 


Homo sapiens 


thyroid hormone receptor- 
associated protein complex 
component l KArz4U mKJN A, 
complete cds. 


1125 


90 


1556 


gi7109237 


Drosophila 
melanogaster 


TRAP240 


213 


44 


1556 


gi7230590 


Drosophila 
melanogaster 


Pap/DTRAP240 


213 


44 


1557 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 


2330 


89 
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NO: 121. 






1557 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


2325 


89 


1557 


gi5 052951 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


2309 


88 


1558 


gill 066463 


Rattus 
norvegicus 


RhoGEF glutamate transport 
modulator GTRAP48 


3450 


72 


1558 


gi7 110160 


Homo sapiens 


guanine nucleotide exchange factor 
(LARG) mRNA, complete cds. 


1063 


50 


1558 


AAW64468 


Homo sapiens 


Human secreted protein from clone 
CW420 2. 


1063 


50 


1559 


gil0440888 


Morone saxatilis 


myosin heavy chain FM3A 


456 


51 


1559 


gil5982970 


Danio rerio 


myosin IIIA 


476 


55 


1559 


gi7958618 


Homo sapiens 


class III myosin (MY03A) 
mRNA, complete cds, alternatively 
spliced. 


456 


51 


1560 


AAB93846 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13688. 


1112 


66 


1560 


gi3834629 


Mus musculus 


diaphanous-related formin; pi 34 
mDia2 


565 


28 


1560 


gi3171906 


Homo sapiens 


mRNA for dia-1 56 protein. 


559 


30 


1561 


AAB59019 


Homo sapiens 


Breast and ovarian cancer 
associated antigen protein 
sequence SEQ ID 727. 


162 


47 


1561 


AAG74843 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO: 5 607. 


88 


52 


1561 


AAB95751 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18660. 


85 


30 


1562 


gil3195147 


Mus musculus 


HCH 


1228 


87 


1562 


AAW03515 


Homo sapiens 


Human DOCK180 protein. 


1308 


56 


1562 


gil339910 


Homo sapiens 


Human DOCK180 protein mRNA, 
complete cds. 


1304 


56 


1563 


AAB52017 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID NO:66. 


266 


100 


1563 


gi6449393 


Callinectes 
sapidus 


copper-specific metallothionein 
CuMT-II 


57 


37 


1563 


gi60 10621 


Human 

calicivirus strain 
BAV/2.1/98/DE 
U 


RNA-dependent RNA polymerase 


51 


36 


1564 


gi540073 


Homo sapiens 


Human agouti gene, exon 3 and 
complete cds. 


362 


97 


1564 


gi608648 


Homo sapiens 


agouti signalling protein (ASP) 
gene, complete cds. 


356 


96 


1564 


AAW10102 


Homo sapiens 


Human agouti signalling protein. 


352 


84 


1565 


gi2463646 


Homo sapiens 


Human 3-hydroxy-3- 
methylglutaryl CoA synthase gene, 
exon 9 and complete cds. 


1277 


100 


1565 


gi619877 


Homo sapiens 


H.sapiens mRNA for 3-hydroxy~3- 
methylglutaryl coenzyme A 
synthase. 


1277 


100 


1565 


gil5928472 


Mus musculus 


3-hydroxy-3-methylglutaryl- 
Coenzyme A synthase 2 


1155 


88 


1566 


gi5912057 


Homo sapiens 


mRNA; cDNA DKFZp434B172 
(from clone DKFZp434B 172); 1 
partial cds. 


350 


98 
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1566 


gi7578787 


Homo sapiens 


AD021 protein (AD021) mRNA, 
complete cds. 


157 


41 


1566 


AAB57036 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:l6l4. 


69 


33 


1567 


gi2226005 


Homo sapiens 


Human Tiggerl transposable 
element, complete consensus 
sequence. 


269 


53 


1567 


gi7634783 


Homo sapiens 


HDCMB45P mRNA, partial cds. 


210 


43 


1567 


AAB39252 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 12 SEQ ID 
NO: 132. 


126 


61 


1568 


gil2653985 


Homo sapiens 


glycine cleavage system protein H 
(aminomethyl carrier), clone 
MGC:5190 IMAGE:3451361, 
mRNA, complete cds. 


846 


91 


1568 


gi 184348 


Homo sapiens 


Human H-protein mRNA, 
complete cds. 


846 


91 


1568 


gi2 19671 


Homo sapiens 


mRNA for hydrogen carrier 
protein, a component of an enzyme 
complex, glycine synthase (EC 
2.1.2.10). 


846 


91 


1569 


gi388109 


Enterococcus 
faecalis 


regulatory protein 


64 


47 


1569 


AAY25744 


Homo sapiens 


Human secreted protein encoded 
from gene 34. 


56 


50 


1570 


gi532505 


Homo sapiens 


Human bile acid CoA: Amino acid 
N-acyltransferase mRNA, 
complete cds. 


2049 


99 


1570 


gil5215152 


Mus museums 


Similar to bile acid-Coenzyme A 
dehydrogenase: amino acid n- 
acyltransferase 


1412 


68 


1570 


gi604902 


Rattus 
norvegicus 


Kan-1 


1394 


68 


1571 


AAG89290 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 410. 


198 


97 


1571 


gil5487674 


Homo sapiens 


OSBP-related protein 1 mRNA, 
complete cds. 


204 


100 


1571 


AAB38248 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 46 SEQ ID 
NO: 104. 


62 


51 


1572 


gi4263739 


Homo sapiens 


BAG clone GS1-489L14 from 
7pl4-pl2, complete sequence. 


594 


68 


1572 


gi6l028l2 


Homo sapiens 


GLI3 gene for GLI3 protein. 


594 


68 


1572 


gil 83248 


Homo sapiens 


Human DNA-binding protein 
(GLI3) mRNA, complete cds. 


594 


68 


1573 


gil0l9435 


Trypanosoma 
cruzi 


mucin-like protein 


133 


37 


1573 


gil0l9433 


Trypanosoma 
cruzi 


mucin-like protein; Method: 
conceptual translation supplied by 
author 


114 


38 


1573 


gil280434 


Drosophila 
melanogaster 


hemomucin 


127 


33 


1574 


AAB76873 


Homo sapiens 


Human lung tumour protein related 
protein sequence SEQ ID NO:798. 


141 


37 


1574 


gi60l5472 


Hylobates 
muelleri 


dopamine receptor D4 


133 


34 
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1574 


gil3421134 


Caulobacter 
crescentus 


translation initiation factor IF-2 


155 


34 


1575 


AAB74709 


Homo sapiens 


Human membrane associated 
protein MEMAP-15. 


1294 


100 


1575 


gil4495648 


Homo sapiens 


clone MGC: 15606 
IMAGE:3163718, mRNA, 
complete cds. 


334 


36 


1575 


gil5721997 


Homo sapiens 


zonadhesin (ZAN) gene, complete 
cds, alternatively spliced. 


173 


20 


1576 


gil 181628 


Homo sapiens 


H.sapiens APXL mRNA. 


1624 


78 


1576 


gil773381 


Homo sapiens 


chromosome X clone U177G4, 
U152H5, U168D5, 174A6, 
U172D6, and U186B3 from Xp22, 
complete sequence. 


1624 


78 


1576 


AAW58988 


Homo sapiens 


Homo sapiens fetal kidney clone 
BD335_14 encoded protein. 


317 


71 


1577 


gi6979943 


Homo sapiens 


type 1 tumor necrosis factor 
receptor shedding aminopeptidase 
regulator mRNA, complete cds. 


1499 


86 


1577 


gi6381989 


Homo sapiens 


adipocyte-derived leucine 
aminopeptidase mRNA, complete 
cds. 


1492 


86 


1577 


gi6642987 


Homo sapiens 


aminopeptidase PILS (APPILS) 
mRNA, complete cds. 


1492 


86 


1578 


gil4626461 


Rhizobium 
leguminosarum 


HupE 


82 


28 


1578 


gi897 


Canis familiaris 


endothelin-2 


64 


36 ! 


1579 


gi7657864 


Homo sapiens 


BAG clone RP1 1-236P2 from 2, 
complete sequence. 


4025 


98 


1579 


gil 374698 


Homo sapiens 


mRNA for nuclear protein, NP220, 
complete cds. 


4014 


98 


1579 


AAY07032 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


4014 


98 


1580 


gi551065 


Mus musculus 


protease-nexin 1 


995 


83 


1580 


gil4715029 


Mus musculus 


serine (or cysteine) proteinase 
inhibitor, clade E (nexin, 
plasminogen activator inhibitor 
type 1), member 2 


995 


83 


1580 


gi412265 


Rattus 
norvegicus 


glia-derived neurite-promoting 
factor (GdNPF) 


994 


82 


1581 


gi499184 


Felis catus 


neuronal protein 


305 


93 


1581 


gil 0433455 


Homo sapiens 


cDNA FLJ 12066 fis, clone 
HEMBB 1002266, moderately 
similar to NEURONAL 
PROTEIN. 


286 


65 


1581 


AAB95041 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16804. 


286 


65 


1582 


AAY13385 


Homo sapiens 


Amino acid sequence of protein 
PR0293. 


3183 


99 


1582 


AAB80253 


Homo sapiens 


Human PR0293 protein. 


3183 


99 


1582 


AAB33472 


Homo sapiens 


Human PRO 133 8 protein UNQ693 
SEQ ID NO:279. 


1676 


55 


1583 


gil2803185 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 8463 
IMAGE:2821577, mRNA, 
complete cds. 


268 


75 
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1583 


gil4250152 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 14826 
IMAGE-.4276604, mRNA, 
complete cds. 


268 


75 


1583 


gil52 14852 


Homo sapiens 


nucleophosmin (nucleolar 
phosphoprotein B23, numatrin), 
clone MGC: 13433 
IMAGE:4097025, mRNA, 
complete cds. 


268 


75 


1584 


gi4406691 


Homo sapiens 


clone 24922 mRNA sequence, 
complete cds. 


697 


100 


1584 


gi7023544 


Homo sapiens 


cDNA FLJ11094 fis, clone 
PLACE 10053 73, weakly similar to 
TRNA PSEUDOURIDINE 
SYNTHASE B (EC 4.2.1.70). 


697 


100 


1584 


AAB93492 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12796. 


69 / 


inn 


1585 


gil55999 


Bombyx mori 


silk fibroin 


1 A 'I 

14/ 




1585 


gi930003 


Bombyx mori 


silk fibroin (AA 37-252) 


147 


39 


1585 


gi765323 


Bombyx mori 


silk fibroin heavy chain 


158 


39 


1586 


gil2002682 


Homo sapiens 


FERM-containing protein (CGI) 
mRNA, alternative splice product, 
complete cds. 


1282 


on 


1586 


gi7669988 


Homo sapiens 


mRNA; cDN A DKFZp76 IN 1814 
(from clone DKFZp761N1814). 


1090 


oo 


1586 


AAB12318 


Homo sapiens 


Human secreted protein encoded 
by gene 18 clone HE2FL70. 


189 


100 


1587 


gi5019618 


Homo sapiens 


HFB30 mRNA, complete cds. 


1005 


90 


1587 


gi4530066 


Homo sapiens 


androgen receptor associated 
protein 54 (ARA54) mRNA, 
complete cds. 


1005 


90 


1587 


AAY78418 


Homo sapiens 


Human androgen receptor 
coactivator ARA54 SEQ ID NO:2. 


1005 


90 


1588 


AAG73501 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO:277. 


285 


94 


1588 


gil 196432 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


285 


94 


1588 


gi339771 


Homo sapiens 


Human transposon LI .1 with a 
base deletion relative to L1.2B 
resulting in a premature stop codon 
in the coding region. 


285 


94 


1589 


AAB94900 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16288. 


287 


67 


1589 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


289 


70 


1589 


gil916229 


Homo sapiens 


Human line-1 reverse transcriptase 
gene, partial cds, and granulocyte 
chemotactic protein-2 (GCP-2) 
gene, complete cds. 


278 


70 


1590 


gi6562173 


Homo sapiens 


mRNA; cDNA DKFZp566H033 
(from clone DKFZp566H033); 
partial cds. 


227 


88 


1590 


gil335199 


Homo sapiens 


Human Kpnl repetitive sequence 
(T-betaG41) 3kb downstream of 
beta-globin gene. 


227 . 


86 


1590 


AAB38280 


Homo sapiens 


Human secreted protein sequence 


226 


88 
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encoded by gene 20 SEQ ID 
NO: 136. 






1591 


gil0436007 


Homo sapiens 


cDNA FLJ13859 fis, clone 
THYRO1001033, weakly similar 
to TRANSFORMATION- 
SENSITIVE PROTEIN IEF SSP 
3521. 


1794 


80 


1591 


AAB95593 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18273. 


1794 


80 


1591 


gi7020708 


Homo sapiens 


cDNA FLJ20535 fis, clone 
KAT11013. 


1789 


79 


1592 


gi6599260 


Homo sapiens 


mRNA; cDNA DKFZp434P1721 
(from clone DKFZp434P1721); 
partial cds. 


1184 


76 


1592 


gil0567164 


Homo sapiens 


GASC-1 mRNA, complete cds. 


561 


59 


1592 


AAR66461 


Homo sapiens 


AF-17 protein, N-terminal region 
with similarity to peregrin. 


181 


34 


1593 


gil 82020 


Homo sapiens 


Human elastin gene, exon 1 . 


2405 


89 


1593 


AAB08630 


Homo sapiens 


Amino acid sequence of a human 
elastin polypeptide. 


2392 


88 


1593 


gil 82062 


Homo sapiens 


Human elastin mRNA, complete 
cds. 


2386 


88 


1594 


gil4424570 


Homo sapiens 


clone MGG16614 
IMAGE:41 1 1344, mRNA, 
complete cds. 


349 


88 


1594 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


349 


89 


1594 


AAB38012 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HNHCT15. 


349 


89 


1595 


gi291854 


Homo sapiens 


aminopeptidase A mRNA, 
complete cds. 


934 


100 


1595 


gil518865 


Sus scrofa 


aminopeptidase A 


876 


89 


1595 


gi7673021 


Rattus ; 
norvegicus 


aminopeptidase A 


847 


85 


1598 


gi7288173 


Homo sapiens 


gene for alphal,6 
fucosyltransferase, exon 5. 


367 


100 


1598 


gi2055307 


Homo sapiens 


mRNA for N-Acetyl-beta-D- 
glucosaminide, complete cds. 


367 


100 


1598 


gi3451263 


Homo sapiens 


mRNA for glycoprotein 6-alpha-L- 
fucosyltransferase, transcript Bl. 


367 


100 


1599 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12300. 


149 


46 


1599 


gi295671 


Saccharomyces 
cerevisiae 


selected as a weak suppressor of a 
mutant of the subunit AC40 of 
DNA dependant UNA polymerase 
I and III 


123 


25 


1599 


gil 15593 13 


Halocynthia 
roretzi 


synaptotagmin 


121 


24 


1600 


AAB48139 


Homo sapiens 


Human TANGO 209 variant 1 
polypeptide. 


219 


79 


1600 


AAB48140 


Homo sapiens 


Human TANGO 209 variant 2 
polypeptide. 


219 


79 


1600 


AAB48141 


Homo sapiens 


Human TANGO 209 variant 3 
polypeptide. 


224 


51 


1602 


gil2053353 


Homo sapiens 


mRNA; cDNA DKFZp586D0222 
(from clone DKFZp586D0222); 
complete cds. 


548 


82 
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1602 


gi7023456 


Homo sapiens 


cDNA FLJ11040 fis, clone 
PLACE1004388. 


548 


82 


1602 


gil0434241 


Homo sapiens 


cDNAFLJ12633fis, clone 
NT2RM4001856. 


548 


82 


1603 


gil5082532 


Homo sapiens 


clone MGC:20434 
IMAGE:4650497, mRNA, 
complete cds. 


212 


85 


1603 


gil3096814 


Mus musculus 


RIKEN cDNA 4932442K08 gene 


165 


72 


1603 


gi9622395 


Plasmodium 
falciparum 


variant surface protein 


66 


44 


1604 


gil321596 


Homo sapiens 


Human (clone HSLV2132) Ig 
lambda light chain V region, 
subgroup II, partial sequence. 


374 


100 


1604 


AAG76014 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:6778. 


362 


97 


1604 


AAB36212 


Homo sapiens 


Human immune system associated 
protein HISAP-10. 


342 


91 


1605 


gil0434674 


Homo sapiens 


cDNA FLJ1291 1 fis, clone 
NT2RP2004425, highly similar to 
Mus musculus axotrophin mRNA. 


2340 


94 


1605 


AAB95234 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17375. 


2340 


94 


1605 


AAB27239 


Homo sapiens 


Human EXMAD-17 SEQ ID NO: 
17. 


2340 


94 


1606 


gi8918522 


Homo sapiens 


PCCX2 mRNA for protein 
containing CXXC domain 2, 
partial cds. 


2200 


84 


1606 


gil4042116 


Homo sapiens 


cDNA FLJ14534 fis, clone 
NT2RM2000599, weakly similar 
to Homo sapiens F-box protein 
Lilina (LILINA) mRNA. 


2446 


78 


1606 


AAB92702 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 102. 


2446 


78 


1607 


gi3 176762 


Homo sapiens 


receptor for viral semaphorin 
protein (VESPR) mRNA, complete 
cds. 


807 


100 


1607 


AAY13462 


Homo sapiens ! 


Viral-encoded semaphorin protein 
receptor (VESPR) polypeptide. 


807 


100 


1607 


AAB28522 


Homo sapiens 


Human VESPR. 


807 


100 


1608 


gil373425 


Homo sapiens 


Human bumetani de-sensitive Na- 
K-2C1 cotransporter (NKCC2) 
mRNA, complete cds. 


1295 


99 


1608 


AAW29683 


Homo sapiens 


Human Na-K-2C1 cotransporter 
NKCC2. 


1295 


99 


1608 


gi5 16000 


Oryctolagus 
cuniculus 


bumetanide-sensitive Na-K-Cl 
cotransport protein splice isoform 
B 


1244 


95 


1609 


AAB95655 


Homo sapiens 


Human protein sequence SEQ ID 
NO:18417. 


229 


61 


1609 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 121. 


221 


58 


1609 


gil 196433 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


226 


60 


1610 


AAB88388 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0 131. 


923 


100 


1610 


AAB25719 


Homo sapiens 


Human secreted protein sequence 


114 


100 
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encoded by gene 6 SEQ ID 
NO: 108. 






1610 


AAB25718 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID 
NO: 107. 


73 


100 


1611 


AAY04734 


Homo sapiens 


Protein containing PDZ domain 
from clone 38-2- lc. 


269 


98 


1611 


AAY04732 


Homo sapiens 


Protein containing PDZ domain 
from clone 38-2-la. 


269 


98 


1611 


AAY53753 


Homo sapiens 


Amino acid sequence of the 
MMSC2 protein. 


269 


98 


1612 


gil794211 


Homo sapiens 


Human oncostatin-M specific 
receptor beta subunit (OSMRB) 
mRNA, complete cds. 


1127 


92 


1612 


AAR85912 


Homo sapiens 


Oncostatin M receptor-beta 
subunit. 


1127 | 


92 


1612 


gil5012082 


Homo sapiens 


Similar to oncostatin M receptor, 
clone MGC: 13583 
IMAGE:4043935, mRNA, 
complete cds. 


1127 


92 


1613 


gil2803103 


Homo sapiens 


heterogeneous nuclear 
ribonucleoprotein Al, clone 
MGC:8473 IMAGE: 282 1751, 
mRNA, complete cds. 


406 


82 


1613 


gil5082486 


Homo sapiens 


Similar to heterogeneous nuclear 
ribonucleoprotein Al, clone 
MGC:20389 IMAGE:4564655, 
mRNA, complete cds. 


406 


82 


1613 


gi496898 


Homo sapiens 


H.sapiens mRNA for hnRNPcore 
protein Al. 


406 


82 


1614 


gil2653633 


Homo sapiens 


lysyl oxidase-like 2, clone 
MGC: 1709 IMAGE: 3 3 475 12, 
mRNA, complete cds. 


898 


99 


1614 


gil890108 


Homo sapiens 


Human lysyl oxidase-related 
protein (WS9-14) mRNA, 
complete cds. 


898 


99 


1614 


AAB00077 


Homo sapiens 


Human lysyl oxidase related 
protein (Lor). 


898 


99 


1615 


gil2803157 


Homo sapiens 


COX15 (yeast) homolog, 
cytochrome c oxidase assembly 
protein, clone MGC: 8634 
IMAGE:2961532, mRNA, 
complete cds. 


159 


52 


1615 


gil5426569 


Homo sapiens 


clone MGC:4234 
IMAGE:2961532, mRNA, 
complete cds. 


159 


52 


1615 


gi3603230 


Homo sapiens 


cytochrome oxidase assembly 
factor (COX 15) mRNA, nuclear 
gene encoding mitochondrial 
protein, complete cds. 


159 


52 


1616 


gil215746 


Bos taurus 


vacuolar system associated 
protein-60 


1309 


67 


1616 


gi7672979 


Homo sapiens 


glucosidase II beta subunit mRNA, 
complete cds. 


1656 


71 


1616 


gi!293640 


Homo sapiens 


Human protein kinase C substrate 
80K-H (PRKCSH) gene, exon 15- 
17. 


1652 


70 
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1617 


gil418820 


Homo sapiens 


H. sapiens mRNA for gamma 1 
isoform of 61kDa regulatory 
subunit ofPP2A. 


355 


100 


1617 


gil203812 


Homo sapiens 


Human protein phosphatase 2A 
B 'alpha 1 regulatory subunit 
mRNA, complete cds. 


355 


100 


1617 


gi 1000892 


Homo sapiens 


protein phosphatase 2 A B56- 
gammal (PP2A) mRNA, 3' end of 
cds. 


355 


100 


1618 


gil4010930 


Homo sapiens 


BAG clone RP1 1-576F1 from 2, 
complete sequence. 


1868 


100 


1618 


gi7022375 


Homo sapiens 


cDNA FLJ10379 fis, clone 
NT2RM2002014. 


1863 


99 


1618 


AAB92758 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11220. 


1863 


99 


1619 


gi296665 


Homo sapiens 


Human bone marrow serine 
protease gene (medullasin) 
(leukocyte neutrophil elastase 
gene). 


629 


93 


1619 


gi34533 


Homo sapiens 


Human mRNA for medullasin 
(leukocyte (neutrophil) elastase. 


629 


93 


1619 


gi307123 


Homo sapiens 


Human elastase/medullasin 
mRNA, complete cds. 


629 


93 


1620 


gi9965989 


Homo sapiens 


calcineurin A catalytic subunit 
gamma isoform mRNA, complete 
cds. 


1650 


97 


1620 


gi258001 


human, testis, 
mRNA, 2134 nt]. 
[Homo sapiens 


calcineurin A catalytic subunit 


1639 


96 


1620 


gi 13436077 


Homo sapiens 


clone MGC: 10576 
IMAGE:3677098, mRNA, 
complete cds. 


1623 


94 


1621 


gi 1429374 


Escherichia coli 


DcuC protein 


739 


100 


1621 


gil3360118 


Escherichia coli 
0157:H7 


c4-dicarboxylate anaerobic carrier 
DcuC 


739 


100 


1621 


gil786839 


Escherichia coli 
K12 


transport of dicarboxylates 


739 


100 


1622 


AAB82485 


Homo sapiens 


Human secretin-like receptor 
Zgprl. 


271 


94 


1622 


AAB66272 


Homo sapiens 


Human TANGO 378 SEQ ID NO: 
29. 


271 


94 


1622 


AAB82487 


Homo sapiens 


Human secretin-like receptor 
Zgprl splice variant. 


271 


94 


1623 


gi4062658 


Escherichia coli 


Flagellar basal body P-ring protein 
precursor 


752 


87 


1623 


gil787320 


Escherichia coli 
K12 


homolog of Salmonella P-ring of 
flagella basal body 


752 


87 


1623 


gil3360922 


Escherichia coli 
0157:H7 


flagellar basal body P-ring protein 
Flgl 


751 


86 


1624 


gil773192 


Escherichia coli 


similar to S. cerevisiae dall 


334 


98 


1624 


gi2735238 


Escherichia coli 


GlxB3 


334 


98 


1624 


gil3360031 


Escherichia coli 
0157:H7 


allantoinase 


333 


96 


1625 


gi40992 


Escherichia coli 


dehydrogenase 


1481 


93 


1625 


gil3363752 


Escherichia coli 
0157:H7 


aspartate-semialdehyde 
dehydrogenase 


1481 


93 
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1625 


gi3859587 


Shigella sonnei 


aspartate semialdehyde 
dehydrogenase 


1481 


93 


1626 


gil46346 


Escherichia coli 


phenylalanyl-tRNA synthetase 
beta-subunit 


735 


94 


1626 


gil742793 


Escherichia coli 


Phenylalanine— tRN A ligase (EC 
6.1.1.20) b chain 


735 


94 


1626 


gil3361887 


Escherichia coli 
0157:H7 


phenylalanine tRNA synthetase 
beta-subunit 


735 


94 


1627 


AAB49502 


Homo sapiens 


Clone HYASC03. 


310 


98 


1627 


gi7020468 


Homo sapiens 


cDNA FLJ20396 fis, clone 
KAT00561. 


144 


48 


1627 


AAB 18980 


Homo sapiens 


Amino acid sequence of a human 
transmembrane protein. 


144 


48 


1628 


gil4021587 


Mesorhizobium 
loti 


transcriptional regulator 


333 


38 


1628 


gil4523075 


Sinorhizobium 
meliloti 


probable LysR-family protein 


329 


39 


1628 


gi9949248 


Pseudomonas 
aeruginosa 


probable transcriptional regulator 


327 


37 


1629 


gi3 11422 


Escherichia coli 


ORF-2 


282 


100 


1629 


gil5081358 


uncultured 
bacterium 


PufM 


53 


41 


1629 


gi 13362968 


Escherichia coli 
0157:H7 


holin protein 


52 


33 


1630 


gi 159333 


Leishmania 
major 


glycoprotein 96-92 


101 


28 


1630 


AAY91958 


Homo sapiens 


Human cytoskeleton associated 
protein 13 (CYSKP-13). 


100 


28 


1630 


gi8 163686 


Streptococcus 
pneumoniae 


surface protein PspC 


105 


29 


1632 


gi8 87820 


Escherichia coli 


UUG start; possible frameshift at 
end? 


820 


71 


1632 


gi466651 


Escherichia coli 


No definition line found 


343 


82 


1632 


gil742360 


Escherichia coli 


Phosphinothricin acetyltransferase 
(EC 2.3.1.-). 


248 


83 


1633 


gi7022678 


Homo sapiens 


cDNA FLJ10565 fis, clone 
NT2RP2002954. 


290 


100 


1633 


AAB92950 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 629. 


290 


100 


1633 


gil755198 


Cavia porcellus 


nitric oxide synthase 


71 


33 


1634 


AAY48563 


Homo sapiens 


Human breast tumour-associated 
protein 24. 


342 


100 


1634 


gil2804499 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:1360 IMAGE:2959583, 
mRNA, complete cds. 


440 


66 


1634 


gil2804759 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:3554 IMAGE:2959583, 
mRNA, complete cds. 


440 


66 


1635 


gil0438872 


Homo sapiens 


cDNA: FLJ22471 fis, clone 
HRC 10529. 


1620 


100 


1635 


AAY86509 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:424. 


696 


100 


1635 


AAY86510 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:425. 


436 


100 


1636 


gi8096340 


Homo sapiens 


mRNA for RERE, complete cds. 


365 


97 


1636 


gi7413896 


group A 


NSP3 protein 


71 


29 
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rotavirus 








1636 


gi9368393 


Human rotavirus 


NSP3 protein 


65 


27 


1637 


AAB58426 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 764. 


514 


94 


1637 


AAB00191 


Homo sapiens 


Breast cancer protein BCN5. 


514 


94 


1637 


AAB85481 


Homo sapiens 


Human 23553 sulfatase 
polypeptide. 


514 


94 


1638 


AAY42750 


Homo sapiens 


Human calcium binding protein 1 
(CaBP-1). 


741 


100 


1638 


gil2060826 


Homo sapiens 


serologically defined breast cancer 
antigen NY-BR-20 mRNA, partial 
cds. 


673 


89 


1638 


AAY07006 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


637 


88 


1639 


gi 10434205 


Homo sapiens 


cDNA FLJ12612 fis, clone 
NT2RM4001582, highly similar to 
Mus musculus COP9 complex 
subunit 7b (COPS7b) mRNA. 


908 


86 


1639 


AAB94175 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14484. 


908 


86 


1639 


gil5215085 


Mus musculus 


Similar to COP9 (constitutive 
photomorphogenic), subunit 7b 
(Arabidopsis) 


900 


85 


1640 


gil334836 


Human 
herpesvirus 4 


BCRF2 3072 repeat, reading frame 
1 


98 


29 


1640 


gil334837 


Human 
herpesvirus 4 


BWRF1 reading frame 2 


98 


29 


1640 


gil334838 


Human 
herpesvirus 4 


BWRF1 reading frame 3 


98 


29 


1641 


gil3161011 


Homo sapiens 


sclerostin gene, complete cds. 


758 


100 


1641 


gil3161020 


Homo sapiens 


sclerostin mRNA, complete cds. 


758 


100 


1641 


gi 132364 18 


Homo sapiens 


SOST (SOST) mRNA, complete 
cds. 


758 


100 


1642 


gil3436023 


Mus musculus 


RIKEN cDNA B2301 14J08 gene 


279 


94 


1642 


AAY36125 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 510. 


279 


94 


1642 


AAY99458 


Homo sapiens 


Human PRO 193 amino acid 
sequence SEQ ID NO:410. 


279 


94 


1643 


gil5680215 


Homo sapiens 


Similar to ribosomal protein L23a, 
clone MGC:23063 
IMAGE:4872948, mRNA, 
complete cds. 


428 


76 


1643 


gi 15990426 


Homo sapiens 


Similar to cadherin 1, type 1, E- 
cadherin (epithelial), clone 
MGC:23017 IMAGE: 3 95 9042, 
mRNA, complete cds. 


428 


76 


1643 


gi404015 


Homo sapiens 


Human ribosomal protein L23a 
mRNA, partial cds. 


428 


76 


1644 


gil2653271 


Homo sapiens 


thyroid hormone receptor 
interactor 13, clone MGC:8565 
IMAGE:2822981,mRNA, 
complete cds. 


514 


82 


1644 


gi2232019 


Homo sapiens 


HP VI 6 El protein binding protein 
mRNA, complete cds. 


514 


82 


1644 


gi703102 


Homo sapiens 


thyroid receptor interactor 
(TRIP 13) mRNA, partial cds. 


513 


82 
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1646 


gil 3544066 


Homo sapiens 


Similar to mini chromosome 
maintenance deficient 2 (S. 
cerevisiae), clone MGC: 13220 
IMAGE:3959276, mRNA, 
complete cds. 


273 


98 


1646 


gil4043350 


Homo sapiens 


clone MGC:2123 
IMAGE:3143264, mRNA, 
complete cds. 


273 


98 


1646 


g il4044016 


Homo sapiens 


clone MGC: 14281 
IMAGE:4l3l943, mRNA, 
complete cds. 


273 


98 


1648 


gi7960207 


Oncorhynchus 
mykiss 


vitelline envelope protein alpha 


252 


A 1 

43 


1648 


gil60198 


Plasmodium 
knowlesi 


circumsporozoite protein 


191 


HO 


1648 


gi 153 84295 


Mycoplasma 
bovis 


variable surface lipoprotein 
Vsp422-8 


1 / / 


1/1 


1650 


gi474280 


Mus musculus 


mannosyl-oligosaccharide alpha- 
1 ,2-mannosidase 


a i i 
91 1 


OQ 


1650 


gil5929672 


Mus musculus 


Similar to mannosidase 1, alpha 


911 


88 


1650 


gi2154997 


Sus scrofa 


Man9-mannosidase 


869 


83 


1651 


gi5915662 


Homo sapiens 


integrin alpha 1 1 subunit precursor 
(ITGA11) mRNA, complete cds. 


542 


O A 

84 


1651 


AAB30929 


Homo sapiens 


Amino acid sequence of a human 
alpha 11 integrin chain. 


542 


84 


1651 


AAB50085 


Homo sapiens 


Human A259. 


542 


84 


1652 


gi45 12295 


Homo sapiens 


DNA for immunoglobulin heavy- 
chain variable region, complete 
sequence, 3 of 5. 


619 


100 


1652 


gi296657 


Homo sapiens 


Human Ig heavy chain gene 
variable region V(12G-1) (v(h)-iv 
family). 


613 


99 


1652 


gil85579 


Homo sapiens 


H.sapiens immunoglobulin 
germline heavy chain gene, V 
region. 


601 


y I 


1654 


AAE04841 


Homo sapiens 


Human SGP039 phosphatase 
polypeptide. 


667 


92 


1654 


gi2582800 


Medicago sativa 


protein phosphatase 2C 


107 


41 


1654 


gi8778653 


Arabidopsis 
thaliana 


F9C16.6 


107 


a r\ 

40 


1655 


gi9928511 


Mycobacterium 
tuberculosis 


SEQ ID NO 18B* 


89 


27 


1655 


gil3897999 


Galleria 
mellonella 


silk protease inhibitor 2 precursor 


50 


40 


1655 


gi204419 


Rattus 
norvegicus 


glycam 1 


85 


24 


1656 


gil2653509 


Homo sapiens 


DKFZP564K1964 protein, clone 
MGC:8605 !MAGE:2961267, 
mRNA, complete cds. 


227 


95 


1656 


gi5912199 


Homo sapiens 


mRNA; cDNA DKFZp564K1964 
(from clone DKFZp564K1964); 
complete cds. 


227 


yj 


1656 


gi4530587 


Homo sapiens 


TADA1 protein mRNA, complete 
cds. 


227 


95 


1658 


gil2654931 


Homo sapiens 


protein disulfide isomerase-related 
protein, clone MGC:5517 
IMAGE:3454007, mRNA, 


2281 


100 
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complete cds. 






1658 


gil 136743 


Homo sapiens 


Human mRNA for protein 
disulfide isomerase-related protein 
P5, complete cds. 


2281 


100 


1658 


AAW25154 


Homo sapiens 


Human disulphide epimerase like 
enzyme, EP52. 


2281 


100 


1659 


gi6694278 


Homo sapiens 


cell recognition molecule Caspr2 
(CASPR2) mRNA, complete cds. 


356 


95 


1659 


gil3624214 


Homo sapiens 


contactin-associated protein 2 
(CNTNAP2) mRNA, complete 
cds. 


356 


95 


1659 


gil 857708 


Homo sapiens 


contactin associated protein 
(Caspr) mRNA, complete cds. 


140 


37 


1660 


AAG02922 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7003. 


136 


96 


1660 


gi34373 


Homo sapiens 


Human DNA for LINE-1 
transposable element ORFI and II. 


135 


96 


1660 


gi7959889 


Homo sapiens 


PR02221 


127 


88 


1661 


gi4588087 


Homo sapiens 


PTH-responsive osteosarcoma Bl 
protein (Bl) mRNA, complete cds. 


569 


87 


1661 


AAG74841 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:5605. 


569 


87 


1661 


gi4588085 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein (Dl) mRNA, partial cds. 


312 


84 


1662 


gi4754907 


Homo sapiens 


histone deacetylase 4 mRNA, 
complete cds. 


2090 


94 


1662 


AAB49957 


Homo sapiens 


Human histone deacetylase 
HDAC-4. 


2090 


94 


1662 


gil4495171 


Gallus gallus 


histone deacetylase-4 


1790 


78 


1663 


AAB74704 


Homo sapiens 


Human membrane associated 
protein MEMAP-10. 


183 


100 


1663 


gi2621542 


Methanothermob 
acter 

thermautotrophic 
us 


conserved protein 


83 


70 


1663 


g il0440349 


Homo sapiens 


mRNA for FLJ00009 protein, 
partial cds. 


82 


35 


1664 


gil5159543 


Agrobacterium 
tumefaciens 


AGR_LJ2143p 


99 


27 


1664 


gi5360174 


Gallus gallus 


NOTCH- 1 


98 


31 


1664 


gi4960212 


Bos taurus 


cone-rod homeobox 


85 


27 


1665 


AAB32388 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 8 SEQ ID 
NO:74. 


359 


100 


1665 


AAY91419 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 SEQ ID 
NO: 140. 


83 


36 


1665 


gi860970 


Homo sapiens 


H. sapiens mRNA for HPS protein. 


84 


35 


1666 


gil5012095 


Homo sapiens 


Similar to protease inhibitor 3, 
skin-derived (SKALP), clone 
MGC: 13613 IMAGE:4083155, 
mRNA, complete cds. 


621 


100 


1666 


gi28712 


Homo sapiens 


H. sapiens encoding skin-derived 
antileukoproteinase . 


621 


100 


1666 


gi219615 


Homo sapiens 


Human gene for elafm, complete 
cds. 


621 


100 
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1667 


gi6650233 


Homo sapiens 


zinc finger protein 74 (ZNF74) 
gene, exon 3, alternative splice 
products and complete cds. 


259 


58 


1667 


gil5081398 


Homo sapiens 


kruppel-like zinc finger protein 
(ZNF300) mRNA, complete cds. 


246 


73 


1667 


gil769491 


Homo sapiens 


Human kruppel-related zinc finger 
protein (ZNF184) mRNA, partial 
cds. 


246 


64 


1668 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO:136. 


142 


62 


1668 


gil 196431 


Homo sapiens 


Human factor VIII gene LI 
element insertion DNA. 


131 


58 


1668 


gi6562173 


Homo sapiens 


mRNA; cDNA DKFZp566H033 
(from clone DKFZp566H033); 
partial cds. 


147 


60 


1669 


AAG03136 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7217. 


179 


75 


1669 


AAG02563 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6644. 


77 


60 


1669 


AAG73365 


Homo sapiens 


Human gene 20-encoded secreted 
protein HFCAA91, SEQ ID 
NO: 136. 


76 


63 


1670 


gil2804499 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:1360 IMAGE :29595 83, 
mRNA, complete cds. 


376 


83 


1670 


gil2804759 


Homo sapiens 


ribonuclease 6 precursor, clone 
MGC:3554 IMAGE:2959583, 
mRNA, complete cds. 


376 


83 


1670 


gi5091495 


Homo sapiens 


ribonuclease 6 precursor, mRNA, 
complete cds. 


376 


83 


1672 


AAG63163 


Homo sapiens 


Amino acid sequence of 
carcinoembryonic antigen-like 
polypeptide. 


2187 


99 


1672 


AAR54714 


Homo sapiens 


Carcinoembryonic antigen CEA- 
(c). 


463 


34 


1672 


AAB43688 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1133. 


466 


31 


1673 


gil3097624 


Homo sapiens 


clone IMAGE:3608084, mRNA, 
partial cds. 


760 


89 


1673 


gil0438279 


Homo sapiens 


cDNA: FLJ22029 fis, clone 
HEP08661. 


411 


51 


1673 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12300. 


379 


47 


1674 


gi339776 


Homo sapiens 


Human transposon LI. 2. 


686 


85 


1674 


gi5070621 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


686 


85 


1674 


gi5052950 


Homo sapiens 


LINE1 element inserted in B- 
globin gene intron 2. 


684 


85 


1675 


AAB56206 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 130 SEQ ID 
NO:300. 


44 


43 


1675 


AAB56350 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 130 SEQ ID 
NO:444. 


52 


39 


1675 


AAW33908 


Homo sapiens 


Peptide analogue of human insulin- 


57 


31 
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like growth factor- 1 (IGF-1). 






1676 


gi34234 


Homo sapiens 


H.sapiens mRNA for laminin- 
binding protein. 


290 


67 


1676 


gi 13529269 


Homo sapiens 


laminin receptor 1 (67kD, 
ribosomal protein SA), clone 
MGC: 12521 IMAGE :39970 19, 
mRNA, complete cds. 


289 


67 


1676 


gil4250794 


Homo sapiens 


laminin receptor 1 (67kD, 
ribosomal protein SA), clone 
MGC: 16750 IMAGE:4 130936, 
mRNA, complete cds. 


289 


67 


1677 


gi475782 


Homo sapiens 


Human GS2 gene, exon 7 and 
complete cds. 


1249 


97 


1677 


gi458226 


Homo sapiens 


Human GS2 mRNA, complete cds. 


1249 


97 


1677 


AAG00737 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4818. 


531 


94 


1678 


gi3 089427 


Homo sapiens 


SSC6 rearranged T cell receptor 
beta chain (TCRBV17) gene, 
complete cds. 


444 


100 


1678 


gi3 089433 


Homo sapiens 


SSC9 rearranged T cell receptor 
beta chain (TCRBV17) gene, 
complete cds. 


444 


100 


1678 


gi3002927 


Homo sapiens 


T cell receptor beta chain 
(TCRBV1 7S1-TCRBJ1 S5) 
mRNA, complete cds. 


444 


100 


1679 


gil5929119 


Homo sapiens 


clone MGC: 8834 
IMAGE:3920437, mRNA, 
complete cds. 


455 


100 


1679 


gi7022159 


Homo sapiens 


cDNA FLJ10242 fis, clone 
HEMBB1000630. 


455 


100 


1679 


AAB92624 


Homo sapiens 


Human protein sequence SEQ ID 
NO:10919. 


455 


100 


1680 


gil0186503 


Homo sapiens 


sialic acid-specific acetylesterase II 
mRNA, complete cds, alternatively 
spliced. 


817 


100 


1680 


gi6808138 


Homo sapiens 


mRNA; cDNA DKFZp761A051 
(from clone DKFZp761A051); 
partial cds. 


817 


100 


1680 


gi 10242345 


Homo sapiens 


sialic acid-specific 9-0- 

acetyl esterase I mRNA, complete 

cds. 


753 


100 


1681 


gi4768463 


Homo sapiens 


clone 118 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


76 


44 


1681 


gi4768574 


Homo sapiens 


clone 179 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


71 


42 


1681 


gi4768459 


Homo sapiens 


clone 115 T-cell receptor beta 
chain (TCRBV10S1P) mRNA, 
partial cds. 


69 


44 


1682 


gi4097459 


Elephantulus 
edwardii 


reverse transcriptase 


68 


48 


1683 


gil220315 


Homo sapiens 


early placenta insulin-like peptide 
EPIL (INSL4) mRNA, complete 
cds. 


349 


100 


1683 


AAR89134 


Homo sapiens 


Human early placental insulin-like 
protein. 


349 


100 
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1683 


AAW17676 


Homo sapiens 


Human relaxin-related factor-2 
(RRF-2). 


349 


100 


1684 


gi23365 


Homo sapiens 


Human mRNA for 17-beta- 
hydroxysteroid dehydrogenase (17- 
HSD) (EC 1.1.1.62). 


800 


83 


1684 


gi975895 


Homo sapiens 


Human 17-beta-hydroxysteroid 
dehydrogenase (EDH17B2) gene, 
complete cds. 


800 


83 


1684 


gil77127 


Homo sapiens 


Human 17-beta-hydroxysteroid 
dehydrogenase (EDH17B1 and 
EDH17B2) genes, complete coding 
regions and flanks. 


800 


83 


1685 


AAB18919 


Homo sapiens 


A novel polypeptide designated 
PR04356. 


1336 


99 


1685 


AAB31206 


Homo sapiens 


Amino acid sequence of human 
polypeptide PR04356. 


1336 


99 


1685 


gi5834584 


Homo sapiens 


mRNA encoding rat C4.4-like 
protein. 


231 


31 


1686 


gil655963 


Homo sapiens 


Human transforming growth 
factor-beta type II receptor gene, 
exon 7 and complete cds. 


1634 


99 


1686 


gi339570 


Homo sapiens 


Human TGF-beta type II receptor 
mRNA, complete cds. 


1634 


99 


1686 


AAR36601 


Homo sapiens 


TGF-beta 1 receptor type II (clone 
3FF). 


1634 


99 


1687 


gi508260 


Homo sapiens 


Human type 1 vasoactive intestinal 
peptide receptor (VI RG) gene, 
exon 13 and complete cds. 


390 


76 


1687 


gi407462 


Homo sapiens 


H.sapiens HIVR mRNA for 
vasoactive intestinal peptide (VIP) 
receptor. 


390 


76 


1687 


gi292904 


Homo sapiens 


Human vasoactive intestial peptide 
receptor mRNA, complete cds. 


390 


76 


1688 


gil81268 


Homo sapiens 


Human c-yes-1 mRNA. 


520 


90 


1688 


AAY24421 


Homo sapiens 


Human yes 1 protein. 


520 


90 j 


1688 


AAB84663 


Homo sapiens 


Amino acid sequence of human 
tyrosine kinase protein Yes. 


520 


90 


1689 


AAG02314 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6395. 


139 


55 


1689 


gil3325174 


Homo sapiens 


clone MGC: 10997 
IMAGE:3638158, mRNA, 
complete cds. 


104 


70 


1689 


gi9652123 


Mus musculus 


disrupter of silencing SAS10 


86 


40 


1690 


gil35 16467 


Homo sapiens 


HDNBl(homzygously deleted in 
neuroblastoma- 1)/UFD2 mRNA, 
complete cds. 


1217 


89 


1690 


gi4 104976 


Homo sapiens 


ubiquitin-fusion degradation 
protein 2 (UFD2) mRNA, 
complete cds. 


1217 


89 


1690 


gi 145 82754 


Homo sapiens 


ubiquitin-fusion degradation 
protein 2 mRNA, complete cds. 


1217 


89 


1691 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 mRNA, 
complete cds. 


326 


96 


1691 


AAG73420 


Homo sapiens 


Human gene 21 -encoded secreted 
protein HBWBR94, SEQ ID 
NO: 192. 


63 


35 
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1691 


gi9246437 


Staphylococcus 
aureus 


fmtA-like protein 


83 


26 


1692 


AAB58175 


Homo sapiens 


Lung cancer associated 
polypeptide sequence SEQ ID 513. 


394 


98 


1692 


gil4211500 


Homo sapiens 


secretory protein SEC8 mRNA, 
complete cds. 


394 


98 


1692 


gil4042555 


Homo sapiens 


cDNA FLJ14782 fis, clone 
NT2RP4000524, highly similar to 
Mus musculus Sec8 mRNA. 


394 


98 


1693 


gi3786312 


Homo sapiens 


mRNA for extracellular matrix 
protein, complete cds. 


498 


39 


1693 


gi 13937865 


Homo sapiens 


lumican, clone MGC: 12410 
IMAGE:3950745, mRNA, 
complete cds. 


312 


29 


1693 


gi699577 


Homo sapiens 


Human lumican mRNA, complete 
cds. 


312 


29 


1694 


gi3002588 


Mus mus cuius 


Plenty ofSH3s; POSH 


219 


93 


1694 


gi7230620 


Rattus 
norvegicus 


SH3 domain-containing adapter 
protein isoform SETA- 1x23 


130 


34 


1694 


AAW34246 


Homo sapiens 


SH3 domain of human clone 53 
protein. 


114 


50 


1695 


gil0645308 


Leishmania 
major 


L8453.1 


151 


27 


1695 


gil5419013 


Toxoplasma 
gondii 


subtili sin-like protein 


147 


30 


1695 


gil2018147 


Chlamydomonas 
reinhardtii 


vegetative cell wall protein gpl 


143 


30 


1696 


AAB43791 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1236. 


353 


98 


1697 


gil56368 


Caenorhabditis 
elegans 


metallothionein-2 


56 


39 


1697 


gil56381 


Caenorhabditis 
elegans 


metallothionein-like protein 


56 


39 


1697 


gi6782 


Caenorhabditis 
elegans 


metall othionein-II 


56 


39 


1698 


gi9858855 


Homo sapiens 


HPT protein (HPT) mRNA, 
complete cds; nuclear gene for 
mitochondrial product. 


1318 


85 


1698 


AAB29653 


Homo sapiens 


Human membrane-associated 
protein HUMAP-10. 


1318 


85 


1698 


AAB32389 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 19 SEQ ID 
NO:75. 


1318 


85 


1699 


gi6841138 


Homo sapiens 


HSPC099 mRNA, partial cds. 


275 


100 


1699 


gi7022824 


Homo sapiens 


cDNA FLJ10656 fis, clone 
NT2RP2006038. 


123 


23 


1699 


AAB93037 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11816. 


123 


23 


1700 


a a tt>o £.c on 


Homo sapiens 


Human FLEXHT-9 protein 
sequence SEQ ID NO:9. 


585 


53 


1700 


gi7023841 


Homo sapiens 


cDNA FLJ11280 fis, clone 
PLACE1009459. 


581 


54 


1700 


AAB93652 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 13 161. 


581 


54 


1702 


gil2330618 


Human 

immunodeficienc 


envelope glycoprotein 


71 


29 
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y virus type 1 








1703 


AAR48975 


Homo sapiens 


Human betal,6-N- 

acetylglucosaminyltransferase 

protein. 


413 


100 


1703 


gi870752 


Homo sapiens 


Human mRNA for N- 
acetylglucosaminyltransferase V, 
complete cds. 


413 


100 


1703 


gi4545222 


Homo sapiens 


alpha- 1,3 (6) -mannosyl glycoprotein 
beta-1 , 6-N-acetyl- 
glucosaminyltransferase (MGAT5) 
mRNA, complete cds. 


413 


100 


1704 


gil2654535 


Homo sapiens 


HSPC025, clone MGC:735 
IMAGE:3507964, mRNA, 
complete cds. 


1251 


81 


1704 


gil3960140 


Homo sapiens 


HSPC025, clone MGC:4223 
IMAGE:2959747, mRNA, 
complete cds. 


1251 


81 


1704 


gi4679028 


Homo sapiens 


HSPC021 


1251 


81 


1706 


gil4250636 


Homo sapiens 


nuclear factor of kappa light 
polypeptide gene enhancer in B~ 
cells inhibitor-like 2, clone 
MGC:3398 IMAGE:3628374, 
mRNA, complete cds. 


2605 


86 


1706 


gi6580428 


Homo sapiens 


partial NFKBIL2 gene for 
IkappaBR, exons 1-13. 


275 1 


95 


1706 


gi746415 


Homo sapiens 


Human I kappa BR mRNA, 
complete cds. 


1668 


72 


1707 


AAB95830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:l8850. 


219 


72 


1707 


gi7959889 


Homo sapiens 


PR02221 


137 


49 


1707 


gi2072969 


Homo sapiens 


Human LI element LI. 24 p40 
gene, complete cds. 


133 


48 


1708 


gi5901529 


Homo sapiens 


C2H2 type Kruppel-like zinc 
finger protein splice variant b 
(ZNF236) mRNA, complete cds. 


565 


99 


1708 


gi5705917 


human, MOLT 4 
T-cells, mRNA, 
798 nt]. [Homo 
sapiens 


HKR-T l =Kr uppel-like zinc finger 
protein 


223 


45 ! 


1708 


gi498736 


Homo sapiens 


H. sapiens HZr9 mRNA tor zinc 
finger protein. 


ion 




1709 


gi 18657 16 


Bos taurus 


procollagen I N-proteinase 


ZOD 


AO 


1709 


AAW47029 


Homo sapiens 


Human N-proteinase (70 kDa short 
form). 


254 


43 


1709 


AAW47030 


Homo sapiens 


Bovine N-proteinase. 


254 


42 


1710 


gil2862392 


Mus musculus 


D86 


1379 


78 


1710 


gi763113 


Homo sapiens 


H.sapiens ERK3 mRNA. 


872 


92 


1710 


gil294779 


Homo sapiens 


ERK3 protein kinase mRNA, 
complete cds. 


872 


92 


1711 


gi572680 


Escherichia coli 


beta ketoacyl-acyl carrier protein 
synthase 


315 


01 


1711 


gi664870 


Escherichia coli 


beta-ketoacyl-acyl carrier protein 
synthase II 


315 


91 


1711 


gi4062664 


Escherichia coli 


3-oxoacyl- synthase (EC 2.3.1.41) 
II 


315 


91 


1712 


AAR89952 


Homo sapiens 


Insulin-like growth factor binding 
protein-3. 


726 


95 
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1712 


gil2652547 


Homo sapiens 


insulin-like growth factor binding 
protein 3, clone MGC:2305 
IMAGE:3506666, mRNA, 
complete cds. 


722 


95 


1712 


gi398164 


Homo sapiens 


H. sapiens mRNA for insulin-like 
growth factor binding protein-3. 


722 


95 


1713 


gil0435700 


Homo sapiens 


cDNA FLJ13633 fis, clone 
PLACE 10 1 1114, weakly similar to 
PROBABLE ATP-DEPENDENT 
RNA HELICASE HAS1. 


710 


97 


1713 


AAB99891 


Homo sapiens 


Human RNA helicase gene 
helicain B protein sequence SEQ 
ID NO:4. 


710 


97 


1713 


AAB99892 


Homo sapiens 


Human RNA helicase gene 
helicain C protein sequence SEQ 
ID NO:6. 


710 


97 


1714 


gil5384740 


Homo sapiens 


mRNA for Paralemmin-2 (PALM2 
gene). 


1652 


100 


1714 


gil5384742 


Homo sapiens 


mRNA for Palm2-AKAP2 fusion 
protein (PALM2/AKAP2 gene). 


1577 


100 


1714 


gil4041780 


Homo sapiens 


mRNA for AKAP-2 protein. 


415 


60 


1715 


gi3600073 


Homo sapiens 


Human endogenous retrovirus K 
clone 7. 1 polymerase mRNA, 
partial cds. 


573 


48 


1715 


gi3600067 


Homo sapiens 


Human endogenous retrovirus K 
clone 10.9 polymerase mRNA, 
partial cds. 


572 


48 


1715 


gi 1780973 


Human 
endogenous 
retrovirus K 


pol protein 


572 


48 


1716 


AAU00025 


Homo sapiens 


Human activated T-lymphocyte 
associated sequence 4, ATLAS-4. 


315 


74 


1716 


gi5880909 


Drosophila 
melanogaster 


RNA-binding protein Smaug 


87 


29 ; 


1716 


gi7380929 


Drosophila 
melanogaster 


smaug protein 


87 


29 


1717 


gi35825 


Homo sapiens 


Human mRNA for pregnancy zone 
protein. 


3998 


93 


1717 


gi579594 


Homo sapiens 


alpha 2-macroglobulin 690-740 


2841 


64 


1717 


AAR11749 


Homo sapiens 


Human alpha-2 macroglobulin bait 
region mutant. 


2832 


64 


1718 


gi4760578 


Mus musculus 


PB-Cadherin 


1033 


84 


1718 


gil398906 


Rattus 
norvegicus 


long type PB-cadherin 


1027 


84 


1718 


gil398912 


Rattus 
norvegicus 


short type PB-cadherin 


1027 


84 


1719 


gi339776 


Homo sapiens 


Human transposon LI .2. 


312 


62 


1719 


gi5070621 


Homo sapiens 


retrotransposon LI insertion in X- 
linked retinitis pigmentosa locus, 
complete sequence. 


312 


62 


1719 


gi7959889 


Homo sapiens 


PR02221 


308 


71 


1720 


AAE06588 


Homo sapiens 


Human protein having 
hydrophobic domain, HP 10778. 


687 


100 


1720 


gil5072402 


Raja erinacea 


organic solute transporter alpha 


357 


45 


1720 


AAB38348 I 


Homo sapiens 


Human secreted protein encoded 
by gene 28 clone HLDOW79. 


328 


100 
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1721 


AAG81345 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:208. 


525 


90 


1721 


AAB93797 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 13560. 


525 


90 


1721 


AAB44681 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 41 SEQ ID 
NO: 146. 


199 


100 


1722 


gi 1809225 


Homo sapiens 


Human BAC clone RG161K23 
from 7q21, complete sequence. 


2183 


83 


1722 


gi 1698396 


Homo sapiens 


Human lanosterol 14-demethylase 
cytochrome P450 (CYP51) 
mRNA, complete cds. 


2183 


83 


1722 


gi871883 


Homo sapiens 


Human mRNA for lanosterol 14- 
demethylase, complete cds. 


2183 


83 


1723 


gil6182326 


Drosophila 
melanogaster 


GH01206p 


108 


29 


1723 


AAG77172 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:7938. 


90 


39 


1723 


gi3 878064 


Caenorhabditis 
elegans 


H19J13.1 


72 


25 


1724 


AAY53040 


Homo sapiens 


Human secreted protein clone 
kj320 1 protein sequence SEQ ID 
NO:86. 


2480 


100 


1724 


gi35 10639 


Rattus 
norvegicus 


UDP-GalNAc:polypeptide N- 
acetylgalactosaminyltransferase T5 


1351 


59 


1724 


gi6688167 


Homo sapiens 


partial mRNA for GalNAc-T5 
(GALNT5 gene). 


1082 


100 


1725 


gil4603092 


Homo sapiens 


Similar to CD47 antigen (Rh- 
related antigen, integrin-associated 
signal transducer), clone 
MGC: 15298 IMAGE :43 03 5 34, 
mRNA, complete cds. i 


1329 


97 


1725 


gi 152775 80 


Homo sapiens 


clone MGC:9240 
IMAGE:3857911, mRNA, 
complete cds. 


1329 


97 


1725 


gi396705 


Homo sapiens 


H. sapiens integrin associated 
protein mRNA, complete CDS,. 


1329 


97 


1726 


gil864011 


Homo sapiens 


mRNA for SHPS-1, complete cds. 


858 


98 


1726 


gi2052056 


Homo sapiens 


H.sapiens mRNA for SIRP-alphal. 


858 


98 


1726 


gi65 18913 


Homo sapiens 


Bit mRNA, complete cds. 


858 


98 


1727 


gi2707601 


Homo sapiens 


A4 differentiation-dependent 
protein (A4), triple LIM domain 
protein (LM06), and 
synaptophysin (SYP) genes, 
complete cds; and calcium channel 
alpha- 1 subunit (CACNA1F) gene, 
partial cds. 


1656 


100 


1727 


gi6180176 


Homo sapiens 


transcription factor IGHM 
enhancer 3, JM1 1 protein, JM4 
protein, JM5 protein, T54 protein, 
JM10 protein, A4 differentiation- 
dependent protein, triple LIM 
domain protein 6, and 
synaptophysin genes, complete 
cds; and L-type calcium channel 
alpha- 1 subunit gene, partial cds, 
complete sequence. 


1656 


100 
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1727 


gi899301 


Homo sapiens 


Human mRNA for synaptophysin 
(P38). 


1602 


100 


1728 


gil5590682 


Homo sapiens 


histone deacetylase 9a (HDAC9) 
mRNA, complete cds, alternatively 
spliced. 


629 


96 


1728 


gil2060992 


Mus musculus 


MEF2-interacting transcription 
repressor MITR 


626 


95 


1728 


gil3183017 


Mus musculus 


histone deacetylase-related protein 


623 I 


94 


1729 


gi59 11884 


Homo sapiens 


mRNA; cDNA DKFZp434N126 
(from clone DKFZp434Nl26). 


1605 


99 


1729 


gil5912209 


Arabidopsis 
thaliana 


At l g27520/T 1 7H3_2 


228 


34 


1729 


gil4164377 


Mus musculus 


Type II membrane protein of 
ER~mouse gene similar to alpha- 
mannosidase 


216 


38 


1730 


gi21842 


Triticum 
aestivum 


proline-rich protein 


403 


33 


1730 


gi4138732 


Zea mays 


proline-rich protein 


398 


30 


1730 


gil 1610622 


Rattus 
norvegicus 


GABA-A epsilon subunit splice 
variant 


41 1 


32 


1731 


gi 189222 


Homo sapiens 


Human neurokinin-2 receptor 
(TAC2R) gene, exon 5. 


1733 


95 


1731 


AAW80135 


Homo sapiens 


Human recombinant neurokinin-2 
(NK-2) receptor protein. 


1733 


95 


1731 


gil89135 


Homo sapiens 


Human neurokinin A receptor 
(NK-2R) mRNA, complete cds. 


1732 


95 


1732 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:l48. 


678 


99 


1732 


AAB75542 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:96. 


294 


100 


1732 


gil 8640 11 


Homo sapiens 


mRNA for SHPS-l, complete cds. 


261 


43 


1733 


AAY66648 


Homo sapiens 


Membrane-bound protein 
PROH20. 


294 


93 


1733 


AAB65171 


Homo sapiens 


Human PROl 120 (UNQ559) 
protein sequence SEQ ID NO: 84. 


294 


93 


1733 


AAE01440 


Homo sapiens 


Human gene 5 encoded secreted 
protein HE9QN39, SEQ ID 
NO:95. 


294 


93 


1734 


gil3161409 


Mus musculus 


family 4 cytochrome P450 


539 


78 


1734 


gil55947 


Blaberus 
discoidalis 


cytochrome P450 


248 


43 


1734 


gi3249041 


Diploptera 
punctata 


corpora allata cytochrome P450 


233 


42 


1735 


AAY71159 


Homo sapiens 


Human phosphodiesterase 
interacting protein, myomegalin. 


682 


45 


1735 


gi4761644 


Rattus 
norvegicus 


myomegalin 


484 


55 


1735 


gi53782 


Mus musculus 


profilin (AA 1-140) 


414 


74 


1736 


gi4959382 


Homo sapiens 


human endogenous retrovirus 
HERV-H19 pol protein (pol) gene, 
partial cds; env protein (env) gene, 
complete cds; and 3 f LTR, 
complete sequence. 


301 


39 


1736 


gi8439396 


HERV-H/env62 


envelope protein 


301 


39 
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1736 


gi5442112 


Simian retrovirus 
type 2 


envelope glycoprotein 


296 


42 


1737 


AAY76177 


Homo sapiens 


Human secreted protein encoded 
by gene 54. 


288 


1 nn 
IUU 


1738 


gi3688630 


Homo sapiens 


hdkk-4 gene, exon3, exon4 and 
complete cds. 


759 


1 aa 
IUU 


1738 


gi6049610 


Homo sapiens 


dickkopf-4 (DKK-4) mRNA, 
complete cds. 


759 


1 AA 

100 


1738 


AAW73017 


Homo sapiens 


Human cysteine-rich secreted 
protein CRSP-2. 


759 


100 


1739 


gil0439926 


Homo sapiens 


cDNA: FLJ23293 fis, clone 
HEP10514. 


1786 


99 


1739 


gi4406632 


Homo sapiens 


clone 25221 mRNA sequence, 
complete cds. 


1253 


64 


1739 


gil0435296 | 


Homo sapiens 


cDNA FLJ13321 fis, clone 
OVARC1001703, weakly similar 
to Mus musculus ARL-6 
interacting protein-2 (Aip-2) 
mRNA. 


1084 


62 


1740 


AAB49278 


Homo sapiens 


Protein encoded by zsig8 1 cDNA 
fragment. 


755 


A/1 

94 


1740 


gil335033 


Homo sapiens 


Human mRNA for collagen VI 
alpha-2 alternative C-terminal 
domain. 


69 


DA 


1740 


AAY40063 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


63 


35 


1741 


gil4714807 


Mus musculus 


Similar to transporter-like protein 


565 


83 


1741 


AAG81264 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:46. 


657 


1 A A 

IUU 


1741 


AAY66673 


Homo sapiens 


Membrane-bound protein 
PROH15. 


657 


100 


1742 


gi 145 83077 


Homo sapiens 


PAS-kinase (PASK) mRNA, 
complete cds. 


2227 


99 


1742 


AAB65630 


Homo sapiens 


Novel protein kinase, SEQ ID NO: 
156. 


2223 


99 


1742 


gil 5487238 


Mus musculus 


PASKIN protein 


1632 


72 


1743 


gil3879899 


Mycobacterium 

tuberculosis 

CDC1551 


PPE family protein 


118 


31 


1743 


gil 334643 


Xenopus laevis 


APEG precursor protein 


90 


1 A 

34 


1743 


gil 845 11 


Homo sapiens 


Human zinc-finger DNA-binding 
motifs (IA-1) mRNA, complete 
cds. 


86 


32 


1744 


gi5931718 


Chlamydomonas 
reinhardtii 


1 -alpha dynein heavy chain 


1874 


62 


1744 


gi9409781 


Chlamydomonas 
reinhardtii 


1 beta dynein heavy chain 


986 


36 


1744 


gi514215 


Chlamydomonas 
reinhardtii 


dynein beta heavy chain 


978 


36 


1745 


AAY13392 


Homo sapiens 


Amino acid sequence of protein 
PR0328. 


1168 


57 


1745 


AAB01373 


Homo sapiens 


Neuron-associated protein. 


1168 


57 


1745 


AAY95343 


Homo sapiens 


Human PR0328 antitumour 
protein. 


1168 


57 


1746 


AAY99398 


Homo sapiens 


Human PRO1301 (UNQ667) 
amino acid sequence SEQ ID 


677 


94 
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NO:212. 






1746 


AAW88501 


Homo sapiens 


Human stomach carcinoma clone 
HP10415-encoded protein. 


675 


94 


1746 


AAB24255 


Homo sapiens 


Human cytochrome P450 
(HUCYP) protein SEQ ID NO:l. 


675 


94 


1747 


AAG02314 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6395. 


78 


66 


1747 


AAB45051 


Homo sapiens 


Human secreted protein encoded 
by gene 27. 


60 


30 


1747 


gi2585662 


Human 

immunodeficienc 
y virus type 1 


envelope glycoprotein 


58 


38 


1748 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1142 


88 


1748 


gil5637151 


Beta vulgaris 


glycine decarboxylase subunit P 


62 


36 


1749 


gil3506805 


Homo sapiens 


thymic stromal co-transporter 
mRNA, complete cds. 


2309 


100 


1749 


gil3506808 


Mus musculus 


thymic stromal co-transporter 


1789 


77 


1749 


AAE04906 


Homo sapiens 


Human transporter and ion 
channel- 19 (TRICH-19) protein. 


305 


34 


1750 


gil0438815 


Homo sapiens 


cDNA: FLJ22427 Us, clone 
HRC09013. 


4170 


98 


1750 


AAB01381 


Homo sapiens 


Neuron-associated protein. 


2964 


95 


1750 


gi6650377 


Mus musculus 


pecanex 1 


2288 


73 


1751 


gi825663 


Homo sapiens 


H.sapiens GLAST1 gene for glilial 
glutamate transporter, exonl, exon 
2. 


411 


100 


1751 


gi487339 


Homo sapiens 


Human excitatory amino acid 
transporter 1 mRNA, complete cds. 


411 


100 


1751 


gi825504 


Homo sapiens 


Human mRNA for glutamate 
transporter, complete cds. 


411 


100 


1752 


gil621607 


Homo sapiens 


Human neogenin mRNA, complete 
cds. 


593 


100 


1752 


gi641966 


Gallus gallus 


neogenin 


591 


98 


1752 


gil785999 


Rattus 
norvegicus 


neogenin 


586 


97 


1753 


gi7020927 


Homo sapiens 


cDNA FLJ20674 fis, clone 
KAIA4450. 


293 


25 


1753 


AAP94014 


Homo sapiens 


Carcinoembryonic cell surface 
antigen. 


254 


27 


1753 


AAR60619 


Homo sapiens 


Carcinoembryonic antigen 
glycoprotein. 


250 


27 


1754 


AAY19507 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


343 


91 


1754 


AAY19654 


Homo sapiens 


SEQ ID NO 372 from 
W09922243. 


85 


100 


1754 


gil5075730 


Sinorhizobium 
men i on 


HYPOTHETICAL 

i KAIN I^MJiMoKAJNti rKO 1 E1N 


65 


30 


1755 


gi2506078 


Mus musculus 


tetracycline transporter-like protein 


1120 


99 


1755 


AAY29332 


Homo sapiens 


Human secreted protein clone 
pe584 2 protein sequence. 


806 


78 


1755 


AAB75295 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID 
NO: 114. 


806 


78 


1756 


AAE06608 


Homo sapiens 


Human protein having 


1065 


98 
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hydrophobic domain, HP 10798. 






1756 


AAB88469 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0027. 


1065 


98 


1756 


gi9664030 


Drosophila 
melanogaster 


aquaporin 


115 


22 


1757 


gi8925284 


Homo sapiens 


phosphatidyl inositol 
polyphosphate 5-phosphatase type 
IV mRNA, complete cds. 


2598 


99 


1757 


gi9295353 


Mus musculus 


inositol polyphosphate 5- 
phosphatase 


1993 


74 


1757 


gi5360761 


Rattus 
norvegicus 


pharbin 


1933 


77 


1758 


gi395207 


Bos taurus 


potassium channel (BGK5) 


757 


83 


1758 


gil86669 


Homo sapiens 


Human potassium channel mRNA, 
complete cds. 


754 


83 


1758 


gi3 04652 


Canis familiaris 


delayed rectifier K+ channel 


750 


82 


1759 


gi7023003 


Homo sapiens 


cDNA FLJ10769 lis, clone 
NT2RP4000151. 


647 


85 


1759 


AAB93147 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12057. 


647 


85 


1759 


AAB38451 


Homo sapiens 


Fragment of human secreted 
protein encoded by gene 23 clone 
HEGAK44. 


484 


84 


1760 


AAB87763 


Homo sapiens 


Human T2R33 amino acid 
sequence SEQ ID NO:56. 


693 


83 


1760 


AAB87780 


Homo sapiens 


Human T2R50 amino acid 
sequence SEQ ID NO:76. 


671 


82 


1760 


AAE03828 


Homo sapiens 


Human gene 1 1 encoded secreted 
protein HHAUQ28, SEQ ID NO: 
74. 


656 


89 


1761 


gil3543624 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 14612 
IMAGE:4051044, mRNA, 
complete cds. 


514 


87 


1761 


gil3937872 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 12437 
IMAGE:3930701, mRNA, 
complete cds. 


514 


87 


1761 


gil4250122 


Homo sapiens 


uncharacterized hematopoietic 
stem/progenitor cells protein 
MDS029, clone MGC: 14755 
IMAGE:4283253, mRNA, 
complete cds. 


514 


87 


1762 


gi2739094 


Homo sapiens 


sodium/myo-inositol cotransporter 
(SLC5A3) gene, complete cds. 


3621 


100 


1762 


gi662843 


Homo sapiens 


Na+/myo-inositol cotransporter 
(SLC5A3) gene, complete cds. 


3616 


99 


1762 


gi 123 743 7 


Bos taurus 


Na+/myo-inositol cotransporter 


3486 


95 


1763 


AAW84596 


Homo sapiens 


Amino acid sequence of the human 
Tan go -7 9 protein. 


1212 


44 


1763 


gil5029689 


Homo sapiens 


clone MGC: 17422 
IMAGE:4214343, mRNA, 
complete cds. 


1209 


44 


1763 


AAB74705 


Homo sapiens 


Human membrane associated 
protein MEMAP-11. 


1209 


44 
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1764 


gi5923786 


Homo sapiens 


zinc metalloprotease ADAMTS6 
(ADAMTS6) mRNA, complete 
cds. 


614 


43 


1764 


AAB72282 


Homo sapiens 


Human ADAMTS-6 amino acid 
sequence. 


614 


43 


1764 


gil2053709 


Homo sapiens 


mRNA for ADAMTS 12. 


1078 


41 


1765 


gil695682 


Homo sapiens 


mRNA for hepatic triglyceride 
lipase, complete cds. 


152 


64 


1765 


gi32498 


Homo sapiens 


Human mRNA for hepatic 
triglyceride lipase (HTGL). 


152 


64 


1765 


gi307129 


Homo sapiens 


Human hepatic lipase mRNA, 
complete cds. 


152 


64 


1766 


gi8 86282 


Homo sapiens 


glycoprotein lb alpha (GPIb) gene, 
partial cds. 


48 


42 


1766 


AAB64868 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 43 SEQ ID 
NO:154. 


66 


39 


1766 


gi 10798865 


Homo sapiens 


zinc finger transcription factor 
BTEB2 gene, partial cds. 


65 


29 


1767 


gi8886005 


Homo sapiens 


lysophosphatidic acid 
acyltransferase-delta (LPAAT- 
delta) mRNA, complete cds. 


324 


90 


1767 


AAY96592 


Homo sapiens 


Human lysophosphatidic acid 
acyltransferase delta. 


324 


90 


1767 


AAY66665 


Homo sapiens 


Membrane-bound protein 
PRO1016. 


324 


90 


1768 


gill60183 


Homo sapiens 


H.sapiens mRNA for metabotropic 
glutamate receptor type 4. 


926 


99 


1768 


gil 935039 


Homo sapiens 


Human metabotropic glutamate 
receptor 4 mRNA, complete cds. 


926 


99 


1768 


gi2298840 


unidentified 


HMGLUR4 


926 


99 


1769 


gil3279140 


Homo sapiens 


Similar to synaptotagmin 1 1, 
clone MGC: 108 81 
IMAGE:3621 175, mRNA, 
complete cds. 


2210 


99 


1769 


gil5489165 


Homo sapiens 


clone MGC: 17226 
IMAGE:4179089, mRNA, 
complete cds. 


2210 


99 


1769 


gil4042290 


Homo sapiens 


cDNA FLJ14634 fis, clone 
NT2RP2001081, moderately 
similar to S YNAPTOTAGMIN 
IV. 


2201 


98 


1770 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
0610008P16 gene, clone 
MGC: 15937 IMAGE:3537224, 
mRNA, complete cds. 


239 


78 


1770 


AAB56487 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO: 1065. 


239 


78 


1770 


A A DTI CIO 

AAB73512 


Homo sapiens 


Human transferase HTFS-19, SEQ 
ID NO: 19. 


239 


78 


1771 


gi7678873 


Homo sapiens 


mRNA for vascular cadherin-2, 
complete cds. 


5369 


99 


1771 


gi7407150 


Homo sapiens 


protocadherin 12 (PCDH12) 
mRNA, complete cds. 


5369 


99 


1771 


gi8 164037 


Homo sapiens 


vascular endothelial cadherin 2 
mRNA, complete cds. 


5369 


99 
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1772 


gil5082281 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 10252 
IMAGE:3622879, mRNA, 
complete cds. 


231 


93 


1772 


gil5214803 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 13329 
IMAGE:4281565, mRNA, 
complete cds. 


231 


93 


1772 


gi5531815 


Homo sapiens 


steroid dehydrogenase homolog 


231 


93 


1773 


AAG73650 


Homo sapiens 


Human colon cancer antigen 
protein SEQ ID NO:4414. 


265 


58 


1773 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16231. 


265 


64 


1773 


gil335205 


Homo sapiens 


Human DNA for LINE-1 
transposable element ORFI and II. 


273 


48 


1774 


AAW71708 


Homo sapiens 


Human integral membrane protein 
TMP-1. 


657 


50 


1774 


gi3603459 


Homo sapiens 


tetraspan NET- 5 mRNA, complete 
cds. 


626 


90 


1774 


AAB93885 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13815. 


626 


90 


1775 


AAG71597 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1278. 


981 


99 


1775 


AAG71587 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1268. 


746 


75 


1775 


gi!5293749 


Homo sapiens 


clone OR10R2 olfactory receptor 
gene, partial cds. 


725 


75 


1776 


AAB38011 


Homo sapiens 


Human secreted protein encoded 
by gene 3 clone HPJCX13. 


252 


85 


1776 


AAB64888 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:66. 


252 


85 


1776 


gi 10432803 


Homo sapiens 


cDNA FLJ11531 lis, clone 
HEMBA 1002661. 


192 


C A 

54 


1777 


gil 82851 


Homo sapiens 


Human G0S2 protein gene, 
complete cds. 


497 


1 f\f\ 
100 


1777 


gi 182853 


Homo sapiens 


Human GOS2 gene, 5' flank and 
cds. 


497 


100 


1777 


gil213013 


Mus musculus 


G0S2-like protein 


377 


77 


1778 


gi4027903 


Homo sapiens 


"1 r A TV A~T\E TIXT A 1 _j 1 _ 

VAMP5 mRNA, complete cds. 


A Tl 

473 


i r\r\ 
1UU 


1778 


gi4679008 


Homo sapiens 


VAMP5-hke protein 


473 


i r\r\ 
100 


1778 


AAW04181 


Homo sapiens 


t~\ 11 1 • r\ 

Cellubrevm-2. 


473 


i r\r\ 

100 


1779 


gi5326919 


Bos taurus 


SCO-spondin 


75 


30 


1779 


gi3 059229 


Aspergillus 
oryzae 


HAPB 


74 


35 


1779 


AAY84596 


Homo sapiens 


Fragment of human pre-pro- 
artemin polypeptide. 


48 


40 


1780 


gi 1399321 


Macaca mulatta 


MHC class I antigen Mamu B*08 


679 


72 


1780 


gi!399319 


Macaca mulatta 


MHC class I antigen Mamu B*07 


674 


67 


1780 


' O 1 1 '7*70 c\ 

gioi i /vyy 


Pan troglodytes 


MHC class I antigen 


6/ / 


/u 


1781 


gil 5929602 


Homo sapiens 


clone MGC: 17861 
IMAGE:3903313, mRNA, 
complete cds. 


607 


51 


1781 


gil 5099953 


Homo sapiens 


diacylglycerol acyltransferase 2 
mRNA, complete cds. 


607 


51 


1781 


AAY99421 


Homo sapiens 


Human PR01433 (UNQ738) 
amino acid sequence SEQ ID 


607 


51 
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NO:292. 






1782 


gi2706518 


Homo sapiens 


PMP69 gene, exon 1 (and joined 
CDS). 


1602 


98 


1782 


gi2343157 


Homo sapiens 


peroxisomal membrane protein 69 
(PMP69) mRNA, complete cds. 


1602 


98 


1782 


gil5215442 


Homo sapiens 


Similar to ATP-binding cassette, 
sub-family D (ALD), member 4, 
clone MGC:4125 
IMAGE:2960427, mRNA, 
complete cds. 


1594 


98 


1783 


AAB35235 


Homo sapiens 


Human neurotransmitter 
transporter protein GC42. 


3571 


98 


1783 


gi546769 


human, 

substantia nigra, 
mRNA, 2364 nt]. 
[Homo sapiens 


glycine transporter type lb 


3557 


98 


1783 


AAB35236 


Homo sapiens 


Human glycine transporter type lc. 


3557 


98 


1784 


AAU00017 


Homo sapiens 


Human Plexin-Dl. 


3361 


97 


1784 


gil3097621 


Homo sapiens 


clone IMAGE: 3607457, mRNA, 
partial cds. 


2470 


100 


1784 


AAU00015 


Homo sapiens 


Human Plexin-B2. 


1290 


47 


1785 


gi6 19726 


Homo sapiens 


Human nuclear factor I (NFI) 
mRNA, clone ATI, complete cds. 


953 


99 


1785 


gi6 19730 


Homo sapiens 


Human nuclear factor I (NFI) 
mRNA, clone CT1, partial cds. 


953 


99 


1785 


gi305357 


Mesocricetus 
auratus 


nuclear factor 1-like protein 


953 


99 


1786 


gil3810568 


Homo sapiens 


TLR5 mRNA for Toll-like receptor 
5, complete cds. 


4482 


100 


1786 


gi3 132526 


Homo sapiens 


Toll/interleukin-1 receptor-like 
protein 3 (TIL3) mRNA, complete 
cds. 


4464 


99 


1786 


gi7648687 


Mus musculus 


toll-like receptor 5 


3235 


72 


1787 


AAB56473 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID 
NO:1051. 


474 


100 


1787 


AAB60119 


Homo sapiens 


Human transport protein TPPT-39. 


456 


98 


1787 


gil 1463949 


Homo sapiens 


hUGTrel7 mRNA for UDP- 
glucuronic acid, complete cds. 


308 


62 


1788 


gi4929765 


Homo sapiens 


CGI- 148 protein mRNA, complete 
cds. 


744 


89 


1788 


gi7578785 


Homo sapiens 


NPD008 protein (NPD008) 
mRNA, complete cds. 


744 


89 


1788 


gil4250060 


Homo sapiens 


clone MGC: 14598 
IMAGE:4292664, mRNA, 
complete cds. 


737 


88 


1789 


gil736785 


Escherichia coli 


Acriflavin resistance protein F 
(EnvD protein). 


2265 


100 


1789 


gil5980819 


Yersinia pestis 


AcrB/AcrD/AcrF family 
membrane protein 


1854 


79 


1789 


gil 736782 


Escherichia coli 


Acriflavin resistance protein F 
(EnvD protein). 


1821 


100 


1790 


gil5160166 


Agrobacterium 
tumefaciens 


AGR_L_3181p 


549 


59 


1790 


gi4981492 


Thermotoga 
maritima 


ribose ABC transporter, permease 
protein 


316 


40 
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1790 


gil2724646 


Lactococcus 
lactis subsp. 
lactis 


ribose ABC transporter permease 
protein 


314 


35 


1791 


gi41587 


Escherichia coli 


glycerol-3-phosphatase transporter 
(AA 1 - 452, glpT) 


1073 


100 


1791 


gil799587 


Escherichia coli 


glycerol-3-phosphate transport 
protein 


1073 


100 


1791 


gil788573 


Escherichia coli 
K12 


sn-glycerol-3-phosphate permease 


1073 


100 


1792 


gil48200 


Escherichia coli 


similar to arylsulfatases of 
Klebsiella pneumoniae and Homo 
sapiens 


679 


98 


1792 


gil3364207 


Escherichia coli 
0157:H7 


arylsulfatase 


679 




1792 


gil790233 


Escherichia coli 
K12 


arylsulfatase 


679 


98 


1793 


gi9657461 


Vibrio cholerae 


sulfate permease family protein 


267 


51 


1793 


gi2635979 


Bacillus subtilis 


similar to transporter 


249 


46 


1793 


gil4024597 


Mesorhizobium 
loti 


sulfate transporter family protein 


231 


45 


1794 


gil799719 


Escherichia coli 


similar to 


922 


100 


1794 


gil5156677 


Agrobacterium 
tumefaciens 


AGRC_2926p 


452 


50 


1794 


gil5074970 


Sinorhizobium 
meliloti 


HYPOTHETICAL 
TRANSMEMBRANE PROTEIN 


414 


47 


1795 


gil3325242 


Homo sapiens 


clone MGC:4033 
IMAGE:2820092, mRNA, 
complete cds. 


228 


81 


1795 


gil644366 


Rattus 
norvegicus 


ninjurinl 


228 


81 


1795 


gi3077901 


Mus musculus 


ninjurin 


228 


81 


1796 


gil5987491 


Homo sapiens 


tumor endothelial marker 5 
precursor (TEM5) mRNA, 
complete cds. 


5742 


89 


1796 


AAB71863 


Homo sapiens 


Human hl5571 GPCR. 


5742 


89 


1796 


gil5987499 


Mus musculus 


tumor endothelial marker 5 
precursor 


5030 


79 


1797 


gil3938575 


Homo sapiens 


Similar to RIKEN cDNA 
26105 11E22 gene, clone 
MGC:4251 IMAGE:3028940, 
mRNA, complete cds. 


1331 


100 


1797 


AAY91598 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 8 SEQ ID 
NO:271. 


1322 


100 


1797 


gil5029776 


Mus musculus 


RIKEN cDNA 261051 1E22 gene 


1317 


98 


1798 


gi42989 


Escherichia coli 


SecY (PrlA) polypeptide (aa 1- 
443) 


953 


100 


1798 


gi606234 


Escherichia coli 


secY 


953 


100 


1798 


gil 5978329 


Yersinia pestis 


preprotein translocase SecY 
subunit 




y 1 


1800 


gil0435708 


Homo sapiens 


cDNA FLJ13639 fis, clone 
PLACE101 1219, weakly similar to 
PROBABLE 

OXIDOREDUCTASE (EC 1 .-.-.-). 


289 


84 


1800 


AAB94698 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15680. 


289 


84 
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1800 


gil5292559 


Drosophila 
melanogaster 


SD07613p 


101 


67 


1801 


gi7415511 


Homo sapiens 


mRNA for peptide transporter 3, 
complete cds. 


1281 


89 


1801 


gil3810437 


Rattus 
norvegicus 


peptide histidine transporter 1 
homolog rPHT2 


1066 


83 


1801 


AAY86224 


Homo sapiens 


Human secreted protein 
HDPWU34, SEQ ID NO: 139. 


1280 


100 


1802 


gi387012 


Homo sapiens 


Human pepsinogen gene, exon 9. 


615 


97 


1802 


gi387013 


Homo sapiens 


Human pepsinogen A (15.0) gene, 
exon 9, clone cgHGP2. 


615 


97 


1802 


AAB66589 


Homo sapiens 


Human pepsin. 


615 


97 


1803 


gil5680159 


Homo sapiens 


Similar to claudin 2, clone 
MGC:20191 IMAGE :4645 075, 
mRNA, complete cds. 


1113 


99 


1803 


gil0503980 


Homo sapiens 


clone SP82 claudin 2 mRNA, 
complete cds. 


1113 


99 


1803 


gi9755009 


Homo sapiens 


claudin-2 mRNA, complete cds. 


1113 


99 


1804 


gi476222 


Homo sapiens 


Human anion exchanger 3 brain 
isoform (bAE3) mRNA, complete 
cds. 


317 


74 


1804 


gil 0953762 


Mus musculus 


anion exchanger 3 cardiac isoform 


317 


74 


1804 


gi309095 


Mus musculus 


AE3 protein 


317 


74 


1805 


AAB29632 


Homo sapiens 


Human pollinosis-associated gene 
581-encoded protein, SEQ ID 
NO:12. 


2518 


55 


1805 


AAY70023 


Homo sapiens 


Human Protease and associated 
protein-17 (PPRG-17). 


2068 


58 


1805 


gil3529590 


Mus musculus 


Similar to ubiquitin specific 
protease 20 


1924 


56 


1806 


AAY99363 


Homo sapiens 


Human PRO1380 (UNQ717) 
amino acid sequence SEQ ID 
NO:79. 


1773 


98 


1806 


gi 12656639 


Homo sapiens 


equilibrative nucleoside transporter 
3 (ENT3) mRNA, complete cds. 


1767 


98 


1806 


AAY82285 


Homo sapiens 


Human ENT1 receptor SEQ ID 
NO:l. 


1767 


98 


1807 


AAG81410 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:338. 


155 


27 


1807 


gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrolase 


159 


30 


1807 


gil5778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


159 


30 


1808 


gil5082281 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 10252 
IMAGE:3622879, mRNA, 
complete cds. 


231 


.93 


1808 


gil5214803 


Homo sapiens 


Similar to steroid dehydrogenase 
homolog, clone MGC: 13329 
IMAGE:4281565, mRNA, 
complete cds. 


231 


93 


1808 


gi5531815 


Homo sapiens 


steroid dehydrogenase homolog 


231 


93 


1810 


gil4270513 


Homo sapiens 


partial SLC22A3 gene for organic 
cation transporter 3, exon 1 and 
joined CDS. 


775 


99 


1810 


gi3581982 


Homo sapiens 


mRNA for extraneuronal 


775 


99 
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monoamine transporter. 






1810 


g i4454795 


Mus musculus 


organic cation transporter 3 


720 


89 


1811 


gil212965 


Homo sapiens 


H.sapiens mRNA for 
transmembrane protein rnp24. 


266 


91 


1811 


AAB74750 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 8 SEQ ID 
NO:59. 


266 


91 


1811 


AAB74759 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID 
NO:68. 


266 


91 


1812 


gil5082375 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 1 
(upregulated in kidney), clone 
MGC:20076 IMAGE:4643216, 
mRNA, complete cds. 


711 


98 


1812 


gi 13096836 


Mus musculus 


Similar to transmembrane 7 
superfamily member 1 
(upregulated in kidney) 


690 


95 


1812 


AAY69992 


Homo sapiens 


Human receptor-associated protein 
from Incyte clone 786873. 


412 


72 


1813 


gil2653811 


Homo sapiens 


zinc finger protein 219, clone 
MGC:1124 IMAGE:3347777, 
mRNA, complete cds. 


500 


38 


1813 


gi6899807 


Homo sapiens 


mRNA for zinc finger protein 219, 
complete cds. 


496 


38 


1813 


gi 14549 186 


Mus musculus 


zinc finger protein 219 


490 


36 j 


1814 


gi4324468 


Homo sapiens 


LAG1 protein (LAG1) gene, exon 
7 and complete cds. 


1385 


93 


1814 


gi 183051 


Homo sapiens 


Human growth/differentiation 
factor 1 (GDF-1) mRNA, complete 
cds. 


1385 


93 


1814 


AAR20230 


Homo sapiens 


hUOG-1. 


1385 


93 


1815 


gi6009515 


Xenopus laevis 


epidermis specific serine protease 


338 


41 


1815 


gi9757698 


Xenopus laevis 


embryonic serine protease- 1 


314 


46 


1815 


gil 3277969 


Mus musculus 


Similar to protease, serine, 8 
(prostasin) 


310 


38 


1816 


gi41875 


Escherichia coli 


KefC potassium efflux system 


706 


100 


1816 


gi216472 


Escherichia coli 


kefc potassium efflux system 


706 


100 


1816 


gil 786232 


Escherichia coli 
K12 


K+ efflux antiporter, glutathione- 
regulated 


706 


100 


1817 


gil054578 


Escherichia coli 


dioxygenase 


1510 


100 


1817 


gil786565 


Escherichia coli 
K12 


taurine dioxygenase, 2- 
oxoglutarate-dependent 


1510 


100 


1817 


gil3359879 


Escherichia coli 
0157:H7 


taurine dioxygenase 


1506 


99 


1818 


gil279401 


Escherichia coli 


SapB protein 


928 


98 


1818 


gil742115 


Escherichia coli 


Peptide transport system permease 
protein SapB. 


928 


98 


1818 


gil3361335 


Escherichia coli 
0157:H7 


homolog of Salmonella peptide 
transport permease protein 


928 


98 


1819 


gi9652147 


Homo sapiens 


transmembrane-type protein 
tyrosine phosphatase H (PTPRH) 
gene, exon 20 and complete cds. 


4639 


100 


1819 


gi475004 


Homo sapiens 


mRNA for protein tyrosine 
phosphatase precursor, complete 
cds. 


4308 


93 
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1819 


gil321659 


Rattus 
norvegicus 


brain-enriched membrane- 
associated protein tyrosine 
phosphatase (BEM)-2 


920 


77 


1820 


gi7141125 


Homo sapiens 


tissue-type heart Ellis-van Creveld 
syndrome protein (EVC) mRNA, 
complete cds. 


1277 


100 


1820 


gi7141127 


Homo sapiens 


tissue-type brain Ellis- van Creveld 
syndrome protein (EVC) mRNA, 
complete cds. 


1277 


100 


1820 


gi7271903 


Homo sapiens 


DWF-1 mRNA, complete cds. 


1271 


99 


1821 


AAG71453 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 134. 


568 


98 


1821 


AAG72370 


Homo sapiens 


Human OR-like polypeptide query 
sequence, oJbv id jno. zuj i. 


568 


98 


1821 


AAE04556 


Homo sapiens 


Human G-protein coupled 
receptor- 12 (GCREC-12) protein. 


558 


100 


1822 


gil0440040 


Homo sapiens 


cDNA: FLJ23375 fis, clone 
HEP16206. 


757 


98 


1822 


AAB95094 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17042. 


98 


43 


1822 


gi6002197 


Homo sapiens 


H.sapiens NDUFV3 gene, exon 1. 


78 


34 
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912 


gi601918 


Homo sapiens 


glutathione S-transferase theta 2 


12o3 


1UU 


912 


gi769703 


Rattus 
norvegicus 


glutathione S-transferase subunit Yrs 


nnc 
yyj 


TO 

/o 


912 


gi220757 


Rattus 
norvegicus 


glutathione S-transferase Yrs-Yrs 


995 


78 


913 


gil3872813 


Homo sapiens 


fibulin-6 


4538 


93 


913 


gi 14575679 


Homo sapiens 


hemicentin 


4533 


93 


913 


ABB 10492 


Homo sapiens 


Human cDNA SEQ ID NO: 800. 


3343 


90 


914 


AAU80378 


Homo sapiens 


Human DOPD-like protein NOV1 . 


A £. C 

465 




914 


AAR83048 


Homo sapiens 


Human macrophage migration 
inhibitory factor-3 (MIF-3). 


450 


94 


914 


gi2 104581 


Homo sapiens 


phenylpyruvate tautomerase II 


450 


94 


915 


AAR83048 


Homo sapiens 


Human macrophage migration 
inhibitory factor-3 (MIF-3). 


546 


OO 


915 


gi2104581 


Homo sapiens 


phenylpyruvate tautomerase II 


546 


OO 


915 


gi3047378 


Homo sapiens 


D-dopachrome tautomerase 


546 


00 


916 


AAR89197 


Homo sapiens 


-W T t » 111 J l l"V _ J 

Human hepatocellular growth factor 
single chain precursor protein. 


3466 


A 1 

91 


916 


gi2 19681 


Homo sapiens 


HGF activator precursor 


3466 


91 


916 


gi4 190954 


Homo sapiens 


hepatocyte growth factor activator 


3466 


91 


917 


gi5441937 


Homo sapiens 


laminin beta precursor; similar to 
AAB92586 (PID:g2708707) 


8964 


100 


917 


AAY15457 


Homo sapiens 


Human laminin beta 4 protein. 


8838 


93 


917 


AAY15459 


Homo sapiens 


SEQ ID 5 of WG9919347. 


6042 


97 


918 


AAG65887 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 14936. 


5050 


100 


918 


AAG65888 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 14936. 


4582 


84 


918 


AAG68261 


Homo sapiens 


Human POLYS protein sequence 
SEQ ID NO: 16. j 


4543 


83 


919 


ABB74955 


Homo sapiens 


Human lung tumour L524S variant 
protein sequence SEQ ID NO: 166. 


905 


100 


919 


ABB74954 


Homo sapiens 


Human lung tumour L524S variant 
protein sequence SEQ ID NO: 165. 


905 


100 


919 


AAE23751 


Homo sapiens 


Human parathyroid related peptide, 
PTHrP (1-173). 


905 


100 


920 


gi20269129 


Homo sapiens 


MEGF6 


2584 


95 


920 


AAE17919 


Homo sapiens 


Human gene 1 encoded serine 
protease, HMGBM65. 


2468 


99 


920 


AAY72091 


Homo sapiens 


Human serine protease #2 encoded 
by clone HMGBM65. 


2468 


99 


921 


ABB90774 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 281. 


346 


100 


921 


ABB90773 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 279. 


346 


100 


921 


ABB90772 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 277. 


o40 


1UU 




AAU 97039 


Homo sapiens 


nurnan Ljr proiem Lriyu. 


2054 


89 


922 


ABG30500 


Homo sapiens 


Human Carboxypeptidase A. 


2054 


89 


922 


AAB47565 


Homo sapiens 


Protease PRTS-7. 


2054 


89 


923 


AAM52240 


Homo sapiens 


Human MFAP4 SEQ ID NO 3. 


1198 


99 


923 


gi790817 


Homo sapiens 


microfibril-associated glycoprotein 4 


1198 


99 


923 


AAM52239 


Homo sapiens 


Human MAG4V SEQ ID NO 1. 


1197 


100 


924 


AAU81960 


Homo sapiens 


Human PR0536. 


1648 


100 


924 


AAB65173 


Homo sapiens 


Human PR0536 (UNQ337) protein 


1648 


100 
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sequence SEQ ID NO:97. 






924 


AAB94830 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15991. 


1648 


100 


925 


AAU10497 


Homo sapiens 


Human Apolipoprotein A-II, 
APOA2. 


438 


90 


925 


gil3528981 


Homo sapiens 


apolipoprotein A-II 


438 


90 


925 


gi21429231 


Homo sapiens 


apolipoprotein All 


438 


90 


926 


AAY76156 


Homo sapiens 


Human secreted protein encoded by 
gene 33. 


419 


90 


927 


gi 13097252 


Homo sapiens 


Similar to FK506 binding protein 2 
(13 kDa) 


648 


91 


927 


gi337370 


Homo sapiens 


rapamycin- and FK506-binding 
protein 


648 


91 


927 


AAR93551 


Homo sapiens 


Human FKBP-13 immunophilin 
protein. 


628 


90 


928 


AAG73789 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:4553. 


230 


95 


928 


AAB53360 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO:900. 


230 


95 


930 


gi 17982522 


Brucella 
melitensis 


(S)-2-hydroxy-acid oxidase chain D 


1151 


49 


930 


gil7740510 


Agrobacterium 
tumefaciens str. 
C58 (U. 
Washington) 


FAD dependent oxidoreductase 


1149 


49 


930 


gil5157181 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_C_3718p 


1149 


49 


931 


ABB89770 


Homo sapiens 


Human polypeptide SEQ ID NO 
2146. 


366 


70 


931 


AAB75367 


Homo sapiens 


Human secreted protein #26. 


366 


70 


931 


AAU04353 


Homo sapiens 


Mammalian toxicological response 
marker protein #5. 


366 


70 


932 


ABB55723 


Homo sapiens 


Human polypeptide SEQ ID NO 52. 


1107 


99 


932 


AAU39014 


Homo sapiens 


Human secreted protein yc2 1 . 


1107 


99 


932 


AAM94621 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 3279. 


203 


67 


933 


gi 16648246 


Drosophila 
melanogaster 


GH27263p 


228 


33 


933 


gil711197 


Xenopus laevis 


Xfringe2 


184 


25 


933 


gil 679784 


Xenopus laevis 


radical fringe 


184 


26 


934 


AAY73383 


Homo sapiens 


HTRM clone 2280456 protein 
sequence. 


1571 


100 


934 


AAG75282 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6046. 


619 


99 


934 


AAM41045 


Homo sapiens 


Human polypeptide SEQ ID NO 
5976. 


89 


29 


935 


AAG65916 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 239881. 


3041 


99 


935 


gil5705411 


Homo sapiens 


peptidoglycan recognition protein L 
precursor 


3041 


99 


935 


AAG65915 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 239881. 


2892 


99 


936 


AAU97218 


Homo sapiens 


Human G protein-coupled receptor, 
TGR25. 


850 


99 
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936 


AAE23415 


Homo sapiens 


Human G-protein coupled receptor- 7 
(GCREC-7). 


850 


99 


936 


ABB12463 


Homo sapiens 


Human bone marrow expressed 
protein SEQ ID NO: 302. 


769 


100 


937 


gil711232 


Homo sapiens 


14-3-3 protein eta chain 


1094 


94 


937 


gi460779 


Homo sapiens 


14-3-3 eta subtype 


1094 


94 


937 


gi 1477931 


Homo sapiens 


14.3.3 eta chain 


1094 


94 


938 


gil6118441 


Oryctolagus 
cuniculus 


S-100 calcium-binding protein beta 
subunit 


479 


100 


938 


gi 12804681 


Homo sapiens 


SI 00 calcium-binding protein, beta 
(neural) 


479 


100 


938 


gi337730 


Homo sapiens 


SI 00 protein beta subunit 


479 


100 


939 


AAW75082 


Homo sapiens 


Human secreted protein encoded by 
gene 26 clone HTLEV12. 


392 


92 


939 


AAM95398 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4056. 


71 


50 


939 


AAU29184 


Homo sapiens 


Human PRO polypeptide sequence 
#161. 


71 


50 


940 


AAU29315 


Homo sapiens 


Human PRO polypeptide sequence 
#292. 


3743 


99 


940 


ABB75753 


Homo sapiens 


Human pancreas GP354. 


3104 


100 


940 


ABB75751 


Homo sapiens 


Human immunoglobulin superfamily 
member GP354. 


3073 


99 


941 


AA021477 


Homo sapiens 


Human NgR2 protein sequence. 


2133 


100 


941 


AA021482 


Homo sapiens 


Mature human NgR2 protein 
sequence. 


2082 


100 


941 


gi20987877 


Mus museums 


similar to Nogo receptor 


885 


58 


942 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO: 63. 


768 


98 


942 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


238 


90 


942 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene 


238 


90 


943 


AAE22093 


Homo sapiens 


Human kidney specific renal cell 
carcinoma (KSRCC) protein. 


851 


98 j 


943 


AAY85678 


Homo sapiens 


Human kidney disease associated 
protein SEQ ID 10. 


751 


no 


943 


gi3 127193 


Rattus 
norvegicus 


kidney-specific protein 


ooo 


75 | 


944 


ABP41513 


Homo sapiens 


Human ovarian antigen HCOOX52, 
SEQ ID NO:2645. 


2558 


100 


944 


gil 197499 


Homo sapiens 


CI inhibitor 


2527 


100 


944 


AAW18207 


Homo sapiens 


Wild-type CI inhibitor. 


2524 


99 


945 


gil7982052 


Brucella 
melitensis 


RIBOSOMAL PROTEIN LI 1 
METHYLTRANSFERASE 


241 


36 


945 


gil5157854 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_C_4799p 


207 


36 


945 


gil 8 145768 


Clostridium 
perfringens str. 
13 


probable ribosomal protein LI 1 
methyltransferase 


118 


37 


946 


AAB03948 


Homo sapiens 


Human mesenchymal stem cell 
polypeptide. 


462 


98 


946 


AAB64909 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO: 87. 


165 


63 
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946 


AAU01649 


Homo sapiens 


Human secreted protein 
immunogenic epitope encoded by 
gene #12. 


165 


63 


947 


AAE13809 


Homo sapiens 


Human lung tumour-specific protein 
SAL-82. 


2906 


99 


947 


AAB44467 


Homo sapiens 


Human lung tumour- specific antigen 
encoded by cDNA #103. 


2906 


99 


947 


gil3958036 


Homo sapiens 


RUFY1 


2906 


99 


949 


AAB88326 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC002L 


541 


74 


949 


AAB92475 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10549. 


541 


74 


949 


AAE03226 


Homo sapiens 


Human gene 7 encoded secreted 
protein HNTDL21, SEQ ID NO:76. 


541 


74 


950 


ABP47877 


Homo sapiens 


Human polypeptide SEQ ID NO 307. 


1303 


100 


950 


ABP47870 


Homo sapiens 


Human polypeptide SEQ ID NO 300. 


959 


99 


950 


ABP48038 


Homo sapiens 


Human polypeptide SEQ ID NO 468. 


835 


98 


951 


AAU12382 


Homo sapiens 


Human PR0792 polypeptide 
sequence. 


1138 


87 


951 


AAB24416 


Homo sapiens 


Human PR0792 protein sequence 
SEQ ID NO: 155. 


1138 


87 


951 


AAB24055 


Homo sapiens 


Human PR0792 protein sequence 
SEQ ID NO:31. 


1138 


87 


952 


gi7670746 


Homo sapiens 


UDP-glucose:glycoprotein 
glucosyltransferase 1 precursor 


7974 


98 


952 


gil3275621 


synthetic 
construct 


Rat RUGT 


7317 


91 


952 


gi7677176 


Rattus 
norvegicus 


UDP-glucose 

glycoprotein: glucosyltransferase 
precursor 


7317 


91 


953 


gi208 10499 


Homo sapiens 


Similar to RIKEN cDNA 
2900074C18gene 


1237 


100 


953 


AAM42005 


Homo sapiens 


Human polypeptide SEQ ID NO 
6936. 


404 


48 


953 


AAB64390 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA22. 


402 


50 


954 


gil 8676660 


Homo sapiens 


FLJ00229 protein 


2292 


98 


954 


AAM40342 


Homo sapiens 


Human polypeptide SEQ ID NO 
3487. 


1454 


60 


954 


AAM42128 


Homo sapiens 


Human polypeptide SEQ ID NO 
7059. 


765 


60 


955 


gi21707216 


Homo sapiens 


LOC146556 


2100 


99 


955 


AAG81399 


Homo sapiens 


Human AFP protein sequence SEQ 
IDNO:316. 


1910 


100 


955 


AAB61421 


Homo sapiens 


Human TANGO 300 protein. 


1904 


99 


956 


gi21667020 


Homo sapiens 


mutant I beta-l,6-N- 
acetylglucosaminyltransferase C 
form 


1637 


99 


956 


gi21667011 


Homo sapiens 


Ibeta-1,6-N- 

acetylglucosaminyltransferase C 
form 


1637 


99 


956 


gi21667015 


Homo sapiens 


mutant I beta-l,6-N- 
acetylglucosaminyltransferase C 
form 


1633 


99 
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957 


gi2065165 


Homo sapiens 


collagen type XIV 






957 


AAG00322 


Homo sapiens 


Human secreted protein, SEQ ID 

\TA. A A(\1 

JNU: 44U3. 


482 


98 


957 


gi288875 


Gallus gallus 


collagen XIV 


461 


63 


958 


AAG75509 


Homo sapiens 


Human colon cancer antigen protein 
obQ 1JJ JNU:oz/3. 


1/1 CO 

z4oz 


OA 

y4 


958 


A AB 84606 


Homo sapiens 


Amino acid sequence of matrix 
metalloproteinase collagenase 1. 


Z4oZ 


y4 


958 


AAE10415 


Homo sapiens 


Human matrix metalloprotinase-1 
(MMP-1) protein. 


1A QO 

Z4o2 


QA 

y4 


959 


AAY25868 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 57. 


yyj 


QQ 

yy 


959 


gi 15 145797 


Sus scrofa 


basic proline-rich protein 


CO 
oo 


AO 
4Z 


959 


gi3413810 


Mus musculus 


Bassoon 


84 


31 


960 


AAG77807 


Homo sapiens 


Human HSIFL59 serine/threonine 
phosphatase protein sequence. 


4/3 


no 

yo 


960 


AAB58288 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 626. 




yo 


960 


A a \7"mn A A 

AAY59744 


Homo sapiens 


Human normal ovarian tissue derived 
protein 21. 


47^ 
4/ 3 


OS 

yo 


961 


gi40 19247 


Ateline 
herpesvirus 3 


thymidine kinase 


i 1 


40 


962 


gi9956936 


Mus musculus 


L>u(yarj3-y nomolog buv^ynz 


1 ^ i n 
13 1U 


CA 
00 


962 


AAM78701 


Homo sapiens 


Human protein SEQ ID NO 1363. 


1236 


100 


962 


AAW05261 


Homo sapiens 


Chromatin regulator protein 
bU V39H. 


Qt A 

o!4 


jj 


963 


AAG03840 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7921. 


439 


80 


963 


gil698653 


Homo sapiens 


NADPH: ubiquinone oxidoreductase 
subunit B13 


43y 


ol) 


963 


gi 12654023 


Homo sapiens 


NADH dehydrogenase (ubiquinone) 
1 alpha subcornplex, 5 (13kD, B13) 


43 y 


cn 
oU 


964 


gi37347 


Homo sapiens 


iKu gamma cnam (AA i-34Uj 


1 AA7 
lOD / 


yn 


964 


gi3 39407 


Homo sapiens 


Ti antigen CD3-associated protein 
precursor 


1 AO 7 
10Z / 


yo 


964 


gi37018 


Homo sapiens 


pre-gamma-chain (AA -14 to 309) 


1430 


87 
0 / 


965 


AAE06606 


Homo sapiens 


Human protein having hydrophobic 
domain, Hriu/y^. 


566 


96 


965 


AAM94615 


Homo sapiens 


Human reproductive system related 
antigen aisQ 1JJ INU: 


566 


96 


965 


gil8490535 


Mus musculus 


RIKEN cDNA 2610528J11 gene 


388 


78 


966 


gil3654639 


Bos taurus 


D-glucuronyl C5 epimerase 


3 oy 


07 

y / 


966 


gi 13442978 


Mus musculus 


D-glucuronyl C5-epimerase 


3139 


95 


966 


gil 1935177 


Mus musculus 


heparin/heparan sulfate: glucuronic 
acid C5 epimerase 


J 1 34 


yr> 


967 


AAG72204 


Homo sapiens 


Human olfactory receptor 
polypeptide, b.bQ ID NO: loo:). 


HO / 


/ / 


yo f 


A A071 

A/VVJ / LO / J 




"Hiitrmii ol ptorv rpfprYfnr 

XlUIllcUl VJllClL/tWl jf lCUCpiUl 

polypeptide, SEQ ID NO: 1556. 


1138 


74 


967 


AAG71816 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1497. 


1102 


72 


968 


AAB73679 


Homo sapiens 


Human oxidoreductase protein ORP- 
12. 


1918 


100 


968 


gil2655133 


Homo sapiens 


Similar to CGI-63 protein 


1918 


100 


968 


ABB89788 


Homo sapiens 


Human polypeptide SEQ ID NO 


1907 


99 
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2164. 






969 


AAY28995 


Homo sapiens 


Tumour suppressor Del-27 protein 
sequence. 


2309 


59 


969 


AAY15344 


Homo sapiens 


Tumour suppressor protein del-27. 


2309 


59 


969 


gi6062874 


Homo sapiens 


candidate tumor suppressor protein 
DICE1 


2309 


59 


970 


gi387011 


Homo sapiens 


pyruvate dehydrogenase El -alpha 
precursor 


2187 


99 


970 


gi35381 


Homo sapiens 


pyruvate dehydrogenase El' subunit 
(AA 1 - 390) 


2049 


100 


970 


gi219982 


Homo sapiens 


alpha subunit 


2049 


100 


971 


gi2275569 


Homo sapiens 


TCRBV23S1 


660 


99 


971 


gi2104755 


Homo sapiens 


T cell receptor V-beta 23 


660 


99 


971 


gi495404 


Pan troglodytes 


T cell receptor beta chain 


657 


88 


972 


AAM95505 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4163. 


1469 


99 


972 


ABB96188 


Homo sapiens 


Human testicular antigen SEQ ID 
NO: 1572. 


1469 


99 


972 


AAB75360 


Homo sapiens 


Human secreted protein #19. 


1272 


100 


973 


AAE02937 


Homo sapiens 


Human TFRP protein. 


3040 


94 


973 


gi4996563 


Homo sapiens 


protein inhibitor of activatied STAT3 


3040 


94 


973 


gil2654633 


Homo sapiens 


protein inhibitor of activated STAT3 


3040 


94 


974 


gil5099957 


Homo sapiens 


diacylglycerol acyltransferase 2-like 
protein 


855 


93 


974 


gil5099955 


Mus musculus 


diacylglycerol acyltransferase 2-like 
protein 


675 


59 


974 


AAY94889 


Homo sapiens 


Human protein clone HP02485. 


576 


56 


975 


AAB92669 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11033. 


1432 


100 


975 


ABG40833 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
30498. 


399 


98 


975 


AAM31319 


Homo sapiens 


Peptide #5356 encoded by probe for 
measuring placental gene expression. 


399 


98 


976 


gi42 10474 


Navel orange 
infectious 
mottling virus 


polyprotein 


71 


21 


977 


gil209685 


Homo sapiens 


salivary peroxidase 


3754 


96 


977 


gil63307 


Bos taurus 


lactoperoxidase 


3166 


80 


977 


gil 1990122 


Camelus 
dromedarius 


peroxidase 


3153 


81 


978 


AAY01604 


Homo sapiens 


Amino acid sequence of the human 
defensin (Def-X) protein. 


501 


97 


978 


gi29735 


Homo sapiens 


corticostatin/defensin HP-4 precursor 


214 


46 


978 


AAR15222 


Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 


211 ■ 


47 


979 


AAG81415 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:348. 


848 


100 


980 


gi6633820 


Arabidopsis 
thaliana 


F1N19.20 


79 


26 


980 


gi6 180001 


Capra hircus 


pregnancy-associated glycoprotein-8 


73 


30 


980 


gi20071290 


Mus musculus 


Similar to solute carrier family 26, 
member 7 


71 


25 


981 


gi4309953 


Homo sapiens 


T cell receptor gamma chain; similar 
to PID:g339160 


951 


95 
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981 


gi4309950 


Homo sapiens 


T cell receptor gamma chain; match 
to S08328 (PID:g 106470) 


917 


88 


981 


ABG39493 


Homo sapiens 


Human peptide encoded by genome- 

•t * 1 • 1 1 <-1 T~~» /— v TT"\ 

derived single exon probe SEQ ID 
29158. 


679 


77 


982 


gil223888 


synthetic 
construct 


T cell receptor alpha chain 


1066 


79 


982 


gi338766 


Homo sapiens 


T-cell receptor precursor 


987 


73 


982 


gi3089419 


Homo sapiens 


T cell receptor alpha chain 


974 


71 


983 


ABB07518 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 7474438CD1). 


418 


29 


983 


AAB73512 


Homo sapiens 


Human transferase HTFS-19, SEQ 
ID NO: 19. 


409 


30 


983 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
0610008P16 gene 


409 


30 


984 


AAG71286 


Homo sapiens 


Human gene 9-encoded secreted 
protein HMSDL37, SEQ ID NO: 135. 


31 1 


OA 


984 


AAG71251 


Homo sapiens 


Human gene 9-encoded secreted 
protem HiviaiJL,3 /, csnc^ lu MU.yy. 


311 


nn 

90 


984 


gi 13096922 


Mus musculus 


Similar to nadrm 


/4 


A A \ 

44 


985 


gi45 19541 


Mus musculus 


thrombospondin type 1 domain 


1296 


88 


985 


ABP61846 


Homo sapiens 


Human polypeptide SEQ ID NO 200. 


£1A A 

644 


A/Z 

4o 


985 


AAB99220 


Homo sapiens 


Human thrombospondin-30. 


644 


46 


986 


AAG81417 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:352. 


287 


100 


987 


ABB53264 


Homo sapiens 


Human polypeptide #4. 


1904 


100 


987 


gil8565270 


Homo sapiens 


Lib 


1904 


100 


987 


gil 8565266 


Rattus 
norvegicus 


Lib 


1621 


83 


988 


AAB59022 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
730. 


1385 


100 


988 


gil4124958 


Homo sapiens 


nucleotide binding protein 2 (E.coli 
MinD like) 


1385 


100 


988 


gil3559170 


Homo sapiens 


C447E6.1 (nucleotide binding 
protein 1 (E.coli MinD like) ) 


1380 


100 


989 


AAU99292 


Homo sapiens 


Human chordin-like associated 
protein #1 . 


3916 


100 


989 


AAU99293 


Homo sapiens 


Human chordin-like associated 
protein #2. 


3073 


81 


989 


AAE07119 


Homo sapiens 


Human gene 12 encoded secreted 
protein fragment, SEQ ID NO: 13 6. 


2411 


98 


990 


AAB26105 


Homo sapiens 


Human DAN/Cerberus-related 
protein 6 (hDCR6) #1. 


1693 


87 


990 


AAE 17089 


Homo sapiens 


Human osteolevin protein. 


439 


66 


990 


ABB07209 


Homo sapiens 


TT t "1 J f\ 1 * * 1 

Human cloaked- 2 polypeptide 
sequence. 


439 


66 


991 


' c\c\ f a Ann 

gi9964099 


Chlamydia 
trachomatis 


inclusion membrane protein 


/4 




992 


gi21619848 


Homo sapiens 


Similar to immunoglobulin lambda 
joining 3 


1002 


89 


992 


gi4490538 


Homo sapiens 


lambda-immunoglobulin light chain 


927 


81 


992 


gi33746 


Homo sapiens 


immunoglobulin lambda light chain 


914 


80 


993 


AAG03466 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7547. 


97 


100 
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993 


gi4063766 


Aspergillus 
nidulans 


chitinase 


y i 


^a 
3U 


993 


gi36 17766 


Anopheles 
gambiae 


ICHIT protein 


88 


27 


994 


gi203246 


Rattus 
norvegicus 


ji 1 vi 1 i 

cell adhesion-like molecule 


1 11Q 

1 /Dy 


07 

y / 


994 


gi5 14374 


Homo sapiens 


opioid-binding cell adhesion 
molecule 




y / 


994 


gi586 


Bos taurus 


put. pre-OPGAM (AA 1 - 345) 




0*5 

yj 


995 


AAB53088 


Homo sapiens 


Human angiogenesis-associated 
protein PR0328, SbQ 1L) JNO:13z. 


2391 


91 


995 


AAB 80260 


Homo sapiens 


Human PR0328 protein. 


2391 


y i 


995 


AAU12351 


Homo sapiens 


Human PR0328 polypeptide 
sequence. 


2391 


91 


996 


AAB85144 


Homo sapiens 


Human NKCR polypeptide (clone ID 
HMSOM53J. 


1205 


83 


996 


AAY96226 


Homo sapiens 


Human high affinity Fc receptor, 
FcgammaRI. 


354 


41 


996 


gi31332 


Homo sapiens 


FcRI (AA 1-374) 


354 


41 


997 


ABB50835 


Homo sapiens 


Human secreted protein encoded by 
gene 80 SEQ ID NO:788. 


HA 
/4 


DO 


998 


ABP61434 


Homo sapiens 


Human NF-kB activating protein 
SEQ ID NO 21. 


/in 

417 


i An 
IUU 


998 


AAU07442 


Homo sapiens 


Human Wntl Upregulated protein 2 
(WUP2). 


A T H 

417 


1 An 


998 


AAU07441 


Homo sapiens 


Human Wntl Upregulated protein 1 
(WUP1). 


41 / 


1 AA 


999 


AAB08732 


Homo sapiens 


Amino acid sequence of a human 
OLD-35 polypeptide. 


1671 


85 


999 


gi20372922 


Homo sapiens 


polynucleotide phosphorylase-like 
protein 


lo / 1 


Oj 


999 


AAB 92684 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11065. 




OO 


1000 


gi 199582 


Mus musculus 


B(2)-microglobulin 


olo 


yj 


1000 


gi50105 


Mus musculus 


beta2-microglobulin precursor (aa - 
20 to 99) 


614 


95 


1000 


gil99576 


Mus musculus 


B(2)-microglobulin 


609 


94 


1001 


AAB93335 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12441. 


672 




1001 


AAE21620 


Homo sapiens 


Human gene 7 encoded secreted 
protein, SEQ ID NO: 92. 


427 


91 


1001 


AAE21604 


Homo sapiens 


Human gene 7 encoded secreted 
protein rllrUiioj, oxlQ LL> JN(J. /O. 


4UZ 


OA 

yu 


1002 


gil5559608 


Homo sapiens 


Similar to zinc finger protein 16 
(KOX 9) 


2079 


100 


1002 


gi488555 


Homo sapiens 


zinc finger protein ZNF135 


749 


65 


1002 


AAB21006 


Homo sapiens 


TT 1 * " J 1 ' J* 1 ' _ 

Human nucleic acid-bmdmg protein, 


HAH 

74/ 


04 


1003 


ABB79480 


Homo sapiens 


Human zinc finger protein 75.68. 


1689 


56 


1003 


gi 1769491 


Homo sapiens 


kruppel-related zinc finger protein 


1672 


53 


1003 


AAM39130 


Homo sapiens 


Human polypeptide SEQ ID NO 
2275. 


1669 


52 


1005 


AAB23641 


Homo sapiens 


Human secreted protein SEQ ID NO: 
97. 


609 


100 


1005 


gi 13 129458 


Oryza sativa 


polyprotein 


89 


31 
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[Oryza sativa 

(japonica 

cultivar-group)] 








1005 


gi7228457 


Oryza sativa 

(japonica 

cultivar-group) 


Similar to Sorghum bicolor Gypsy- 
Ty3 type retrotransposon RetroSorl, 
polyprotein. (AF098806) 


87 


29 


1006 


gil 1493473 


Homo sapiens 


PR02225 


163 


67 


1007 


AAM52305 


Homo sapiens 


Human zyxine. 


2815 


91 


1007 


AAG68191 


Homo sapiens 


Zyxin protein SEQ ID NO: 107. 


2815 


91 


1007 


gil 545954 


Homo sapiens 


zyxin 


2815 


91 


1008 


AAM00955 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 431. 


704 


100 


1008 


gil 8028488 


Homo sapiens 


cytosolic leucine-rich protein 


631 


99 


1008 


gi2 1666364 


Bos taurus 


leucine-rich protein 


588 


90 


1009 


AAE01420 


Homo sapiens 


Human secreted protein fragment, 
SEQ ID NO: 144. 


612 


100 


1009 


gil 7225457 


Homo sapiens 


autism-related protein 1 


76 


38 


1009 


AAM79126 


Homo sapiens 


Human protein SEQ ID NO 1788. 


72 


29 


1010 


gil6877231 


Homo sapiens 


Similar to RIKENcDNA 
27000 19D07 gene 


997 


100 


1010 


AAM39593 


Homo sapiens 


Human polypeptide SEQ ID NO 
2738. 


94 


100 


1010 


gi7302568 


Drosophila 
melanogaster 


CG15073-PA 


91 


19 


1011 


AAB43434 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 879. 


413 


78 


1011 


AAY07039 


Homo sapiens 


Breast cancer associated antigen 
precursor sequence. 


413 


78 


1011 


gil 7932966 


Homo sapiens 


ribosomal protein PI 


413 


78 


1012 


AAW88457 


Homo sapiens 


Human lysophospholipase IHLP. 


1125 


93 ! 


1012 


ABP51416 


Homo sapiens 


Human MDDT SEQ ID NO 438. 


843 


100 


1012 


AAB75386 


Homo sapiens 


Human secreted protein #45. 


825 


100 


1013 


AAG81374 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:266. 


919 


98 


1013 


gi20977549 


Danio rerio 


DTIPIAIO-Iike protein 


354 


41 


1013 


gi665970 


Saccharomyces 
cerevisiae 


Ylr435wp 


169 


26 


1014 


gi20988991 


Mus musculus 


RIKEN cDNA 2810405K02 gene 


920 


88 


1014 


AAM93895 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4031. 


777 


100 


1014 


ABB84903 


Homo sapiens 


Human PROH98 protein sequence 
SEQ ID NO: 174. 


187 


36 


1015 


AAB56791 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1369. 


496 


98 


1015 


gil9401678 


Giardia 
intestinalis 


endosomal AAA ATPase-like protein 


76 


38 


1016 


ABB44579 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 36. 


655 


100 


1016 


AAB53427 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO:967. 


655 


100 


1016 


gi2286227 


Bos taurus 


myocardial vascular inhibition factor 


655 


100 


1017 


AAB81188 


Homo sapiens 


Human zinc finger protein 52 (ZFP- 
52). 


2389 


98 


1017 


AAB95368 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17684. 


1712 


100 
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1017 


ABP51456 


Homo sapiens 


Human MDDT SEQ ID NO 478. 


1561 


99 


1018 


gi6434346 


Caenorhabditis 
elegans 


Y105E8B.4 


77 


33 


1019 


AAG66831 


Homo sapiens 


Human DNA-dependent protein 
kinase 9. 


A A O 

443 


no 


1019 


AAB90816 


Homo sapiens 


Human shear stress-response protein 
SEQ ID NO: 140. 






1019 


AAG66832 


Homo sapiens 


Human DNA-dependent protein 
kinase 9 N-terminal peptide. 


/y 




1020 


gil377897 


Homo sapiens 


heart protein 


1631 


100 


1020 


gi7209525 


Homo sapiens 


DRAL/Slim3/FHL2 




QQ 


1020 


gi5825391 


Mus musculus 


four and half LIM domain protein 2 


1524 


91 


1021 


gil8139947 


Homo sapiens 


HLCDGP1 




QQ 

yy 


1021 


gi7770259 


Homo sapiens 


PR02975 


794 


100 


1021 


gil237130 


Escherichia coli 


O antigen polymerase 


85 


26 


1022 


gil9263712 


Homo sapiens 


Similar to LOG 146557 


972 


100 


1022 


AAG81348 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:214. 


528 


62 


1022 


AAM88837 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 16430. 


179 


on 

80 


1023 


AAG01390 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5471. 


297 


100 


1023 


gi456681 


Pseudorabies 
virus 


helicase 


OA 

80 


25 


1023 


AAM85692 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:13285. 


78 


63 


1024 


gi4235144 


Homo sapiens 


BC39498 1 


1423 


62 


1024 


gi21265141 


Homo sapiens 


Similar to zinc finger protein 91 
(HPF7, HTF10) 


1404 


60 


1024 


gil4348591 


Homo sapiens 


KRAB zinc finger protein 


1403 


58 


1025 


gi 18490643 


Homo sapiens 


Similar to recombination activating 
gene 2 


2849 


99 


1025 


gil65680 


Oryctolagus 
cuniculus 


recombination activating protein 


2666 


91 


1025 


gi2576246 


Mus musculus 


RAG-2 protein 


2594 


88 


1026 


AAG75278 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6042. 


1058 


95 


1026 


gi600255 


Gallus gallus 


caldesmon 


147 


2o 


1026 


gi211896 


Gallus gallus 


h-caldesmon 


147 


28 


1027 


AAY87341 


Homo sapiens 


Human signal peptide containing 
protein HSPP-118 SEQ ID NO: 118. 


699 


99 


1027 


gil2311853 


Leishmania 
major 


possible surface antigen 


77 


31 


1028 


AAM92844 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 2193. 


86 


38 


1028 


ABB11195 


Homo sapiens 


Human transmembrane protein 
homologue, SEQ ID NO: 1565. 


82 


25 


1028 


ABG66815 


Homo sapiens 


Human prostate specific protein 
DEX0283 123. 


/o 


ou 


1029 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


587 


41 


1029 


AAM79986 


Homo sapiens 


Human protein SEQ ID NO 3632. 


579 


41 


1029 


AAM79002 


Homo sapiens 


Human protein SEQ ID NO 1664. 


579 


41 


1031 


ABP42929 


Homo sapiens 


Human ovarian antigen HPDRS87, 
SEQ ID NO:4061. 


820 


93 
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1031 


ABP41129 


Homo sapiens 


Human ovarian antigen HE2RG21, 
SEQ ID NO:2261. 


820 


93 


1031 


AAU01195 


Homo sapiens 


Human cyclophilin A protein. 


820 


93 


1032 


AAE06643 


Homo sapiens 


Human G-protein coupled receptor 
(NGPCR) #2. 


671 


99 


1032 


AAE06642 


Homo sapiens 


Human G-protein coupled receptor 
(NGPCR) #1. 


671 


99 


1032 


gi7291589 


Drosophila 
melanogaster 


CG18679-PA 


214 


34 


1033 


AAG03055 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7136. 


269 


100 ' 


1034 


AAB38043 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 10 clone 
HWHGP71. 


125 


36 


1034 


gi5305335 


Mycobacterium 
tuberculosis 


proline-rich mucin homolog 


105 


33 


1034 


gi5917666 


Zea mays 


extensin-like protein 


104 


37 


1035 


AAM93942 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4126. 


3226 


99 


1035 


ABB 11422 


Homo sapiens 


Human Zn finger protein homologue, 
SEQ ID NO: 1792. 


2760 


96 


1035 


gi6467206 


Homo sapiens 


gonadotropin inducible transcription 
repressor-4 


2032 


57 


1036 


AAB95007 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16685. 


518 


86 


1036 


gi21410398 


Mus musculus 


RIKEN cDNA 2610034E13 gene 


81 


32 


1036 


gi45906 


Proteus vulgaris 


hlyC protein (AA 1-54) 


72 


45 


1037 


AAY27616 


Homo sapiens 


Human secreted protein encoded by 
gene No. 50. 


562 


99 


1037 


gil7902598 


Rice black 
streaked dwarf 
virus 


P6 protein 


71 


25 


1038 


gil6588681 


Homo sapiens 


anion transporter/exchanger-9 


4295 


95 


1038 


AAE21166 


Homo sapiens 


Human TRICH-10 protein. 


3612 


91 


1038 


gi 13344999 


Homo sapiens 


solute carrier family 26 member 6 


1298 


37 


1039 


AAU12254 


Homo sapiens 


Human PR04343 polypeptide 
sequence. 


780 


100 


1039 


AAM40835 


Homo sapiens 


Human polypeptide SEQ ID NO 
5766. 


780 


100 


1039 


AAY76141 


Homo sapiens 


Human secreted protein encoded by 
gene 18. 


780 


100 


1040 


ABB89694 


Homo sapiens 


Human polypeptide SEQ ID NO 
2070. 


622 


77 


1040 


AAY59672 


Homo sapiens 


Secreted protein 10S-006-5-0-E6-FL. 


622 


77 


1040 


AAB94543 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15290. 


618 


76 


1041 


AAY92710 


Homo sapiens 


Human membrane-associated protein 
Zsig24. 


704 


97 


1041 


ABB89722 


Homo sapiens 


Human polypeptide SEQ ID NO 
2098. 


566 


99 


1041 


AAY87250 


Homo sapiens 


Human signal peptide containing 
protein HSPP-27 SEQ ID NO:27. 


566 


99 


1042 


AAU29316 


Homo sapiens 


Human PRO polypeptide sequence 
#293. 


2932 


99 


1042 


ABB05749 


Homo sapiens 


Human G protein-coupled receptor 


1591 


44 
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NOVla protein SEQ ID NO:2. 






1042 


gil4572521 


Homo sapiens 


NEPH1 


1519 


51 


1043 


AAU00688 


Homo sapiens 


Human CD59 protein. 


710 


100 


1043 


AAW26318 


Homo sapiens 


Human CD59. 


710 


100 


1043 


AAR80240 


Homo sapiens 


Human membrane attack complex 
inhibition factor. 


710 


100 


1044 


gil7390957 


Mus musculus 


Similar to RIKEN cDNA 
2010001E11 gene 


1455 


74 


1044 


gi6841140 


Homo sapiens 


HSPC100 


498 


100 


1044 


gil 7985273 


Brucella 
melitensis 


GLUCOSE/GALACTOSE 
TRANSPORTER 


124 


22 


1045 


AAB56632 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1210. 


3377 


99 


1045 


gil3097708 


Homo sapiens 


ribophorin II 


3152 


100 


1045 


gi4730801 


Homo sapiens 


dJ343K2.2.1 (ribophorin II (isoform 
D) 


3152 


100 


1046 


AAB70690 


Homo sapiens 


Human hDPP protein sequence SEQ 
ID NO:7. 


598 


i rvA 


1046 


AAG89279 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 399. 


598 


100 


1046 


gil 3 182757 


Homo sapiens 


HTPAP 


598 


100 


1047 


gi2276448 


Homo sapiens 


MHC class I HLA-A 


1794 


93 


1047 


gi6815812 


Homo sapiens 


MHC class I antigen heavy chain 


1794 


93 


1047 


gil245460 


Homo sapiens 


MHC class I HLA-A 


1786 


92 


1048 


ABP41629 


Homo sapiens 


Human ovarian antigen HOOJQ91, 
SEQ ID NO:2761. 


675 


90 


1048 


AAB95392 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17743. 


564 


78 


1048 


AAM79768 


Homo sapiens 


Human protein SEQ ID NO 3414. 


564 


78 


1049 


gil4017773 


Mus musculus 


Cgl0671-like 


1517 


96 


1049 


gil4017764 


Mus musculus 


CG10671-like 


1517 


96 


1049 


ABB89676 


Homo sapiens 


Human polypeptide SEQ ID NO 
2052. 


957 


89 


1050 


AAG81431 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:380. 


503 


97 


1050 


AAE23305 


Homo sapiens 


Human nectin-4 protein #4. 


128 


32 


1050 


gil9353148 


Mus musculus 


Similar to poliovirus receptor-related 
4 


127 


27 


1051 


gi20072551 


Mus musculus 


RIKEN cDNA 49305 11 Jll gene 


420 


45 


1051 


gil7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


147 


25 


1051 


gil2836893 


Gallus gallus 


IPR328-like protein 


147 


29 


1052 


ABB84978 


Homo sapiens 


Human PRO4430 protein sequence 
SEQ ID NO:324. 


436 


70 


1052 


ABB95584 


Homo sapiens 


Human angiogenesis related protein 
PRO4430 SEQ ID NO: 324. 


436 


70 


1052 


AAU29273 


Homo sapiens 


Human PRO polypeptide sequence 
#250. 


436 


70 


1053 


AAB88325 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0020. 


912 


99 


1053 


AAB53257 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO:797. 


859 


99 


1053 


AAY87264 


Homo sapiens 


Human signal peptide containing 
protein HSPP-41 SEQ IDNO:41. 


315 


63 


1054 


AAU12201 


Homo sapiens 


Human PR01779 polypeptide 


1819 


54 
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sequence. 






1054 


AAB25594 


Homo sapiens 


Protein encoded by human secreted 
protein gene #L 


1819 


54 


1054 


gil234787 


Xenopus laevis 


up-regulated by thyroid hormone in 
tadpoles; expressed specifically in 
the tail and only at metamorphosis; 
membrane bound or extracellular 
protein; C-terminal basic region 


1799 


53 


1055 


AAW55035 


Homo sapiens 


HPURR amino acid sequence. 


2014 


100 


1055 


AAW47066 


Homo sapiens 


Human brain P2X-1 receptor 
polypeptide. 


2014 


100 


1055 


gi4099139 


Homo sapiens 


P2X4 purinoreceptor 


2014 


100 


1056 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDNA 22360 encoded 
protein. 


1255 


85 


1056 


AAM42468 


Homo sapiens 


Human kidney related polypeptide 
SEQ ID NO 337. 


145 


90 


1056 


AAM99653 


Homo sapiens 


Human excretory related polypeptide 
SEQ ID NO 390. 


145 


90 


1057 


ABB53267 


Homo sapiens 


Human polypeptide #7. 


3372 


98 


1057 


AA014449 


Homo sapiens 


Protein of human Zona Pellucida 1 
(Zzpl). 


3367 


98 


1057 


gi972946 


Mus musculus 


ZP1 precursor 


on 1 c 

11 lo 


O / 


1058 


gil 57791 56 


Homo sapiens 


Similar to RIKEN cDNA 
1810073N04 gene 


1858 


100 


1058 


gil3097045 


Mus musculus 


Similar to RIKEN cDNA 
1810073N04 gene 


1719 


91 


1058 


AAM79693 


Homo sapiens 


Human protein SEQ ID NO 3339. 


1 1 1 o 

1 loo 


1 AH 


1059 


AAM79993 


Homo sapiens 


Human protein SEQ ID NO 3639. 


1736 


orv 


1059 


AAM79009 


Homo sapiens 


Human protein SEQ ID NO 1671. 


1736 




1059 


ABB12000 


Homo sapiens 


Human prostaglandin DP receptor 
homologue, SEQ ID NO: 2370. 


1736 


89 


1060 


AAU79946 


Homo sapiens 


Human transporter protein sequence. 


2907 




1060 


AAE21181 


Homo sapiens 


Human TRICH-25 protein. 


ZD 11 


y i 


1060 


gi2811122 


Xenopus laevis 


NaDC-2 


1742 


54 


1061 


AAM79483 


Homo sapiens 


Human protein SEQ ID NO 3129. 


1698 


88 


1061 


AAM78499 


Homo sapiens 


Human protein SEQ ID NO 1 161 . 


1698 


88 


1061 


ABB11938 


Homo sapiens 


Human cystinosin homologue, SEQ 
ID NO:2308. 


1698 


88 


1062 


gil2656590 


Danio rerio 


P2x purinoceptor subunit 4 


72 


40 


1063 


AAG68335 


Homo sapiens 


Human CSP2 protein SEQ ID NO:4. 


1354 


99 


1063 


gil9525540 


Homo sapiens 


lymphocyte effector toxicity 
activation ligand 


1330 


98 


1063 


AAY36071 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 456. 


1252 


92 


1064 


gil5277509 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 2 


1548 


97 


1064 


ABP41089 


Homo sapiens 


Human ovarian antigen HoIajCjSo, 
SEQ ID NO:2221. 


1 jU l 


i no 


1064 


gil813823S 


Bos taurus 


C-14 sterol reductase 


1376 


87 


1065 


AAM93346 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
2891. 


5017 


98 


1065 


AAM93761 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3754. 


4620 


99 


1065 


AAB92756 


Homo sapiens 


Human protein sequence SEQ ID 


2856 


99 
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0/ 

70 

Identity 








NO:11216. 






1066 


AAE23544 


Homo sapiens 


Human FAIL protein. 


1730 


99 


1066 


AAE23554 


Homo sapiens 


Human FAIL protein #3. 


172o 


QQ 

yy 


1066 


AAE23556 


Homo sapiens 


Human FAIL protein #5. 


1726 


99 


1067 


gil8480772 


Mus musculus 


olfactory receptor MORI 01 -2 


1 1 tin 


on 
OZ 


1067 


gil8479346 


Mus musculus 


olfactory receptor MORI 01-1 


1193 


79 


1067 


AAG72119 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1800. 


1043 


81 


1068 


ABB89454 


Homo sapiens 


Human polypeptide SEQ ID NO 
1830. 


530 


95 


1068 


ABP41764 


Homo sapiens 


Human ovarian antigen HE6CR19, 
SEQ ID NO:2896. 


530 


95 


1068 


AAU04352 


Homo sapiens 


Mammalian toxicological response 
marker protein #4. 


530 


95 


1069 


AAY33300 


Homo sapiens 


Human hALK-2 clone HP53 protein. 


1572 


100 


1069 


AAR85206 


Homo sapiens 


Human ALK-2. 


1572 


100 


1069 


gil381584 


Bos taurus 


activin receptor type I 


1572 


100 


1070 


gil6359163 


Homo sapiens 1 


Similar to RIKEN cDNA 
2310014B08 gene 


1332 


94 


1070 


gil 8043464 


Mus musculus 


RIKEN cDNA 2310014B08 gene 


1226 


77 


1070 


AAB64401 


Homo sapiens 


Amino acid sequence of human 
intracellular signalling molecule 
INTRA33. 


212 


35 


1071 


AAW60043 


Homo sapiens 


Human MHC class I chain-related 
gene A (MICA) polypetide. 


1894 


93 


1071 


gil405893 


Homo sapiens 


MHC class I chain-related protein A 


1894 


93 


1071 


gil6877353 


Homo sapiens 


MHC class I polypeptide-related 
sequence A 


1838 


90 


1072 


gil5292437 


Drosophila 
melanogaster 


LP10272p 


439 


39 


1072 


AAB 80378 


Homo sapiens 


Secreted protein encoded by gene #8. 


210 


28 


1072 


AAY87336 


Homo sapiens 


Human signal peptide containing 
protein HSPP-1 13 SEQ ID NO: 1 13. 


210 


28 


1073 


AAB58289 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 627. 


2303 


00 


1073 


ABB55767 


Homo sapiens 


Human polypeptide SEQ ID NO 140. 


2163 


86 


1073 


AAU39058 


Homo sapiens 


Human secreted protein pe584__2. 


2163 


86 


1074 


AAB58289 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 627. 


2303 


86 


1074 


ABB55767 


Homo sapiens 


Human polypeptide SEQ ID NO 140. 


2163 


86 


1074 


AAU39058 


Homo sapiens 


Human secreted protein pe584_2. 


2163 


86 


1075 


AAM93703 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3632. 


1061 


93 


1075 


AAE04780 


Homo sapiens 


Human vesicle trafficking protein-23 
(VETRP-23) protein. 


864 


100 


1075 


ABB08160 


Homo sapiens 


Human cytoskeleton-associated 
protein (CSAP>4 (ID: 

HA A /~^T~\ 1 \ 

/4 /Z/z4tUl ). 


758 


87 


1076 


ABB06255 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-6 protein SEQ ID NO: 15. 


1511 


100 


1076 


ABB06254 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-5 protein SEQ ID NO:13. 


1511 


100 


1076 


ABB06252 


Homo sapiens 


Human G protein-coupled receptor 
TGR17-3 protein SEQ ID NO:7. 


1511 


100 


1077 


AAE16786 


Homo sapiens 


Human transporter and ion channel- 


2679 


99 
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23 (TRICH-23) protein. 






1077 


ABB08218 


Homo sapiens 


Human membrane transporter protein 
57256. 


1759 


68 


1077 


AAM38697 


Homo sapiens 


Human polypeptide SEQ ID NO 
1842. 


1249 


53 


1078 


AAB85029 


Homo sapiens 


Protein encoded by BAP28 cDNA 
consisting of exons 1 to 45. 


3444 


81 I 


1078 


AAW54099 


Homo sapiens 


Homo sapiens BAP28 sequence. 


2219 


88 


1078 


AAB92729 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 159. 


1588 


92 


1079 


ABB07526 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 1962105CD1). 


2580 


99 


1079 


AAM93720 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3669. 


2331 


92 


1079 


gi57806 


Rattus sp. 


gamma-glutamyltranspeptidase (AA 
1-568) 


236 


28 


1080 


AAM78536 


Homo sapiens 


Human protein SEQ ID NO 1 198. 


6839 


100 


1080 


AAM79520 


Homo sapiens 


Human protein SEQ ID NO 3166. 


6820 


99 i 


1080 


ABG40303 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
29968. 


2092 


100 


1081 


gi20809440 


Homo sapiens 


sterol O-acyltransferase (acyl- 
Coenzyme A: cholesterol 
acyltransferase) 1 


1783 


65 


1081 


gi4878022 


Homo sapiens 


acyl-coenzyme A: cholesterol 
acyltransferase 


1779 


65 


1081 


AAW38416 


Homo sapiens 


Human acyl-coenzyme A: cholesterol 
acyltransferase I. 


1774 


65 


1082 


gi22002433 


Homo sapiens 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


7014 


100 


1082 


gi21979456 


Homo sapiens 


raptor 


7014 


100 


1082 


gi22002435 


Mus musculus 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


6819 


96 


1083 


AAY33301 


Homo sapiens 


Human hALK-3 clone ONF5 protein. 


2647 


89 


1083 


AAR85207 


Homo sapiens 


Human ALK-3. 


2647 


89 


1083 


AAR55368 


Homo sapiens 


Human Activin receptor-like kinase 3 
(hALK-3). 


2647 


89 


1084 


AAY33301 


Homo sapiens 


Human hALK-3 clone ONF5 protein. 


1829 


92 


1084 


AAR85207 


Homo sapiens 


Human ALK-3. 


1829 


92 


1084 


AAR55368 


Homo sapiens 


Human Activin receptor-like kinase 3 
(hALK-3). 


1829 


92 


1085 


AAW90S73 


Homo sapiens 


Human brain- specific dysferlin 
protein. 


1329 


53 


1085 


AAW90868 


Homo sapiens 


Human dysferlin protein. 


1329 


53 


1085 


AAY82643 


Homo sapiens 


Human dysferlin protein sequence 
SEQ ID NO:2. 


1329 


53 


1086 


gil9343765 


Mus musculus 


Similar to dysferlin 


1860 


47 


1086 


ABB89615 


Homo sapiens 


Human polypeptide SEQ ID NO 
1991. 


1853 


47 


1086 


gi6731235 


Homo sapiens 


myoferlin 


1853 


47 


1087 


AAY92321 


Homo sapiens 


Human alpha-2-delta-D calcium 
channel subunit. 


5881 


99 


1087 


AAB62262 


Homo sapiens 


Human calcium channel alpha2delta 


5742 


99 



WO 03/054152 PCT/US02/39555 



Table 2B 
244 



SEQ 
ID 


Hit ID 


Species 


Description 


S 

score 


% 
Identity 








subunit. 






1087 


AAU01038 


Homo sapiens 


Human secreted soluble alpha2delta 
calcium channel subunit #18 protein. 


5742 


99 


1088 


gi 18676422 


Homo sapiens 


FLJ00088 protein 


3492 


96 


1088 


ABP51380 


Homo sapiens 


Human MDDT SEQ ID NO 402. 


2083 


99 


1088 


gi2104689 


Mus museums 


alpha glucosidase II, alpha subunit 


1987 


53 


1089 


AAY01143 


Homo sapiens 


Secreted protein encoded by gene 9 
clone HSIDY06. 


238 


100 


1090 


AAY81261 


Homo sapiens 


Human DNA structure-specific 
recognition protein 1 (SSRP1). 


3683 


100 


1090 


AAW39212 


Homo sapiens 


Human SSRP1 protein. 


3683 


100 


1090 


AAR38744 


Homo sapiens 


Human SSRP. 


3683 


100 


1091 


gil77814 


Homo sapiens 


alpha- 1-antitrypsin-related protein 


1922 


90 


1091 


gil5990507 


Homo sapiens 


Similar to serine (or cysteine) 
proteinase inhibitor, clade A (alpha- 1 
antiproteinase, antitrypsin), member 
1 


1408 


66 


1091 


AAY26925 


Homo sapiens 


Human alpha 1-anti-trypsin type Ml 
protein. 


1407 


66 


1092 


gil6877139 


Homo sapiens 


Similar to RIKEN cDNA 
1300019N10 gene 


2273 


100 


1092 


AAB56819 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1397. 


1054 


100 


1092 


AAM95577 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4235. 


627 


91 


1093 


gil 86055 12 


Homo sapiens 


Similar to CAP-binding protein 
complex interacting protein 2 


1523 


100 


1093 


AAO06814 


Homo sapiens 


Human polypeptide SEQ ID NO 
20706. 


1314 


100 


1093 


AAY57946 


Homo sapiens 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


1094 


AAM93603 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3418. 


2973 


99 


1094 


gil9571657 


Caenorhabditis 
elegans 


similar to Yeast YEH4 like protein 


964 


41 


1094 


ABB89291 


Homo sapiens 


Human polypeptide SEQ ID NO 
1667. 


590 


85 


1095 


gi4959568 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP 


1650 


94 


1095 


AAO17206 


Homo sapiens 


Human secreted protein SEQ ID NO: 
105. 


1336 


79 


1095 


ABB90262 


Homo sapiens 


Human polypeptide SEQ ID NO 
2638. 


872 


69 


1096 


gil8031730 


Homo sapiens 


GK006 


1405 


98 


1096 


AAB92609 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10874. 


1074 


99 


1096 


AAM40309 


Homo sapiens 


Human polypeptide SEQ ID NO 
3454. 


1074 


99 


1097 


AAG03767 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7848. 


612 


90 


1097 


AAB43694 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 11 39. 


612 


90 


1097 


gi285910 


Homo sapiens 


ATP synthase subunit c precursor 


612 


90 


1098 


gi897827 


Homo sapiens 


iron-responsive element-binding 
protein/iron regulatory protein 2 


4968 


99 
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1098 


AAE19851 


Homo sapiens 


Human wild-type IRP-2 protein. 


4909 


99 


1098 


gi897581 


Homo sapiens 


iron-regulatory protein 2 


4909 


99 


1099 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1099 


gi437051 


Acipenser 
transmontanus 


vitellogenin 


76 


23 


1099 


gi21539886 


Arabidopsis 
thaliana 


transcription activator 


75 


26 


1100 


AAM84273 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:11866. 


328 


92 


1100 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1100 


gi535260 


Plasmodium 
reichenowi 


STARP antigen 


76 


21 


1101 


gi3551150 


Canine 
herpesvirus 


immediate-early protein 


77 


28 


1101 


gi437051 


Acipenser 
transmontanus 


vitellogenin 


76 


23 


1101 


gi535260 


Plasmodium 
reichenowi 


STARP antigen 


75 


21 


1102 


AAU12296 


Homo sapiens 


Human PR07171 polypeptide 
sequence. 


168 


67 


1102 


ABB85001 


Homo sapiens 


Human PR028631 protein sequence 
SEQ ID NO:370. 


90 


40 


1102 


ABB95607 


Homo sapiens 


Human angiogenesis related protein 
PR028631 SEQ ID NO: 370. 


90 


40 


1103 


gil 1558264 


Homo sapiens 


sphingosine- 1 -phosphatase 


2032 


89 


1103 


gil3447199 


Homo sapiens 


sphingosine- 1 -phosphate 
phosphatase 


1994 


87 


1103 


gil5778670 


Mus mus cuius 


sphingosine- 1 -phosphate 
phosphatase 


1721 


76 


1104 


ABB72215 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 331. 


1544 


100 


1104 


ABB72150 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 189. 


1544 


100 


1104 


ABB84843 


Homo sapiens 


Human PRO301 protein sequence 
SEQ ID NO:54. 


1544 


100 


1105 


AAG03757 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7838. 


506 


100 


1105 


gil78836 


Homo sapiens 


apolipoprotein C-II 


506 


100 


1105 


gi757915 


Homo sapiens 


apoCII protein 


506 


100 


1106 


AAU97773 


Homo sapiens 


Human Fortilin polypeptide. 


794 


97 


1106 


AAR55698 


Homo sapiens 


Tumor protein p2 1 . 


794 


97 


1106 


gil 8482460 


Sus scrofa 


translationally controlled tumor 
protein 


794 


97 


1107 


gil2082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


3519 


84 


1107 


gil2082723 


Gallus gallus 


B cell phosphoinositide 3-kinase 
adaptor 


ZoUo 


oy 


1107 


gi20987486 


Homo sapiens 


similar to B cell phosphoinositide 3- 
kinase adaptor 


1829 


97 


1108 


gi21708117 


Homo sapiens 


similar to hepatocellular carcinoma- 
associated antigen HCA557b 


318 


37 


1108 


gi!8252514 


Homo sapiens 


hepatocellular carcinoma-associated 
antigen HCA557b 


318 


37 
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1108 


gi20071260 


Mus musculus 


RIKEN cDNA 2310038H17 gene 


306 


37 


1109 


gil5076511 


Homo sapiens 


nonmuscle myosin light chain 2 


881 


99 


1109 


gil3436446 


Homo sapiens 


myosin regulatory light chain 


881 


99 


1109 


gi22137716 


Mus musculus 


myosin regulatory light chain 


881 


99 


1110 


gil7391357 


Homo sapiens 


FXYD domain-containing ion 
transport regulator 7 


390 


96 


1110 


gil9354238 


Mus musculus 


FXYD domain-containing ion 
transport regulator 1 


153 


45 


1110 


gi4206711 


Mus musculus 


phospholemman precursor 


153 


45 


1111 


AAM49040 


Homo sapiens 


Human testicular development- 
specific protein 10 (NYD-SP10). 


1486 


85 


1111 


gil3272522 


Homo sapiens 


transcription factor NYD-splO 


1486 


85 


1111 


gi21040409 


Homo sapiens 


regulatory factor X, 4 (influences 
HLA class II expression) 


1486 


85 


1112 


gi20269720 


Homo sapiens 


neuropilin and tolloid like-1 


1894 


99 


1112 


gi20269724 


Mus musculus 


neuropilin and tolloid like-1 


1839 


96 


1112 


ABB55774 


Homo sapiens 


Human polypeptide SEQ ID NO 154. 


1057 


58 


1113 


AAB61150 


Homo sapiens 


Human NOV19 protein. 


758 


98 


1113 


AAB61149 


Homo sapiens 


Human NOV18 protein. 


758 


98 


1113 


AAY33297 


Homo sapiens 


Human membrane spanning protein 
MSP-4. 


758 


98 


1114 


ABB90021 


Homo sapiens 


Human polypeptide SEQ ID NO 
2397. 


476 


89 


1114 


ABB 11874 


Homo sapiens 


Human secreted protein homologue, 
SEQ IDNO:2244. 


476 


89 


1114 


AAY94914 


Homo sapiens 


Human secreted protein clone 
pw337 6 protein sequence SEQ ID 
NO:34. 


476 


89 


1115 


AAG72407 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2088. 


1281 


100 


1115 


AAG72267 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1948. 


1281 


100 


1115 


gi21928991 


Homo sapiens 


seven transmembrane helix receptor 


1274 


99 


1116 


AAU80496 


Homo sapiens 


Human G-coupled receptor 
(GCREC) protein, Seq ID No 4. 


1905 


98 


1116 


ABP51568 


Homo sapiens 


Human G protein coupled receptor 
SEQ ID NO: 18. 


1543 


98 


1116 


AAU85147 


Homo sapiens 


G-coupled olfactory receptor #8. 


1538 


100 


1117 


gi5802817 


Homo sapiens 


envelope protein 


479 


77 


1117 


gi3 150438 


Human 
endogenous 
retrovirus K 


pol-env 


466 


77 


1117 


gi9558705 


Homo sapiens 


envelope 


466 


77 


1118 


AAE15241 


Homo sapiens 


Human RNA metabolism protein-4 
(RMEP-4). 


514 


100 


1118 


ABB89053 


Homo sapiens 


Human polypeptide SEQ ID NO 
1429. 


514 


100 


1118 


AAG89341 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 461. 


501 


99 


1119 


AAE23979 


Homo sapiens 


Human LP217 secreted protein. 


4563 


50 


1119 


AAB20155 


Homo sapiens 


Secreted protein SECP1. 


4522 


50 


1119 


AAM39295 


Homo sapiens 


Human polypeptide SEQ ID NO 
2440. 


4518 


50 


1120 


AAB28199 


Homo sapiens 


Human HMG-17 non histone 
chromosomal protein. 


429 


94 
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1120 


gi32329 


Homo sapiens 


put. HMG- 17 protein 


429 


94 


1120 


gi306864 


Homo sapiens 


high mobility group protein 17 


429 


94 


1121 


ABP43105 


Homo sapiens 


Human ovarian antigen HVCBB19, 
SEQ ID NO:4237. 


456 


69 


1121 


AAE13797 


Homo sapiens 


Human lung tumour-specific protein 
SALT-T8. 


456 


69 


1121 


AAB44456 


Homo sapiens 


Human lung tumour-specific antigen 
encoded by cDNA #71. 


456 


69 


1122 


AAM93711 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3650. 


2974 


99 


1122 


ABB89767 


Homo sapiens 


Human polypeptide SEQ ID NO 
2143. 


2214 


97 


1122 


gi7303971 


Drosophila 
melanogaster 


CG8230-PA 


1154 


41 


1123 


AAU76036 


Homo sapiens 


Human sugar transporter- 1 (HST-1) 
protein sequence. 


1055 


99 


1123 


AAB60112 


Homo sapiens 


Human transport protein TPPT-32. 


775 


100 


1123 


AAB61903 


Homo sapiens 


Atherosclerosis-associated 
polypeptide. 


380 


48 


1124 


AAR28120 


Homo sapiens 


NKG2 transmembrane protein-D. 


725 


95 


1124 


ABB11846 


Homo sapiens 


Human integral membrane protein 
homologue, SEQ ID NO:2216. 


722 


94 


1124 


gi35063 


Homo sapiens 


Type II integral membrane protein 


722 


94 


1125 


AAM78418 


Homo sapiens 


Human protein SEQ ID NO 1080. 


1878 


94 | 


1125 


gi21518639 


Homo sapiens 


TSLCl-like2 


1870 


97 


1125 


gil9068139 


Mus mus cuius 


membrane glycoprotein 


1849 


96 


1126 


AAB94738 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15776. 


3079 


99 


1126 


AAM41695 


Homo sapiens 


Human polypeptide SEQ ID NO 
6626. 


2456 


99 


1126 


AAM39909 


Homo sapiens 


Human polypeptide SEQ ID NO 
3054. 


1272 


100 


1127 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:148. 


678 


99 


1127 


AAB80437 


Homo sapiens 


Gene #20 associated peptide #1. 


381 


98 


1127 


AAM78175 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
38481. 


365 


100 


1128 


gi291529 


Mouse 

cytomegalovirus 
1 


tegument protein 


89 


24 


1128 


gil4573798 


Caenorhabditis 
elegans 


C. elegans SRD-60 protein 
(corresponding sequence C13B7.3) 


85 


28 


1128 


gil91992 


Mus musculus 


APC 


79 


20 


1129 


gi20987535 


Mus musculus 


RIKEN cDNA 3300002C04 gene 


741 


67 


1129 


gi20799661 


Mus musculus 


mucolipin-2 


741 


67 


1129 


AAB93412 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12616. 


632 


55 


1130 


gil9354289 


Mus musculus 


RIKEN cDNA 2010107G23 gene 


111 


42 


1130 


ABB79328 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 125. 


107 


42 


1130 


gil5488920 


Homo sapiens 


Similar to RIKEN cDNA 
2010107G23 gene 


107 


42 


1131 


AAU70675 


Homo sapiens 


Human otoferlin. 


2432 


42 
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1131 


gi4588470 


Homo sapiens 


otoferlin 


2432 


42 


1131 


AAU70673 


Homo sapiens 


Human otoferlin #2. 


2420 


42 


1132 


gi20071179 


Homo sapiens 


monocyte to macrophage 
differentiation-associated 


1032 


77 


1132 


gil006665 


Homo sapiens 


expression associated with monocyte 
to macrophage differentiation 


1028 


77 


1132 


gil 83 14462 


Mus musculus 


monocyte to macrophage 
differentiation-associated 


1028 


77 


1133 


AAU95752 


Homo sapiens 


Human olfactory and pheromone G 
protein-coupled receptor #239. 


1522 


92 


1133 


AAU85278 


Homo sapiens 


G-coupled olfactory receptor #139. 


1499 


91 


1133 


AAU24658 


Homo sapiens 


Human olfactory receptor 
AOLFR156. 


1499 


91 


1134 


AAE13275 


Homo sapiens 


Human transporters and ion channels 
(TRICH)-2. 


1472 


80 


1134 


gil7384411 


Homo sapiens 


bA251017.3 (similar to aquaporin 7) 


1469 


80 


1134 


AAY70455 


Homo sapiens 


Human membrane channel protein-5 
(MECHP-5). 


1295 


73 


1135 


ABG42409 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
32074. 


317 


100 


1135 


AAM32826 


Homo sapiens 


Peptide #6863 encoded by probe for 
measuring placental gene expression. 


317 


100 


1135 


AAM19638 


Homo sapiens 


Peptide #6072 encoded by probe for 
measuring cervical gene expression. 


317 


100 


1136 


AAB47977 


Homo sapiens 


BCY5. 


2663 


99 


1136 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 


2574 


97 


1136 


gi2 1265 165 


Homo sapiens 


solute carrier family 16 
(monocarboxylic acid transporters), 
member 7 


602 


31 


1137 


ABB08456 


Homo sapiens 


Human tumour specific antigenic 
peptide #2. 


705 


94 


1137 


AAU08592 


Homo sapiens 


Human V-ATPase 16kE>a subunit. 


705 


94 


1137 


gil4424534 


Homo sapiens 


ATPase, H+ transporting, lysosomal 
(vacuolar proton pump) 1 6kD 


705 


94 


1138 


gil6741167 


Mus musculus 


RIKEN cDNA 0610010D20 gene 


1506 


87 


1138 


gil5080314 


Homo sapiens 


Similar to RIKEN cDNA 
0610010D20 gene 


514 


100 


1138 


gil 05 80053 


Halobacterium 
sp. NRC-1 


dihydrodipicolinate synthase; DapA 


375 


33 


1139 


AA014199 


Homo sapiens 


Human transporter and ion channel 
TRICH-16. 


1425 


85 


1139 


ABB80588 


Homo sapiens 


Human sbgl020829SGLT protein. 


1425 


85 


1139 


AAE06614 


Homo sapiens 


Human protein having hydrophobic 
domain, HP 03 974. 


1425 1 


85 


1140 


ABB90752 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 236. 


1890 


76 


1140 


gi6708478 


Mus musculus 


formin-like protein 


1559 


65 


1140 


gi4101720 


Mus musculus 


lymphocyte specific formin related 
protein 


1532 


65 


1141 


AAB94131 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 14389. 


996 


96 


1141 


gi497984 


Bos taurus 


Ac45 


317 


39 


1141 


AAM93671 


Homo sapiens 


Human polypeptide, SEQ ID NO: 


307 


36 
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3556. 






1142 


AAG67137 


Homo sapiens 


Amino acid sequence of a human 
enzyme. 


2701 


96 


1142 


gi2 1707691 


Mus musculus 


RIKEN cDNA 1500002020 gene 


2645 


93 


1142 


AAB94358 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14883. 


2579 


96 


1143 


AAW54370 


Homo sapiens 


G-protein coupled receptor 
HLTEX11. 


1815 


100 


1143 


AAB64854 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 36 SEQ ID 
NO: 140. 


1792 


100 


1143 


AAB64853 


Homo sapiens 


Gene 36 human secreted protein 
homologous amino acid sequence 
#139. 


1792 


100 


1144 


ABB90324 


Homo sapiens 


Human polypeptide SEQ ID NO 
2700. 


1321 


100 


1144 


AAU82004 


Homo sapiens 


Human secreted protein SECP30. 


989 


78 


1144 


AAM95005 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 3663. 


548 


85 


1145 


AAY13458 


Homo sapiens 


Amino acid sequence of human 
Fe65. 


3759 


100 


1145 


gi2734083 


Homo sapiens 


stat-like protein 


3759 


100 


1145 


gi3924936 


Homo sapiens 


Fe65 protein 


3759 


100 


1146 


AAB08900 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:57. 


845 


90 


1146 


AAY27071 


Homo sapiens 


Human JWA protein. 


845 


90 


1146 


AAW75110 


Homo sapiens 


Human secreted protein encoded by 
gene 54 clone HETGL41. 


845 


90 


1147 


AAM93733 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3697. 


764 


87 


1147 


gi2970431 


Florometra 
serratissima 


NADH dehydrogenase subunit 4 


91 


31 


1147 


gil5128604 


Inversidens 
japanensis 


NADH dehydrogenase subunit 4 


77 


29 


1148 


AAB93562 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12957. 


2402 


100 


1148 


gi21626993 


Drosophila 
melanogaster 


CG15078-PA 


990 


39 


1148 


gil 7945442 


Drosophila 
melanogaster 


RE18318p 


990 


39 


1149 


ABB89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


1640 


99 


1149 


ABB89833 


Homo sapiens 


Human polypeptide SEQ ID NO 
2209. 


838 


97 


1149 


gil6359249 


Mus musculus 


RIKEN cDNA 1300010M03 gene 


630 


36 


1150 


ABB89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


1640 


99 


1150 


ABB89833 


Homo sapiens 


Human polypeptide SEQ ID NO 
2209. 


838 


97 


1150 


gil6359249 


Mus musculus 


RIKEN cDNA 1300010M03 gene 


564 


35 


1151 


ABB78999 


Homo sapiens 


Human 9-27 protein sequence SEQ 
ID NO:4483. 


569 


93 i 


1151 


gil 177476 


Homo sapiens 


interferon-inducible protein 


569 


93 


1151 


gil2654159 


Homo sapiens 


interferon induced transmembrane 
protein 1 (9-27) 


569 


93 1 
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1152 


AAU85153 


Homo sapiens 


G-coupled olfactory receptor #14. 


1730 


100 


1152 


AAU24529 


Homo sapiens 


Human olfactory receptor AOLFR14. 


1730 


100 


1152 


gi21928719 


Homo sapiens 


seven transmembrane helix receptor 


1630 


100 


1153 


gi784997 


Homo sapiens 


homologue to Drosophila tumour 
supressor gene 


5300 


95 


1153 


g il944491 


Homo sapiens 


homologue of the murine Llglh gene 


4891 


90 


1153 


gi22347352 


Rattus 
norvegicus 


RGL1 


4878 


88 


1154 


AAM79219 


Homo sapiens 


Human protein SEQ ID NO 1881. 


325 


64 


1155 


AAG63804 


Homo sapiens 


Amino acid sequence of a human 
amino acid transporter. 


2487 


100 


1155 


gi9309293 


Homo sapiens 


asc-type amino acid transporter 1 


2487 


100 


1155 


gil5277644 


Homo sapiens 


amino acid transporter 


2487 


100 


1156 


gi6760373 


Homo sapiens 


ODZ3 


2323 


100 


1156 


gi4760780 


Mus musculus 


Ten-m3 


2248 


96 


1156 


AAU09891 


Homo sapiens 


Human heregulin, gamma-HRG. 


1202 


52 


1157 


AAY51559 


Homo sapiens 


Human RGDI protein. 


724 


81 


1157 


AAW74804 


Homo sapiens 


Human secreted protein encoded by 
gene 75 clone HBIAB39. 


724 


81 


1157 


gi4877285 


Homo sapiens 


prenylated Rab acceptor 1 (PRA1) 


724 


81 


1158 


gil780976 


Human 
endogenous 
retrovirus K 


protease 


912 


59 


1158 


gi5802824 


Homo sapiens 


Gag-Pro-Pol protein 


909 


59 


1158 


gi5802814 


Homo sapiens 


Gag-Pro-Pol-Env protein 


906 


58 


1159 


AAY73339 


Homo sapiens 


HTRM clone 2056042 protein 
sequence. 


867 


80 


1159 


gil3111941 


Homo sapiens 


vesicle-associated soluble NSF 
attachment protein receptor (v- 
SNARE; homolog of S. cerevisiae 
VTI1) 


867 


80 


1159 


gi3861488 


Homo sapiens 


vesicle soluble NSF attachment 
protein receptor VTI2 


867 


80 


1160 


gil922891 


Mus musculus 


alpha 3B chain of laminin-5 


10336 


75 


1160 


gi5777581 


Homo sapiens 


alpha 3B chain of laminin-5 


9398 


99 


1160 


AAB48459 


Homo sapiens 


Human laminin 5 polypeptide, SEQ 
ID NO: 8. 


8690 


100 


1161 


ABP43534 


Homo sapiens 


Human secreted protein (SCEP) 58. 


1984 


91 


1161 


AAG67516 


Homo sapiens 


Amino acid sequence of a human 
secreted polypeptide. 


1984 


91 


1161 


AAE01167 


Homo sapiens 


Human gene 4 encoded secreted 
protein HKAAV61, SEQ ID NO:68. 


1984 


91 


1162 


AAM42034 


Homo sapiens 


Human polypeptide SEQ ID NO 
6965. 


900 


99 


1162 


AAM40248 


Homo sapiens 


Human polypeptide SEQ ID NO 
3393. 


821 


100 


1162 


AAP60254 


Homo sapiens 


Interferon-pseudo-omega-2. 


800 


98 


1163 


AAB 19924 


Homo sapiens 


Human interleukin-1 Hy2 (long 
version). 


818 


100 


1163 


AAB19923 


Homo sapiens 


Human interleukin-1 Hy2 (extended 
form, partial sequence). 


818 


100 


1163 


AAB19922 


Homo sapiens 


Human interleukin-1 Hy2 (short 
version). 


818 


100 


1164 


AAB37512 


Homo sapiens 


Human VEGF C subunit SEQ ID 
NO: 49. 


1061 


90 
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1164 


AAY57035 


Homo sapiens 


Human A215 amino acid sequence. 


1061 


90 


1164 


AAY94803 


Homo sapiens 


Human VEGF. 


1061 


90 


1165 


ABB84974 


Homo sapiens 


Human PR04342 protein sequence 
SEQIDNO:316. 


770 


99 


1165 


ABB95580 


Homo sapiens 


Human angiogenesis related protein 
PR04342 SEQ ID NO: 316. 


770 


99 


1165 


AAB66664 


Homo sapiens 


Protein encoded by extended 

r*\ T TT~?T firiAUT "TVKTA 1*1 

B2HFLS20W cDNA library 
sequence #2. 


770 


99 


1166 


gil321816 


Gorilla gorilla 


interleukin-8 receptor type B 


602 


90 


1166 


AAU 10557 


Homo sapiens 


Human interleukin 8 receptor beta 
(IL8RB) polypeptide. 


599 


88 


1166 


AAU80482 


Homo sapiens 


Human CXCR2 receptor #1. 


599 


88 


1167 


gi 1160967 


Homo sapiens 


palmitoyl-protein thioesterase 


1463 


92 


1167 


gil3 14355 


Homo sapiens 


palmitoyl protein thioesterase 


1463 


92 


1167 


gil4250054 


Homo sapiens 


palmitoyl-protein thioesterase 1 
(ceroid-lipofuscinosis, neuronal 1, 
infantile) 


1463 


92 


1168 


gil77814 


Homo sapiens 


alpha- 1-antitrypsin-related protein 


1952 


90 


1168 


AAY78890 


Homo sapiens 


Human alphal -antitrypsin amino 
acid sequence. 


1450 


69 


1168 


AAW56709 


Homo sapiens 


Amino acid sequence of the alpha- 1- 
antitrypsin. 


1450 


69 


1169 


AAO 12931 


Homo sapiens 


Human polypeptide SEQ ID NO 
26823. 


346 


100 


1169 


AAO02697 


Homo sapiens 


Human polypeptide SEQ ID NO 
16589. 


143 


66 


1169 


AAO08307 


Homo sapiens 


Human polypeptide SEQ ID NO 
22199. 


137 


80 


1170 


AAR15222 


Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 


635 


100 


1170 


gi32402 


Homo sapiens 


HP-l (AA 1-94) 


493 


100 


1170 


gil81529 


Homo sapiens 


defensin l 


493 


100 


1171 


AAU98513 


Homo sapiens 


Transmembrane receptor-like protein 
pTB2l85. 


71 


46 


1171 


AAU98512 


Homo sapiens 


Transmembrane receptor-like protein 
pTB2l84. 


71 


46 


1171 


gi 100 1697 


Synechocystis 
sp. PCC 6803 


sensory transduction histidine kinase 


67 


37 


1172 


AAE14454 


Homo sapiens 


Human protein phosphatase-4. 


2131 


99 


1172 


gi2665458 


Mus museums 


protein-tyrosine-phosphatase 


1361 


63 


1172 


gi22328117 


Homo sapiens 


similar to protein-tyrosine- 
phosphatase homolog 
DKFZp566K0524.l - human 
(fragment) 


729 


96 


1173 


AAM25683 


Homo sapiens 


Human protein sequence SEQ ID 
NO:ll98. 


956 


96 


1173 


AAY48226 


Homo sapiens 


Human prostate cancer-associated 
protein 12. 


yDO 


96 


1173 


ABP41699 


Homo sapiens 


Human ovarian antigen HFTCG52, 
SEQ IDNO:2831. 


950 


94 


1174 


gi20379895 


Homo sapiens 


Similar to small inducible cytokine 
A4 


124 


77 


1174 


AAM52447 1 Homo sapiens 


HIV Nefl fusion protein #14. 


122 


100 


1174 


AAM52446 | Homo sapiens 


HIV Nefl fusion protein #13. 


122 


100 
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1175 


gi20379895 


Homo sapiens 


Similar to small inducible cytokine 
A4 


305 


100 


1175 


AAM52447 


Homo sapiens 


HIV Nefl fusion protein #14. 


123 


92 


1175 


AAM52446 


Homo sapiens 


HIV Nefl fusion protein #13. 


123 


92 


1176 


AAG03315 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7396. 


314 


100 


1176 


gil6415877 


Octopus salutii 


cytochrome oxidase subunit III 


70 


28 


1176 


gi602047 


Octopus 
rubescens 


cytochrome oxidase subunit III 


69 


28 


1177 


AAG03757 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7838. 


453 


89 


1177 


gil78836 


Homo sapiens 


apolipoprotein C-II 


453 


89 


1177 


gi757915 


Homo sapiens 


apoCII protein 


453 


89 


1178 


AAO07986 


Homo sapiens 


Human polypeptide SEQ ID NO 
21878. 


73 


31 


1178 


gi4884689 


Neisseria 
meningitidis 


lactoferrin-binding protein precursor 


70 


34 


1178 


gil72002 


Saccharomyces 
cerevisiae 


homologue of bacterial MutS protein 


69 


24 


1179 


AAB60502 


Homo sapiens 


Human cell cycle and proliferation 
protein CCYPR-50, SEQ ID NO:50. 


1656 


92 


1179 


AAB12144 


Homo sapiens 


Hydrophobic domain protein isolated 
from WERI-RB cells. 


1649 


92 


1179 


gil9880267 


Homo sapiens 


rnetallo phosphoesterase 


1649 


92 


1180 


AAU91402 


Homo sapiens 


Human secreted protein sequence 
#55. 


435 


100 


1180 


AAU91376 


Homo sapiens 


Human secreted protein sequence 
#29. 


435 


100 


1180 


AAU91352 


Homo sapiens 


Human secreted protein sequence #5. 


435 


100 


1181 


AAG01183 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5264. 


275 


94 


1181 


AAO04719 


Homo sapiens 


Human polypeptide SEQ ID NO 
18611. 


99 


36 


1181 


AAO03100 


Homo sapiens 


Human polypeptide SEQ ID NO 
16992. 


99 


42 


1182 


ABB12063 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2433. 


326 


100 


1182 


AAE06730 


Homo sapiens 


Human CASB765 protein. 


200 


100 


1182 


AAU81960 


Homo sapiens 


Human PR0536. 


141 


81 


1183 


AAB75349 


Homo sapiens 


Human secreted protein #8. 


748 


98 


1183 


AAG74502 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:5266. 


748 


98 


1183 


AAU27661 


Homo sapiens 


Human protein AFP485790. 


748 


98 


1184 


ABB84887 


Homo sapiens 


Human PR0791 protein sequence 
SEQ ID NO: 142. 


1025 


90 


1184 


ABB90305 


Homo sapiens 


Human polypeptide SEQ ID NO 
2681. 


1025 


90 


1184 


ABB95493 


Homo sapiens 


Human angiogenesis related protein 
PR0791 SEQ ID NO: 142. 


1 f\K 




1185 


gi2745733 


Homo sapiens 


S2P 


75 


36 


1185 


gi4164134 


Homo sapiens 


SP2 metalloprotease 


75 


36 


1185 


gil5865970 


Varanus 
spenceri 


NADH dehydrogenase subunit 2 


73 


27 


1186 


AAU69480 


Homo sapiens 


Human purified secretory 
polypeptide #49. 


283 


92 
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1186 


gi7770223 


Homo sapiens 


PR02714 


281 


84 


1187 


ABB09144 


Homo sapiens 


Human phospholipase A2 protein 
cPLA2 SEQ IDNO:l. 


4446 


99 


1187 


ABB07494 


Homo sapiens 


Human lipid metabolism molecule 
(LMM) polypeptide (ID: 
1281946CD1). 


4407 


99 


1187 


AAE05958 


Homo sapiens 


Human phospholipase-like protein 
#3. 


4215 


99 


1188 


gil4717809 


Mus musculus 


capsule seleno-protein 


96 


30 


1188 


gil4717800 


Mus musculus 


seleno-protein 


96 


30 


1188 


gi2992470 


Mus sp. 


mitochondrial capsule selenoprotein; 
MCS 


96 


30 


1189 


AAE05095 


Homo sapiens 


Human inter-alpha trypsin inhibitor 
(ITI) light chain. 


1775 


94 


1189 


AAM79366 


Homo sapiens 


Human protein SEQ ID NO 3012. 


1775 


94 


1189 


AAM78382 


Homo sapiens 


Human protein SEQ ID NO 1044. 


1775 


94 


1190 


gi673422 


Homo sapiens 


T-cell receptor alpha-chain 


1323 


93 


1190 


ABB95403 


Homo sapiens 


Human P501S specific T cell clone 
4E5 Va chain protein SEQ ID NO 
906. 


1095 


77 


1190 


AAM01298 


Homo sapiens 


P501S-specific T cell clone 4E5 Va 
chain T cell receptor amino acid. 


1095 


77 


1191 


gi673422 


Homo sapiens 


T-cell receptor alpha-chain 


859 


95 


1191 


gi623119 


Macaca mulatta 


T-cell receptor alpha 


603 


86 


1191 


ABB95403 


Homo sapiens 


Human P501S specific T cell clone 
4E5 Va chain protein SEQ ID NO 
906. 


593 


65 


1192 


gi53861 


Mus musculus 


Q300 protein (AA 1-77) 


69 


41 


1193 


ABB89241 


Homo sapiens 


Human polypeptide SEQ ID NO 
1617. 


247 


69 


1193 


AAB08894 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 4 SEQ ID NO:5 1 . 


208 


57 


1193 


gi21070180 


Danio rerio 


envelope protein 


102 


40 


1194 


AAG03963 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 8044. 


417 


80 


1194 


ABB10412 


Homo sapiens 


Human cDNA SEQ ID NO: 720. 


289 


100 


1194 


ABB10168 


Homo sapiens 


Human cDNA SEQ ID NO: 476. 


289 


100 


1196 


AAM41429 


Homo sapiens 


Human polypeptide SEQ ID NO 
6360. 


328 


100 


1196 


AAM39643 


Homo sapiens 


Human polypeptide SEQ ID NO 
2788. 


328 


100 


1196 


AAM79610 


Homo sapiens 


Human protein SEQ ID NO 3256. 


328 


100 


1197 


AAE19830 


Homo sapiens 


Human patched (Ptch) protein. 


213 


63 


1197 


AAB67163 


Homo sapiens 


Human patched protein. 


213 


63 


1197 


AAW47157 


Homo sapiens 


Nevoid basal cell carcinoma 
syndrome (NBCCS) (PTC) protein. 


213 


63 


1198 


AAB25674 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:63. 


643 


84 


1198 


AAB36613 


Homo sapiens 


Human FLEXHT-35 protein 
sequence SEQ ID NO:35. 


626 


80 


1198 


gil4603247 


Homo sapiens 


Similar to RIKEN cDNA 
5730409G15 gene 


626 


80 


1199 


AAO04718 


Homo sapiens 


Human polypeptide SEQ ID NO 
18610. 


160 


75 


1199 


AAO04681 


Homo sapiens 


Human polypeptide SEQ ID NO 


160 


75 
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18573. 






1199 


AAO02320 


Homo sapiens 


Human polypeptide SEQ ID NO 
16212. 


154 


73 


1200 


AAU12292 


Homo sapiens 


Human PRO6027 polypeptide 
sequence. 


990 


98 


1200 


AAU27673 


Homo sapiens 


Human protein AFP235412. 


987 


99 


1200 


gi7303340 


Drosophila 
melanogaster 


CG4676-PA 


189 


30 


1201 


ABG60015 


Homo sapiens 


Human DITHP polypeptide #73. 


281 


66 


1201 


ABG60059 


Homo sapiens 


Human DITHP polypeptide #117. 


249 


60 


1201 


gi21928245 


Homo sapiens 


seven transmembrane helix receptor 


218 


60 


1202 


ABG60015 


Homo sapiens 


Human DITHP polypeptide #73. 


276 


68 


1202 


ABG60059 


Homo sapiens 


Human DITHP polypeptide #117. 


257 


47 


1202 


AAB43928 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1373. 


223 


55 


1203 


AAE04368 


Homo sapiens 


Human kinase (PKIN)-9. 


120 


85 


1203 


AAM79153 


Homo sapiens 


Human protein SEQ ID NO 1815. 


120 


85 


1203 


AAY68784 


Homo sapiens 


Amino acid sequence of a human 
phosphorylation effector PHSP-16. 


120 


85 


1204 


AAU74760 


Homo sapiens 


Human protease PRTS-20 protein 
sequence. 


1042 


98 


1204 


AAE04733 


Homo sapiens 


Human protease homologue #1. 


1037 


98 


1204 


AAE04734 


Homo sapiens 


Human protease homologue #2. 


1019 


97 


1205 


AAW67842 


Homo sapiens 


Human secreted protein encoded by 
gene 36 clone HODCL36. 


451 


76 


1205 


gi7717366 


Homo sapiens 


tryptophan rich protein, congenital 
heart disease 5 protein CHD5 


451 


76 


1205 


gil946205 


Homo sapiens 


congenital heart disease 5 protein 


445 


75 


1206 


AAG01971 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6052. 


314 


100 


1206 


gi3 183989 


Lycopersicon 
esculentum 


P69E protein 


78 


41 


1206 


gi4200340 


Lycopersicon 
esculentum 


P69D protein 


77 


41 


1207 


gil4043211 


Homo sapiens 


Similar to RIKEN cDNA 
4931428F04 gene 


889 


83 


1207 


gil 1276027 


Rattus 
norvegicus 


LSC 


92 


30 


1207 


gil389756 


Mus musculus 


Lsc 


92 


30 


1208 


AAY91653 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 62 SEQ ID 
NO:326. 


813 


79 


1208 


AAY91512 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 62 SEQ ID 
NO:185. 


813 


79 


1208 


AAY44720 


Homo sapiens 


Human immune system molecule, 
ISMO-1. 


747 


78 


1209 


AAY27648 


Homo sapiens 


Human secreted protein encoded by 
gene No. 82. 


322 


98 


1209 


gi7959897 


Homo sapiens 


PR02379 


68 


39 


1211 


AAU19622 


Homo sapiens 


Human diagnostic and therapeutic 
polypeptide (DITHP) #208. 


339 


62 


1211 


AAB08765 


Homo sapiens 


A human leukocyte and blood related 
protein (LBAP). 


339 


62 


1211 


AAB74718 


Homo sapiens 


Human membrane associated protein 


314 


66 
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MEMAP-24. 






1212 


AAG65893 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 90060. 


1901 


71 


1212 


AAB80264 


Homo sapiens 


Human PR0332 protein. 


928 


41 


1212 


AAU12356 


Homo sapiens 


Human PR0332 polypeptide 
sequence. 


928 


41 


1213 


AAE24240 


Homo sapiens 


Human 23566 (carboxypeptidase) 
protein. 


1551 


99 


1213 


AAG66547 


Homo sapiens 


Human secreted 

metallocarboxypeptidase-like 

polypeptide. 


1551 


99 


1213 


AAG66565 


Homo sapiens 


Human secreted 

metallocarboxypeptidase-like variant 
polypeptide. 


1548 


98 


1214 


AAB82317 


Homo sapiens 


Human immunoglobulin receptor 
IRTA4 protein. 


528 


100 


1214 


AAB85464 


Homo sapiens 


Human immunoglobulin domain- 
containing polypeptide. 


528 


100 


1214 


gil5528833 


Homo sapiens 


Fc receptor-like protein 2 


528 


100 


1215 


ABP43492 


Homo sapiens 


Human secreted protein (SCEP) 16. 


1033 


100 


1215 


gi20380668 


Homo sapiens 


similar to MANNOSE-P- 
DOLICHOL UTILIZATION 
DEFECT 1 PROTEIN HOMOLOG 


1033 


100 


1215 


AAU27663 


Homo sapiens 


Human protein AFP285042. 


905 


90 


1216 


AAU97104 


Homo sapiens 


Human MK61 protein, hMK61T4. 


979 


99 


1216 


AAU97106 


Homo sapiens 


Human MK61 protein, hMK61T6. 


846 


88 


1216 


AAU97101 


Homo sapiens 


Human MK61 protein, hMK61Tl. 


683 


83 


1217 


AAU97106 


Homo sapiens 


Human MK61 protein, hMK61T6. 


833 


94 


1217 


AAU97104 


Homo sapiens 


Human MK61 protein, hMK61T4. 


805 


85 


1217 


AAU97101 


Homo sapiens 


Human MK61 protein, hMK61Tl. 


529 


92 


1218 


AAB92697 


Homo sapiens 


Human protein sequence SEQ ID 
NO:11091. 


71 


44 


1218 


AAB92547 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10726. 


71 


44 


1218 


gi386638 


Mus sp. 


mesenchyme fork head 1 


70 


34 


1220 


gil 171589 


Plasmodium 
falciparum 


frameshift 


74 


33 


1220 


gi4512010 


Escherichia coli 


OrfY 


66 


50 


1221 


AA015424 


Homo sapiens 


Human genset metabolic gene 
(GMG-10) protein. 


1462 


94 


1221 


AAB50371 


Homo sapiens 


Human ZACRP7. 


1462 


94 


1221 


AAE09444 


Homo sapiens 


Human SBhACRP30a protein #2. 


1462 


94 


1222 


ABG40131 


Homo sapiens 


Human peptide encoded by genome- 
derived single exori probe SEQ ID 
29796. 


71 


34 


1222 


AAM05935 


Homo sapiens 


Peptide #4617 encoded by probe for 
measuring breast gene expression. 


71 


34 


1222 


AAM30815 


Homo sapiens 


Peptide #4852 encoded by probe for 
measuring placental gene expression. 


71 


34 


1223 


gi8850245 


Homo sapiens 


activated p21cdc42Hs kinase 


5605 


100 


1223 


gi2921447 


Mus musculus 


non-receptor protein tyrosine kinase 
Ack 


5164 


92 


1223 


gi2078388 


Bos taurus 


Cdc42-associated tyrosine kinase 
ACK-2 


3503 


91 


1224 


AAB84696 


Homo sapiens 


Amino acid sequence of a human 


358 


33 
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zkunlO polypeptide. 






1224 


gi211622 


Gallus gallus 


alpha-3 collagen type VI 


300 


32 


1224 


ABP41277 


Homo sapiens 


Human ovarian antigen HOCQH66, 
SEQ ID NO:2409. 


281 


OA 

30 


1225 


AAB66065 


Homo sapiens 


Human TANGO 294. 


J.L LO 


QQ 

yy 


1225 


AAE11931 


Homo sapiens 


Human CGI 62 (or C59) lipase 
protein #2. 


Olio 


QQ 

yy 


1225 


AAB66067 


Homo sapiens 


Human TANGO 294 mature protein. 


2015 


99 


1226 


AAM06483 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
214. 


283 


64 


1227 


AAR60521 


Homo sapiens 


Human tetranectin. 


869 


83 


1227 


gi37409 


Homo sapiens 


Tetranectin 


869 


83 


1227 


gi825722 


Homo sapiens 


tetranectin 


869 


83 


1228 


gi5790207 


Taenia saginata 


ATPase subunit 6 


68 


32 


1228 


gi8778323 


Arabidopsis 
thaliana 


F14J16.24 


66 


52 


1229 


AAE01790 


Homo sapiens 


Human gene 21 encoded secreted 
protein HDPTW65, SEQ ID NO: 111. 


142 


59 


1229 


AAE01838 


Homo sapiens 


Human gene 21 encoded secreted 
protein HDPTW65, SEQ ID NO: 159. 


140 


57 


1229 


ABB 11479 


Homo sapiens 


Human reverse transcriptase 
homologue, SEQ ID NO: 1849. 


91 


55 


1230 


ABB50466 


Homo sapiens 


Human secreted protein encoded by 
gene 166 SEQ ID NO:414. 


333 


100 


1230 


AAW88699 


Homo sapiens 


Secreted protein encoded by gene 
166 clone HCEQA68. 


333 


100 


1230 


AAU83594 


Homo sapiens 


Human PRO protein, Seq ID No 6. 


327 


100 


1231 


AAG00381 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4462. 


263 


91 


1231 


AAU 19357 


Homo sapiens 


Human G protein-coupled receptor 
nGPCR-2290. 


119 


52 


1231 


AA013251 


Homo sapiens 


Human polypeptide SEQ ID NO 
27143. 


105 


39 


1232 


AAM06558 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
289. 


301 


98 


1233 


AAM06562 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
293. 


383 


100 

i 


1233 


AAU03519 


Homo sapiens 


Human protein kinase #19. 


76 


34 


1234 


gil3561474 


Mertensiella 
luschani atifi 


NADH dehydrogenase subunit 2 


70 


25 


1234 


gil3561462 


Mertensiella 
luschani billae 


NADH dehydrogenase subunit 2 


70 


26 


1234 


gil3561454 


Salamandra 
infraimmaculata 


NADH dehydrogenase subunit 2 


69 


32 


1236 


gi992917 


Glycine max 


acetyl CoA carboxylase 


67 


53 


1236 


gil066857 


Glycine max 


acetyl-CoA carboxylase 


65 


53 


1237 


AAE20197 


Homo sapiens 


Human TIE ligand, FLS139 protein. 


1947 


94 


1237 


AAE19828 


Homo sapiens 


rluman rLo 1 5y ligand protein. 




y-r 


1237 


ABB80020 


Homo sapiens 


TIE ligand FLS139 amino acid. 


1947 


94 


1238 


AAM06568 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
299. 


142 


57 


1245 


gi3873768 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01461 (7TM chemoreceptor), 
Score=28.9, E-value=1.5e-16, N=2 


69 


28 


1245 


AAU83682 


Homo sapiens 


Human PRO protein, Seq ID No 182. 


67 


39 
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1245 


AAU29165 


Homo sapiens 


Human PRO polypeptide sequence 
#142. 


67 


39 


1247 


ABB89102 


Homo sapiens 


Human polypeptide SEQ ID NO 
1478. 


517 


100 


1247 


AAM38788 


Homo sapiens 


Human polypeptide SEQ ID NO 
1933. 


83 


21 


1247 


gi21 109777 


Xanthomonas 
axonopodis pv. 
citri str. 306 


transcriptional regulator 


83 


28 


1248 


gil5292277 


Drosophila 
melanogaster 


LD45324p 


70 


26 


1249 


gi298848 


Equine 
herpesvirus 1 


glycoprotein C homolog 


67 


41 


1249 


gi330808 


Equine 
herpesvirus 1 


membrane glycoprotein C 


67 


41 


1249 


gi330899 


Equine 
herpesvirus 1 


glycoprotein gpl3 precursor 


67 


41 


1251 


AAO 11677 


Homo sapiens 


Human polypeptide SEQ ID NO 
25569. 


74 


54 


1255 


gil6551105 


Crotalus 
adamanteus 


NADH dehydrogenase subunit 5 


66 


28 


1257 


ABB53264 


Homo sapiens 


Human polypeptide #4. 


3044 


100 


1257 


gil 8565270 


Homo sapiens 


Lib 


3028 


99 


1257 


gil 8565266 


Rattus 
norvegicus 


Lib 


2551 


83 


1259 


gi335876 


Vesicular 
stomatitis virus 


matrix (M) protein 


71 


26 


1259 


gi61842 


Vesicular 
stomatitis virus 


M protein (aa 1-237) 


71 


26 


1259 


gi336034 


Vesicular 
stomatitis virus 


M-protein 


71 


26 


1261 


AAB95686 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18490. 


983 


99 


1261 


gil 8446901 


Drosophila 
melanogaster 


AT07234p 


80 


32 


1262 


gi9858058 


Tortula ruralis 


rehydrin 


85 


29 


1262 


gil9882261 


Gallus gallus 


paranemin 


80 


27 


1262 


gi9837383 


Bos taurus 


retinitis pigmentosa GTPase 
regulator 


76 


29 


1263 


AAU98893 


Homo sapiens 


Human protease PRTS 1 1. 


1600 


80 


1263 


gi965014 


Mus musculus 


ADAM 4 protein precursor 


1297 


51 


1263 


gil061159 


Macaca 
fascicularis 


testicular Metalloprotease-like, 
Disintegrin-like, Cysteine-rich 
protein IVa 


1253 


38 


1264 j 


AAE21462 


Homo sapiens 


Human gene 1 1 encoded secreted 
protein HWBBT49, SEQ ID NO:78. 


2107 


99 


1264 


AAM80033 


Homo sapiens 


Human protein SEQ ID NO 3679. 


1895 


98 


1264 


AAM79049 


Homo sapiens 


Human protein SEQ ID NO 1711. 


1895 


98 


1265 


AAB44605 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 10 SEQ ID NO:70. 


93 


70 


1266 


AAO15039 


Homo sapiens 


Human clusterin protein sequence 1 . 


1981 


87 


1266 


gi30251 


Homo sapiens 


SP-40 S 40 prepropetide (AA -22 to 
427) 


1981 


87 


1266 


g il4714741 


Homo sapiens 


clusterin (complement lysis inhibitor, 
SP-40,40, sulfated glycoprotein 2 5 


1981 


87 
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testosterone-repressed prostate 
message 2, apolipoprotein J) 






1267 


AAB37963 


Homo sapiens 


Human antithrombin III amino acid 
sequence. 


2136 


91 


1267 


AAY92224 


Homo sapiens 


Human antithrombin III. 


2136 


91 


1267 


AAR42895 


Homo sapiens 


Human antithrombin III (wild-type). 


2136 


91 


1268 


AAB37963 


Homo sapiens 


Human antithrombin III amino acid 
sequence. 


2276 


96 


1268 


AAY92224 


Homo sapiens 


Human antithrombin III. 


2276 


96 


1268 


AAR42895 


Homo sapiens 


Human antithrombin III (wild-type). 


2276 


96 


1269 


gi203519 


Rattus 
norvegicus 


cytochrome c oxidase subunit VIc 


247 


67 


1269 


gil9354073 


Mus musculus 


cytochrome c oxidase, subunit VIc 


244 


67 


1269 


AAB56523 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1101. 


229 


61 


1270 


gi7981304 


Homo sapiens 


dJ551D2.1.2 (cadherin-like protein 
VR20, isoform 2) 


575 


84 


1270 


AAM96033 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 4691. 


512 


84 


1270 


gi9622236 


Homo sapiens 


cadherin-like protein VR20 


272 


100 


1271 


gi6692690 


Arabidopsis 
thaliana 


F12K11.14 


105 


26 


1271 


gil58696 


Drosophila 
melanogaster 


tropomyosin isoform 9D 


100 


25 


1271 


gil58695 


Drosophila 
melanogaster 


tropomyosin isoform 33 (9C) 


100 


25 


1272 


gi6692690 


Arabidopsis 
thaliana 


F12K11.14 


106 


24 


1272 


gil 1935051 


Oryctolagus 
cuniculus 


sarcolemmal associated protein 1 


99 


27 


1272 


gil850913 


Entamoeba 
histolytica 


myosin heavy chain 


98 


19 


1273 


AAM79617 


Homo sapiens 


Human protein SEQ ID NO 3263. 


1481 


94 


1273 


AAM78633 


Homo sapiens 


Human protein SEQ ID NO 1295. 


1481 


94 


1273 


gil6876913 


Homo sapiens 


mercaptopyruvate sulfurtransf erase 


1481 


94 


1274 


AA021899 


Homo sapiens 


Protein of human protease. 


772 


49 


1274 


AAE18722 


Homo sapiens 


Human DESCl-like serine protease. 


767 


48 


1274 


AAB85039 


Homo sapiens 


Human SER5 protein sequence. 


754 


47 


1275 


gil2584839 


Homo sapiens 


HT036-ISO 


1003 


94 


1275 


gil2584841 


Homo sapiens 


HT036 


826 


93 


1275 


gil7985315 


Brucella 
melitensis 


HYDROXYPYRUVATE 
ISOMERASE 


512 


43 


1277 


AAB92449 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 10478. 


261 


100 


1277 


AAB83299 


Homo sapiens 


Human homoglutamine-rich factor 
56. 


261 


100 


1277 


AAB82342 


Homo sapiens 


Winged helix/zinc finger 
transcription factor FOXP1 variant. 


261 


100 


1278 


AAM25840 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1355. 


208 


88 


1279 


AAU72886 


Homo sapiens 


Human aspartyl protease partial 
protein sequence #11. 


821 


98 


1279 


AAU98884 


Homo sapiens 


Human protease PRTS2. 


821 


98 


1279 


gi7023943 


Homo sapiens 


down-regulated in gastric cancer 


350 


35 


1280 


AAG77975 


Homo sapiens 


Protein of a human soluble adenylyl 


265 


37 
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cyclase. 






1280 


AAE21187 


Homo sapiens 


Human soluble adenylyl cyclase 
(SAC) protein. 


265 


37 


1280 


AAB81929 


Homo sapiens 


Human soluble adenylyl cyclase. 


265 


37 ' 


1282 


gi4079809 


Homo sapiens 


HERC2 


68 


42 


1283 


ABG45272 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
34937. 


153 


67 


1283 


AAM35920 


Homo sapiens 


Peptide #9957 encoded by probe for 
measuring placental gene expression. 


153 


67 


1283 


AAM20697 


Homo sapiens 


Peptide #7131 encoded by probe for 
measuring cervical gene expression. 


153 


67 


1284 


AAG81367 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:252. 


849 


90 


1284 


gil 8088345 


Homo sapiens 


Similar to RIKEN cDNA 
1110066C01 gene 


832 


90 


1284 


gi20381141 


Mus musculus 


RIKEN cDNA 1110066C01 gene 


642 


68 


1285 


AAG81367 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:252. 


939 


91 


1285 


gil 8088345 


Homo sapiens 


Similar to RIKEN cDNA 
1110066C01 gene 


827 


89 


1285 


gi20381141 


Mus musculus 


RIKEN cDNA 1110066C01 gene 


724 


70 


1286 


AAM79782 


Homo sapiens 


Human protein SEQ ID NO 3428. 


95 


36 


1286 


AAM78798 


Homo sapiens 


Human protein SEQ ID NO 1460. 


93 


41 


1286 


gil77179 


Homo sapiens 


alpha-2 type VIII collagen 


93 


41 


1287 


gi5689766 


Homo sapiens 


zinc finger 2.2 


2092 


99 


1287 


AAM39130 


Homo sapiens 


Human polypeptide SEQ ID NO 
2275. 


1048 


46 


1287 


gi3 135968 


Homo sapiens 


b34I8.1 (zinc linger protein 184 
(Kruppel-like)) 


1048 


46 


1288 


ABP43144 


Homo sapiens 


Human ovarian antigen HWCD65, 
SEQ ID NO:4276. 


281 


86 


1288 


AAB44228 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1673. 


281 


86 


1288 


ABB06807 


Homo sapiens 


Human nGPCR-Seql019 protein 
sequence SEQ ID NO:76. 


69 


34 


1290 


AAG03150 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7231. 


307 


98 


1290 


AAW48931 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


286 


100 


1291 


AAB60098 


Homo sapiens 


Human transport protein TPPT-1 8. 


2360 


92 


1291 


gil 537070 


Rattus 
norvegicus 


nucleoporin p54 


2292 


88 


1291 


gi7688695 


Homo sapiens 


nucleoporin p54 protein 


2246 


89 


1292 


AAY94621 


Homo sapiens 


Epidermal growth factor-like variant 
in skin-2 amino acid sequence. 


422 


79 


1292 


ABB72266 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 417. 


415 


78 


1292 


AAB56066 


Homo sapiens 


Skin cell protein, SEQ ID NO: 417. 


415 


78 


1293 


AAM79352 


Homo sapiens 


Human protein SEQ ID NO 2998. 


1018 


98 


1293 


ABB 11835 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2205. 


1018 


98 


1293 


AAW78245 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 19. 


1018 


98 


1294 


AAM99920 


Homo sapiens 


Human polypeptide SEQ ID NO 36. 


669 


85 
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1294 


AAM99933 


Homo sapiens 


Human polypeptide SEQ ID NO 49. 


629 


82 


1294 


AAG75526 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6290. 


188 


38 


1295 


gi2598167 


Homo sapiens 


zinc finger protein 


2772 


99 


1295 


gi5640019 


Mus musculus 


zinc finger protein ZFP235 


1707 


65 


1295 


gil3277768 


Mus musculus 


zinc finger protein 93 


1434 


54 


1296 


gi4567180 


Homo sapiens 


BC37295 2 (partial) 


2995 


100 


1296 


ABB50238 


Homo sapiens 


Human transcription factor TRFX- 
89. 


2734 


100 


1296 


gi9502202 


Homo sapiens 


endothelial zinc finger protein 
induced by tumor necrosis factor 
alpha 


2734 


100 


1297 


ABG38718 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
28383. 


97 


55 I 


1297 


AAM04653 


Homo sapiens 


Peptide #3335 encoded by probe for 
measuring breast gene expression. 


97 


55 


1297 


AAM29443 


Homo sapiens 


Peptide #3480 encoded by probe for 
measuring placental gene expression. 


97 


55 


1298 


AAE14459 


Homo sapiens 


Human protein phosphatase-9. 


601 


100 ! 


1298 


AAM79176 


Homo sapiens 


Human protein SEQ ID NO 1838. 


601 


ioo ! 


1298 


AAE14251 


Homo sapiens 


Human 16051b protein. 


563 


100 


1299 


AAB74690 


Homo sapiens 


Human protease and protease 
inhibitor PPIM-23. 


2420 


96 


1299 


AAB85427 


Homo sapiens 


Human MPROT45 polypeptide. 


2420 


96 


1299 


AAU12243 


Homo sapiens 


Human PR04339 polypeptide 
sequence. 


2420 


96 


1300 


AAE16953 


Homo sapiens 


Human precursor interleukin-18 
(Pro-IL-18) protein. 


138 


92 


1300 


AAG63830 


Homo sapiens 


Amino acid sequence of human 
interleukin 18 (IL-18). 


138 


92 ! 


1300 


AAB30541 


Homo sapiens 


A human IL-18 with a caspase-8 
cleavage site. 


138 


92 


1301 


AAE05302 


Homo sapiens 


Human TANGO 457 protein. 


901 


78 


1301 


AAE05303 


Homo sapiens 


Human mature TANGO 457 protein. 


889 


79 


1301 


AAE05305 


Homo sapiens 


Human TANGO 457 protein 
cytoplasmic domain. 


883 


78 


1302 


gi2202 


Canis sp. 


Clox 


68 


73 


1302 


gi6066468 


Leishmania 
major 


probable DNA polymerase zeta 
catalytic component 


67 


41 


1303 


gi20068312 


Atropa 
belladonna 


maturase 


65 


34 


1304 


gi297146 


Homo sapiens 


retinoic acid receptor gamma 2 


274 


100 


1304 


gil8031837 


Mesocricetus 
auratus 


retinoic acid receptor gamma-2 


256 


94 


1304 


gi200672 


Mus musculus 


retinoic acid receptor gamma 


252 


92 


1305 


AAM39737 


Homo sapiens 


Human polypeptide SEQ ID NO 
2882. 


992 


99 


1305 


ABG34053 


Homo sapiens 


Human Pro peptide #24. 


875 


100 


1305 


AAM41523 


Homo sapiens 


Human polypeptide SEQ ID NO 
6454. 


875 


100 


1307 


gi212485 


Gallus gallus 


ovoinhibitor 


133 


47 


1307 


ABB72111 


Homo sapiens 


Human protein isolated from skin 
cells SEQ ID NO: 150. 


132 


42 


1307 


AAB55911 


Homo sapiens 


Skin cell protein, SEQ ID NO: 150. 


132 


42 
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1308 


AAU27671 


Homo sapiens 


Human protein AFP355471. 


486 


1 A A 

100 


1308 1 


gil256888 


Saccharomyces 
cerevisiae 


Ylr086wp 


75 


30 


1308 


gi21280444 


Homo sapiens 


AF15ql4 


71 


27 


1309 


AAR15222 


Homo sapiens 


Chronic myelogenous leukaemia- 
derived myeloid-related protein. 


620 


1 A A 

100 


1309 


gi32402 


Homo sapiens 


HP-1 (AA 1-94) 


493 


1 AA 

100 


1309 


gil81529 


Homo sapiens 


defensin 1 


493 


1 AA 

100 


1310 


AAB39114 


Homo sapiens 


Human secreted protein #22. 


347 


1 AA 

100 


1310 


gi23 16086 


Pisum sativum 


gibberellin 3B-hydroxylase 


67 


43 


1310 


gi2316018 


Pisum sativum 


gibberellin 3 beta-hydroxylase 


67 


43 


1311 


ABP47852 


Homo sapiens 


Human polypeptide SEQ ID NO 282. 


205 


40 


1311 


ABB90743 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 218. 


186 


42 


1311 


gi3 127926 


Homo sapiens 


collagen type VI, alpha 3 chain 


186 


42 


1312 


gil6555334 


Homo sapiens 


Rig protein 


1012 


1 A A 


1312 


gil6508176 


Homo sapiens 


small GTP-binding tumor suppressor 
1 


1012 


100 


1312 


gi2 1040535 


Homo sapiens 


similar to Rig protein 


1012 


100 


1313 


gi21741597 


Oryza sativa 


OSJNBa0052Pl 6.6 


84 


40 


1313 


gi21741539 


Oryza sativa 


OSJNBa0052P16.7 


84 


40 


1313 


gi21743203 


Oryza sativa 


OSJNBa0085C10.10 


81 


37 


1314 


gi2689446 


Homo sapiens 


R27945 1 


1889 


91 


1314 


AAM71801 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
32107. 


1872 


100 


1314 


AAM79549 


Homo sapiens 


Human protein SEQ ID NO 3 195. 


1145 


55 


1315 


AAM79404 


Homo sapiens 


Human protein SEQ ID NO 3050. 


845 


79 


1315 


gi3228237 


Homo sapiens 


ultra high sulfer keratin 


791 


69 


1315 


gi32472 


Homo sapiens 


high-sulpher keratin 


783 


76 


1316 


gi 12655446 


Homo sapiens 


keratin associated protein 4.4 


768 


78 


1316 


gil3278909 


Homo sapiens 


Similar to RIKEN cDNA 
1110054P19gene 


738 


75 


1316 


gil2655460 


Homo sapiens 


keratin associated protein 4.12 


738 


75 


1317 


gil2655462 


Homo sapiens 


keratin associated protein 4.14 


1102 


88 


1317 


gil2655452 


Homo sapiens 


keratin associated protein 4.7 


1088 


O A 

84 i 


1317 


gil2655456 


Homo sapiens 


keratin associated protein 4.9 


1002 


82 


1318 


AAM39466 


Homo sapiens 


Human polypeptide SEQ ID NO 
2611. 


893 


78 


1318 


AAM41252 


Homo sapiens 


Human polypeptide SEQ ID NO 
6183. 


885 


78 


1318 


gi3228237 


Homo sapiens 


ultra high sulfer keratin 


872 


73 


1319 


ABB79480 


Homo sapiens 


Human zinc finger protein 75.68. 


800 


65 


1319 


gi6467200 


Homo sapiens 


gonadotropin inducible transcription 
repressor- 1 


773 


63 


1319 


gi20306351 


Homo sapiens 


similar to gonadotropin inducible 
transcription repressor- 1 


773 


63 


1320 


gi3036963 


Ciona savignyi 


L-SLJDU4Z 


163 


60 


1320 


gi21667044 


Ustilago maydis 


GTP binding protein Cdc42 


162 


60 


1320 


gil5072535 


Schizophyllum 
commune 


small GTPase CDC42 


162 


60 


1321 


AAE02058 


Homo sapiens 


Human four disulfide core domain 
(FDCD)-containing protein. 


511 


43 


1321 


gil2655452 


Homo sapiens 


keratin associated protein 4.7 


492 


43 


1321 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


481 


42 
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1322 


ABB 12490 


Homo sapiens 


Human bone marrow expressed 
protein SEQ ID NO: 329. 


169 


72 


1323 


AAM39144 


Homo sapiens 


Human polypeptide SEQ ID NO 
2289. 


378 


92 


1323 


gi5921473 


Homo sapiens 


G8 protein 


378 


92 


1323 


AAM40930 


Homo sapiens 


Human polypeptide SEQ ID NO 
5861. 


370 


91 


1324 


gi 18446967 


Drosophila 
melanogaster 


AT14419p 


70 


29 


1325 


gi 15277229 


Homo sapiens 


Homologue to Drosophila 
photoreceptor protein calphotin 


76 


O A 

34 


1325 


gil 136400 


Homo sapiens 


♦ • 1 t T~V "1*1 1 _ » _ i „ 

similar to Drosophila photoreceptor 
cell-specific protein, calphotin. 


76 


1 A 

34 


1327 


AAU07343 


Homo sapiens 


1-aminocyclopropane carboxylate 
(ACPC) synthase #12. 


262 


38 


1327 


AAU91279 


Homo sapiens 


Human NOV3a protein. 


259 


33 


1327 


gi3 13720 


Ovis aries 


KAP5.4 keratin protein 


177 


26 


1328 


AAY08325 


Homo sapiens 


Human granulysin P520 active 
fragment. 


316 


75 


1328 


AAW59874 


Homo sapiens 


Ammo acid sequence of the cDNA 
clone CAT-1 (HTXET53). 


316 


75 


1328 


AAR23732 


Homo sapiens 


Gene 519 cDNA derived peptide. 


316 


75 


1329 


AAM41252 


Homo sapiens 


Human polypeptide SEQ ID NO 
6183. 


345 


43 


1329 


AAM39466 


Homo sapiens 


Human polypeptide SEQ ID NO 
2611. 


345 


43 


1329 


gi200964 


Mus musculus 


serine 2 ultra high sulfur protein 


342 


49 


1330 


gil3937769 


Homo sapiens 


Similar to RIKEN cDNA 
1200013F24 gene 


1256 


1 A A 

100 


1330 


gi75 82294 


Homo sapiens 


BM-011 


TO 1 

781 




1330 


AAG67014 


Homo sapiens 


Human sperm-specific protein EMI, 
EM6-48. 


249 


30 


1331 


gil4718451 


Homo sapiens 


sialic acid-binding lectin 1 1 


793 


70 


1331 


gil9716086 


Homo sapiens 


Sialic acid-binding Ig-like lectin 
Siglec-12 


793 


70 


1331 


AAU29082 


Homo sapiens 


Human PRO polypeptide sequence 
#59. 


551 


50 


1332 


ABP41951 


Homo sapiens 


Human ovarian antigen HDABR73, 
SEQ ID NO:3083. 


848 


98 


1332 


AAB43821 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1266. 


848 


98 


1332 


AAB 14201 


Homo sapiens 


Human placental bikunin protein 
fragment # 16. 


848 


98 


1333 


AAB93164 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12091. 


2165 


98 


1333 


AAM93693 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3604. 


2159 


100 






riomo sapiens 


I-Ti im ar*i traneovitti'i rvn "Pa r»+/vr TT? T-T~V 

riuiiiau iicLiisLiiipLiuu ictCLor iivryv.- 
55. 


1206 


57 


1334 


gil2804907 


Homo sapiens 


Similar to metaxin 1 


1512 


100 


1334 


gi2564913 


Homo sapiens 


metaxin 


1470 


90 


1334 


gil 8606009 


Mus musculus 


metaxin 


1333 


81 


1335 


ABB89371 


Homo sapiens 


Human polypeptide SEQ ID NO 
1747. 


487 


100 


1335 


ABB90319 


Homo sapiens 


Human polypeptide SEQ ID NO 


381 


83 
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2695. 






1335 


ABP61852 


Homo sapiens 


Human polypeptide SEQ ID NO 206. 


381 


83 


1336 


AAB 18456 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2257 


99 


1336 


AAB 18447 


Homo sapiens 


Amino acid sequence of human 
TANGO 216 polypeptide. 


2257 


99 


1336 


AAB 18457 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2254 


99 


1337 


AAB 18456 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2257 


99 


1337 


AAB 18447 


Homo sapiens 


Amino acid sequence of human 
TANGO 216 polypeptide. 


2257 


99 


1337 


AAB 18457 


Homo sapiens 


A human TANGO 216 polypeptide 
clone. 


2254 


99 


1338 


AAY86303 


Homo sapiens 


Human secreted protein HOGCK20, 
SEQ ID NO:218. 


2382 


88 


1338 


AAY86333 


Homo sapiens 


Human secreted protein HOGCK20, 
SEQ ID NO:248. 


2215 


87 


1338 


AAB65254 


Homo sapiens 


Human PR01379 (UNQ716) protein 
sequence SEQ ID NO:340. 


2117 


96 


1339 


gi20072551 


Mus musculus 


RIKEN cDNA 49305 1 1J1 1 gene 


412 


43 


1339 


gil2836893 


Gallus gallus 


IPR328-like protein 


150 


29 


1339 


gil 7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


149 


25 


1340 


gi20072551 


Mus musculus 


RIKEN cDNA 49305 1 1 Jl 1 gene 


420 


45 


1340 


gil 7974542 


Homo sapiens 


voltage-dependent calcium channel 
gamma- 8 subunit 


147 


25 


1340 


gil2836893 


Gallus gallus 


IPR328-like protein 


147 


29 


1341 


AAG89353 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 473. 


692 


100 


1341 


ABB11882 


Homo sapiens 


Human transmembrane protein 
homologue, SEQ ID NO:2252. 


692 


100 


1341 


AAW85737 


Homo sapiens 


Polypeptide with transmembrane 
domain. 


692 


100 


1342 


ABB12032 


Homo sapiens 


Human SIGP 2328134 homologue, 
SEQ ID NO:2402. 


1202 


82 


1342 


AAY21851 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 2328134). 


1202 


82 


1342 


gi4101574 


Homo sapiens 


54TMp 


1196 


81 


1343 


gi3002925 


Homo sapiens 


T cell receptor beta chain 


1658 


100 


1343 


AAE13850 


Homo sapiens 


Human lung tumour-specific N- 
terminal protein 14F10. 


1526 


94 


1343 


AAE13848 


Homo sapiens 


Human lung tumour-specific T cell 
receptor beta chain. 


1526 


94 


1344 


gil4973269 


Streptococcus 

pneumoniae 

TIGR4 


cell wall surface anchor family 
protein 


413 


19 


1344 


gi560649 


Neocallimastix 
patriciarum 


Xylanase B; XYLB 


243 


19 


1344 


gil3094677 


Mus musculus 


ribosome receptor isoform mRRp61 


240 


22 


1345 


AAY07751 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 8. 


293 


100 


1345 


gil 142588 


Trypanosoma 
brucei 


CR3 


79 


42 


1345 


gi3037018 


Bodo saltans 


NADH dehydrogenase subunit 5 


78 


35 
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1346 


gi 18857903 


Homo sapiens 


TCBAl 


867 


100 


1346 


AAG78000 


Homo sapiens 


Human actin 14. 


663 


100 


1346 


ABB89045 


Homo sapiens 


Human polypeptide SEQ ID NO 
1421. 


644 


98 


1347 


gi9837433 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 8 long splice variant 


2206 


88 


1347 


AAW94995 


Homo sapiens 


SAF-2 polypeptide. 


2031 


93 


1347 


gi6980022 


Homo sapiens 


siglec SAF2 


2031 


93 


1348 


gil5451469 


Homo sapiens 


siglec-like protein splice variant- 1 


2689 


99 


1348 


gil5217166 


Homo sapiens 


sialic acid-binding Ig-like lectin 10 


2682 


99 


1348 


gil4164613 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 10 


2356 


98 


1349 


AAU76036 


Homo sapiens 


Human sugar transporter- 1 (HST-1) 
protein sequence. 


1496 


89 


1349 


AAB60112 


Homo sapiens 


Human transport protein TPPT-32. 


775 


100 


1349 


AAB61903 


Homo sapiens 


Atherosclerosis-associated 
polypeptide. 


445 


38 


1350 


ABB06115 


Homo sapiens 


Human NS protein sequence SEQ ID 
NO:207. 


357 


97 


1350 


AAY76219 


Homo sapiens 


Human secreted protein encoded by 
gene 96. 


336 


94 


1350 


gi2906006 


Homo sapiens 


WASP interacting protein 


125 


30 


1351 


AAB08767 


Homo sapiens 


A human leukocyte and blood related 
protein (LBAP). 


87 


27 


1351 


ABB89384 


Homo sapiens 


Human polypeptide SEQ ID NO 
1760. 


86 


28 


1351 


gi576631 


Torpedo 
marmorata 


14 kDa transmembrane protein 


86 


32 


1352 


AAE16765 


Homo sapiens 


Human transporter and ion channel-2 
(TRICH-2) protein. 


312 


96 


1352 


gi7576452 


Homo sapiens 


potent brain type organic ion 
transporter 


159 


37 


1352 


AAY53009 


Homo sapiens 


Human secreted protein clone 
fhl49 12 protein sequence SEQ ID 
NO:24. 


153 


36 


1353 


AAU83670 


Homo sapiens 


Human PRO protein, Seq ID No 158. 


2566 


99 


1353 


ABB84896 


Homo sapiens 


Human PRO 1309 protein sequence 
SEQ ID NO: 160. 


2566 


99 


1353 


ABB95502 


Homo sapiens 


Human angiogenesis related protein 
PRO1309 SEQ ID NO: 160. 


2566 


99 


1354 


AAM93665 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3544. 


498 


40 


1354 


AAU29109 


Homo sapiens 


Human PRO polypeptide sequence 
#86. 


498 


40 


1354 


AAU27785 


Homo sapiens 


Human full-length polypeptide 
sequence #110. 


498 


40 


1355 


AAB87570 


Homo sapiens 


Human PR01268. 


603 


100 


1355 


AAU29173 


Homo sapiens 


Human PRO polypeptide sequence 
#150. 


603 


100 


1355 


AAY78808 


Homo sapiens 


Hydrophobic domain containing 
protein clone HP 10537 protein 
sequence. 


603 


100 


1356 


gi2l5l8639 


Homo sapiens 


TSLCl-like2 


1991 


97 


1356 


gil9068l39 


Mus museums 


membrane glycoprotein 


1970 


96 


1356 


AAM78418 


Homo sapiens 


Human protein SEQ ID NO 1080. 


1905 


97 
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1357 


AAY49573 


Homo sapiens 


Human CLA-1 protein sequence. 


2503 


94 


1357 


gi397607 


Homo sapiens 


CLA-1 


2d(JJ 


C\A 


1357 


ABB12012 


Homo sapiens 


Human SR-BI class B scavenger 
homologue, SEQ ID NO:2382. 


2490 


C\A 

94 


1358 


gi854065 


Human 
herpesvirus 6 


U88 


340 


31 


1358 


gi21928439 


Homo sapiens 


: — p 

seven transmembrane helix receptor 


Zyy 


j j 


1358 


AAB95124 


Homo sapiens 


Human protein sequence SEQ ID 

jNvJ.1 / iZZ. 


Zl I 




1359 


AAE05302 


Homo sapiens 


Human TANGO 457 protein. 


1518 


96 


1359 


AAE05303 


Homo sapiens 


Human mature 1 AiNOU 43 / protein. 


ioy4 


yo 


1359 


AAE05305 


Homo sapiens 


Human TANGO 457 protein 
cytoplasmic domain. 


1260 


100 


1360 


gi20799661 


Mus musculus 


mucolipin-2 


2020 


76 


1360 


gi20987535 


Mus musculus 


RIKEN cDNA 3300002C04 gene 


2017 


75 


1360 


gil9072754 


Homo sapiens 


mucolipin-3 


1406 


53 


1361 


AAE04122 


Homo sapiens 


Human gene 23 encoded secreted 
protein HE80K73, SEQ ID NO: 1 08. 


210 


61 


1361 


AAE04169 


Homo sapiens 


Human gene 23 encoded secreted 
protein HE80K73, SEQ ID NO: 158. 


203 


60 


1361 


AAG00392 


Homo sapiens 


Human secreted protein, SEQ ID 

XTrt A ATI 

NO: 4473. 


112 


43 


1362 


AAY27853 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


Z/4 


y4 


1362 


gi2 17405 60 


Oryza sativa 


OSJNBa0033Gl6.l0 


oy 


An 
4/ 


1362 


gi403081 


Culex pipiens 


reverse transcriptase 


66 


32 


1363 


AAM39421 


Homo sapiens 


Human polypeptide SEQ ID NO 
2566. 


4775 


86 


1363 


gil7016967 


Homo sapiens 


NUANCE 


4775 


86 


1363 


gil7861384 


Homo sapiens 


nesprin-2 gamma 


4775 


86 


1364 


AAB37381 


Homo sapiens 


Human secreted protein #10 encoded 
by cDNA #34. 


347 


no 

yo 


1364 


AAB95854 


Homo sapiens 


Human protein sequence SEQ ID 

\TA .10010 


/U 


in 


1365 


ABB90157 


Homo sapiens 


Human polypeptide btsQ id n<j 

ZD Jo. 


1 JO 


<^ 

JJ 


1365 


AAW296j4 


Homo sapiens 


Human secreted protein DM406_1 . 






1365 


gi22328096 


Homo sapiens 


LOC 145053 


136 


48 


1366 


AAU76873 


Homo sapiens 


Human CRF-like protein LP231. 


1021 


91 


1366 


ABB53290 


Homo sapiens 


Human polypeptide #30. 




CO 


1366 


gi 10566471 


Mus musculus 


Gliacolin 


C\A A 

944 


CD 


1367 


gi2585988 


Homo sapiens 


rib onucl ease k6 precursor 


700 


87 


1367 


gil 8088595 


Homo sapiens 


rib onucl ease, RNase A family, k6 


700 


87 


1367 


gi2745750 


Pan troglodytes 


ribonuclease k6 precursor 


695 


86 


1368 


AAE09651 


Homo sapiens 


Human gene 13 encoded lipid 
metabolism protein HTJNI73, SEQ 
ID NO:45. 


484 


98 


1368 


AAG64355 


Homo sapiens 


Human lambda crystallin. 


A C\C\ 

4UU 


y / 


1368 


ABB89956 


Homo sapiens 


Human polypeptide SEQ ID NO 
2332. 


381 


98 


1369 


AAM25241 


Homo sapiens 


Human protein sequence SEQ ID 
NO:756. 


484 


95 


1369 


AAE09651 


Homo sapiens 


Human gene 13 encoded lipid 
metabolism protein HTJNI73, SEQ 
ID NO:45. 


352 


100 
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1369 


AAG64355 


Homo sapiens 


Human lambda crystallin. 


268 


98 


1371 


gi21685525 


Homo sapiens 


similar to chloride channel protein 
SW:CICH TORMA 


4250 


94 


1371 


gi6653659 


Oryctolagus 
cuniculus 


chloride channel CLC-6 


4155 


91 


1371 


ABB11826 


Homo sapiens 


Human CI channel homologue, SEQ 
IDNO.2196. 


4100 


91 


1372 


gi21928599 


Homo sapiens 


seven transmembrane helix receptor 


749 


100 


1372 


AAG71967 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1648. 


725 


97 


1372 


AAG71962 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1643. 


714 


95 


1373 


ABG45325 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
34990. 


278 


100 


1373 


AAM35993 


Homo sapiens 


Peptide #10030 encoded by probe for 
measuring placental gene expression. 


278 


100 


1373 


AAM20725 


Homo sapiens 


Peptide #7159 encoded by probe for 
measuring cervical gene expression. 


278 


100 


1374 


AAU 12071 


Homo sapiens 


Human PHT1 variant protein from 
Caco-2 cells. 


920 


100 


1374 


AAU12070 


Homo sapiens 


Human PHT1 variant protein from 
BeWo cells. 


920 


100 


1374 


AAU12069 


Homo sapiens 


Human PHT1 protein splice variant. 


920 


100 


1375 


ABB77396 


Homo sapiens 


Human cathepsin L. 


1597 


87 


1375 


AAW47031 


Homo sapiens 


Human procathepsin L. 


1597 


87 


1375 


gi29715 


Homo sapiens 


pro-(cathepsin L) 


1597 


87 


1376 


AAU11764 


Homo sapiens 


Human alpha la adrenergic receptor- 
like GPCR. 


2554 


98 


1376 


AAG64126 


Homo sapiens 


Human G protein-coupled receptor 
GPRv72. 


2554 


98 


1376 


AAU04369 


Homo sapiens 


Human G-protein coupled receptor, 
hRUP15. 


2554 


98 


1377 


gil 8676524 


Homo sapiens 


FLJ00 159 protein 


164 


52 


1377 


gi21392066 


Drosophila 
melanogaster 


RE04357p 


139 


34 


1377 


AAB94312 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14783. 


82 


38 


1378 


ABB08011 


Homo sapiens 


Human secretin receptor-like GPCR. 


2759 


100 


1378 


AAU79494 


Homo sapiens 


Human G protein- coupled receptor 
TGR18-3. 


2582 


97 


1378 


AAE18652 


Homo sapiens 


Human G-protein coupled receptor 
(GCREC-13). 


2569 


96 


1379 


AAY30735 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


280 


100 


1379 


AAY40092 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


65 


100 


1379 


gil75 11709 


Homo sapiens 


down-regulated by Ctnnbl, a 


65 


26 


1380 


gi2459682 


Homo sapiens 


MAGE-B4 


883 


54 


1380 


gi3687196 


Homo sapiens 


MAGE-B1 


864 


54 


1380 


gi4033512 


Homo sapiens 


DAM10-DSS-AHC critical interval 
MAGE superfamily protein 


863 


54 


1381 


AAE21804 


Homo sapiens 


Human TREK2 protein. 


2791 J 


100 


1381 


gil9716290 


Homo sapiens 


potassium channel TREK2 splice 
variant b 


2784 


99 
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1381 


AA014193 


Homo sapiens 


Human transporter and ion channel 
TR1CH-10. 


2747 


98 


1382 


ABB 11297 


Homo sapiens 


Human Coxsackie adenovirus 
receptor homologue, SEQ ID 
NO:1667. 


699 


97 


1382 


gi 14279421 


Danio rerio 


coxsackievirus and adenovirus 
receptor-like protein 


DOO 


oZ 


1382 


ABB84956 


Homo sapiens 


Human PR05723 protein sequence 

ri T~>/~\ TT~\ \TA.OOn 

SEQ ID NO:28U. 


355 


33 


1383 


AAU99321 


Homo sapiens 


Human protooncogene KG- 19 
protein. 


IDD 


1 AA 


1383 


gil 8466808 


Homo sapiens 


cervical cancer 1 proto-oncogene- 
binding protein KG 19 


/DD 


1 AA 
LVV 


1383 


gi21961229 


Homo sapiens 


BRJ3 binding protein 


753 


100 


1384 


AAM06866 


Homo sapiens 


Human ioetal protein, bbQ ID JNU. 
1074. 


lion 

L LD 1 




1384 


gil5099951 


Mus museums 


diacylglycerol acyltransferase 2 


yjD 


^ i 


1384 


ABB75677 


Homo sapiens 


Breast protein-eukaryotic conserved 
gene i (do i r-EK^KJi ) protein. 


y4D 


*;a 
dv 


1385 


AAG68335 


Homo sapiens 


xiuman i_o.rz protein &r!,\t iu invj.^k 


1 1^4 

1 JJH' 


QQ 


1385 


gil9525540 


Homo sapiens 


lymphocyte effector toxicity 
activation ligand 


1330 


98 


1385 


AAY36071 


Homo sapiens 


.extended numan secreted protein 
sequence, SEQ ID NO. 456. 


1 9<?9 


09 


1386 


AAY94527 


Homo sapiens 


Human statherin protein. 






1386 


gi338611 


Homo sapiens 


statherin precurser 


260 


83 


1386 


gi338508 


Homo sapiens 


statherin protein 


o^a 
zou 


OJ 


1387 


AAB94721 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15739. 


1011 


100 


1387 


gil 73 84256 


Homo sapiens 


mucin 5 


i ni 

LVD 


Zo 


1387 


gi!334899 


Human 
herpesvirus 4 


BDLF3 late reading Irarne 9xJNyCi/iS 


1 aa 

IVV 


ZD 


1388 


ABP41842 


Homo sapiens 


Human ovarian antigen HTFML39, 
SEQ ID NO:2974. 


QIC 


ys 


1388 


AAB73302 


Homo sapiens 


Human cyclophilin B (CypB) C - 
terminal deletion mutant, CypB- 
AIAKE. 


y lo 


yo 


1388 


AAB73301 


Homo sapiens 


Human cyclophilin B (CypB). 


918 


98 


1389 


AAE07112 


Homo sapiens 


Human gene 6 encoded secreted 
protein tragment, bcA^ ID iN<J. izy. 


Z4/U 


QQ 

yy 


1389 


AAM93449 


Homo sapiens 


Human polypeptide, SEQ ID NO: 


2378 


99 


1389 


AAM93823 


Homo sapiens 


hLuman polypeptide, oiiv^ ID lskj. 
3881. 


LD /4 


QQ 

yy 


1390 


gil6589056 


Homo sapiens 


type II gonadotropin-releasing 
hormone receptor 


LD /O 


QA 

yn 


1390 


gil9697896 


Homo sapiens 


GnKH receptor 11 5 1 M 




Q1 

yj 


1 ion 


A AT T1 f»Q1 Q 

AAU lUoiy 


Homo sapiens 


numaii i ypt? lv vjfnivri-iv spnve 
variant 1 protein #2. 




Q^ 


1391 


gil 6359249 


Mus musculus 


RIKEN cDNA 13000 10M03 gene 


2226 


91 


1391 


AAM93450 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3100. 


587 


34 


1391 


ABB89832 


Homo sapiens 


Human polypeptide SEQ ID NO 
2208. 


522 


39 


1392 


AAE04896 


Homo sapiens 


Human transporter and ion channel-9 


825 


100 
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C 1 
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0/ 
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(TRICH-9) protein. 






1392 


AAE04899 


Homo sapiens 


Human transporter and ion channel- 
12 (1RICH-12) protem. 


688 


52 


1392 


AAE06616 


Homo sapiens 


Human protein having hydrophobic 

TTTJI A70C 

domain, tir\.i)/5D. 


688 


52 


1393 


gi21928487 


Homo sapiens 


seven transmembrane helix receptor 


1558 


100 


1393 


AAG72603 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2284. 


1 A 11 


on 


1393 


AAG71515 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 196. 


1433 


89 


1394 


ABB89241 


Homo sapiens 


Human polypeptide SEQ ID NO 
1617. 


195 


57 


1394 


AAB08894 


Homo sapiens 


Human secreted protem sequence 
encoded by gene 4 SEQ ID NO:5 1. 


165 


en 

59 


1394 


gi2 1070 180 


Danio rerio 


envelope protein 


111 
111 


A O 

4o 


1395 


gil5157307 


Agrobacterium 
tumefaciens str. 
CSS (Cereon) 


AGR_C_3928p 


85 


21 


1395 


gi9295309 


frog adenovirus 
1 


pllla protein 


82 


41 


1395 


ABB08204 


Homo sapiens 


Human lipid metabolism enzyme-4 

ST H JTT^ A\ 

(LME-4). 


76 


27 


1397 


AAG65914 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 27142. 


5760 


97 


1397 


gil9570398 


Homo sapiens 


hDDM36 


5760 


97 


1397 


AAU77405 


Homo sapiens 


Human NOV1 protein, homologue of 
JNUrJi/r UJNC Ig proteins. 


5619 


95 


1398 


gil5292481 


Drosophila 
melanogaster 


SD03655p 


1165 


59 


1398 


AAB88372 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0108. 


001 

ool 


Of 


1399 


ABB07553 


Homo sapiens 


BSTP-CAD fragment determined 
from I.M.A.G.E clone 52071. 


91 


29 


1399 


ABB07552 


Homo sapiens 


tt T>rtnnr* a t*\. 1 ^ j_* 1 

Human BSTP-CAD polypeptide. 


91 


29 


1399 


gi21959343 


Yersinia pestis 
KIM 


basal-body MS (membrane and 
supramembrane)-ring and collar 
protein 


69 


35 


1401 


AAB95124 


Homo sapiens 


Human protein sequence SEQ ID 
NO:17122. 


1 A O 
145 


1 A 

34 


1401 


gi854065 


Human 
herpesvirus 6 


U88 


139 


40 


1401 


AAO09309 


Homo sapiens 


TT 1 _ j." J CITA TT~\ XT/A 

Human polypeptide SEQ ID NO 
23201. 


1 IT 

137 


2o 


1402 


AAE01332 


Homo sapiens 


TT 1 O 11 » 1 

Human gene 1 8 encoded secreted 
protein fragment, SEQ ID NO: 197. 


222 


1 rvA 
100 


1402 


AAE01299 


Homo sapiens 


Human gene 18 encoded secreted 
protein HFIIN69, SEQ ID NO:162. 


222 


100 


1 A AO 

1402 


A A T7A1 O/fO 

AAiiUi24y 


Homo sapiens 


Human gene 1 8 encoded secreted 
protein HFIIN69, SEQ ID NO:l 1 1. 


79? 


1 00 

1 \J\J 


1403 


AAM06589 


Homo sapiens 


Human foetal protein, SEQ ID NO: 
320. 


237 


100 


1404 


AAU10510 


Homo sapiens 


Human leukocyte immunoglobulin 
receptor-like (LIR-like) protein #8. 


1411 


100 


1404 


AAU10511 


Homo sapiens 


Human leukocyte immunoglobulin 
receptor-like (LIR-like) protein #11. 


595 


89 
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1404 


AAO09486 


Homo sapiens 


Human polypeptide SEQ ID NO 

23378. 


486 


98 


1406 


gi20380511 


Mus musculus 


RIKEN cDNA 31 10007F17 gene 


293 


33 


1406 


gil3377867 


Gallus gallus 


claudin-3 


107 


24 


1406 


gil5553375 


Danio rerio 


claudin h 


106 


23 


1407 


ABB90122 


Homo sapiens 


Human polypeptide SEQ ID NO 
2498. 


1029 


84 


1407 


AAY32204 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 2132179. 


1029 


84 


1407 


AAY78801 


Homo sapiens 


Hydrophobic domain containing 
protein clone HP0063 1 amino acid 
sequence. 


1029 


84 


1408 


gi 13543940 


Homo sapiens 


Similar to RIKEN cDNA 
2610017G09 gene 


2229 


96 


1408 


A A T"» 1 O 1 O O 

AAB12138 


Homo sapiens 


TTJ 11" J ' j * * 1 j j 

Hydrophobic domain protein isolated 
from HT- 1080 cells. 


2218 


96 


1408 


AAB88466 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0260. 


2212 


96 


1 Add 


AAA yf2 Q/C/1 O 

AAJvloo043 


; 

Homo sapiens 


Human polypeptide SEQ ID NO 
1788. 


3168 


99 


1 A AG 


A A T TO 1 n/C 1 


Homo sapiens 


Human rKUy4i. 


2441 


98 


i a An 


A A T m^HA 

aau / / /yu 


Homo sapiens 


Human PR0943 protein. 


2441 


98 


1 A 1 A 

1410 


A A T HOAOO 

AAU78083 


Homo sapiens 


Human interleukin 22 receptor (IL- 
22R) protein sequence. 


1699 


100 


1 A 1 A 

1410 


AAU76905 


Homo sapiens 


Human Z-cytor-II protein. 


1699 


100 


1410 


A A T*>07^'A7 

AAB87607 


Homo sapiens 


Human PRO20233. 


1699 


100 


141 1 


a A rvoArn 

AAO20532 


Homo sapiens 


Protein of the human TFM-2 gene 
sequence. 


111 


100 


1411 


AAE21184 


Homo sapiens 


Human TRICH-28 protein. 


111 


100 


141 1 


glloo4UU4/ 


Homo sapiens 


aromatic amino acid transporter 


111 


1 AA 

100 


1 A 1 1 

1412 


A A C1 O A 


Homo sapiens 


Human transporters and ion channels 
(TRICH)-7. 


1317 


78 


141Z 


AAX34/Z/1 


Homo sapiens 


n(J A 1 1 . 


627 


A 1 

41 


1412 


AAY44278 


Homo sapiens 


Human organic anion transporter. 


627 | 


41 


141 J 


A A TV yf7AA77 

aam /yy / / 


Homo sapiens 


Human protein SEQ ID NO 3623. 


100 


31 


1413 


gi2764507 


Locusta 
migratoria 


nicotinic acetylcholine receptor, 
alpha 1 subunit 


100 


36 


1413 


guv 66011 


Gallus gallus 


• 1 1 1* ■ /-1117TT <"» 

winged helix protein CWH-3 


100 


37 


1414 


ABP41590 


Homo sapiens 


Human ovarian antigen HAZAR95, 
SEQ ID NO:2722. 


1522 


100 


1414 


AAB43682 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO:ll27. 


1522 


100 


1414 


gi5931730 


Homo sapiens 


phosphate carrier 


1522 


100 


1415 


gi 17640 15 


Ciona 
intestinalis 


COS4L5 


314 


42 


1415 


AAM42167 


Homo sapiens 


Human polypeptide SEQ ID NO 
7098. 


272 


30 


i/ii^ 
1415 


A A cm /I O A 

AAE03484 


Homo sapiens 


Human gene 12 encoded secreted 
protein HETHW90, SEQ ID NO: 167. 


250 


OA 

29 


1416 


AAB82315 


Homo sapiens 


Human immunoglobulin receptor 
isoform IRTA2c. 


1130 


88 


1416 


gil 5277746 


Homo sapiens 


Fc receptor-like protein 5 


1130 


88 


1416 


gil3591714 


Homo sapiens 


immunoglobulin superfamily 
receptor translocation associated 
protein 2c 


1130 


88 
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JL*rl / 


A AV971 529 
AA J. Z / loZ 


— — ; 

Homo sapiens 


-— — — 

Human RNF5 polypeptide. 


877 


90 




w oyow 1 


Homo sapiens 


Human zinc binding protein ZB-2. 


877 


90 


1417 


gi 13366064 


Homo sapiens 


HsRmal 


877 


90 


1/11 o 
1410 


A AT? 1 HAQI 


Homo sapiens 


Human secretion and trafficking 
protein-2 (SAT-2). 


1446 


100 


1 /! 1 0 
14l0 


ArJijroUZZ / 


Homo sapiens 


Human Mitsugumin29-like protein 
JNUVJa. 


1384 


97 


1418 


AAU96848 


Homo sapiens 


Human NOV3a protein variant. 


1378 


97 


1 a 1 n 


gil 3452508 


Mus musculus 


claudin 14 


371 


40 


1419 


AAU77764 


Homo sapiens 


Tumour associated antigenic target 

1 , • 1 /m A (Tl\ t f c 

polypeptide (TAT) 155. 


370 


40 


1419 


AAU29200 


Homo sapiens 


Human PRO polypeptide sequence 
#177. 


370 


40 


14ZU 


AJb>oo4UOo 


— ; 

Homo sapiens 


Human Pro peptide #39. 


697 


87 


1 AO A 

14ZU 


gizizuoooo 


Homo sapiens 


T-cell activation leucine repeat-rich 
protein; TA-LRRP 


697 


87 


1 aoh 

14ZU 


AArSl / 13Z 


Homo sapiens 


Human adenylyl and guanylyl 
cyclase (ADGUC)-4. 


505 


84 


14Z 1 


A A T T*7CA.Q1 

aau /ouy 1 


Homo sapiens 


Human nonerythroid Rh glycoprotein 
RhBG protein sequence. 


1891 


88 


1 /lO 1 


gil J / 1 o4 / 1 


Homo sapiens 


Rh type B glycoprotein 


1891 


88 


1421 


gil4346006 


Pan troglodytes 


Rh type B glycoprotein 


1886 


88 


1 /loo 


AAM0Q949 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 425. 


215 


46 


1422 


gil4209836 


Mus musculus 


ATP-binding cassette transporter 
sub-family A member 7 


175 


40 


1422 


AAO14210 


Homo sapiens 


Human transporter and ion channel 
lKlCH-27. 


174 


40 


1423 


ABB84987 


Homo sapiens 


Human PRO6006 protein sequence 
SEQ ID NO:342. 


718 


99 


I4Z3 


A Ttf"lA ACO 

Ar>LrJ4UZ>y 


Homo sapiens 


Human Pro peptide #30. 


718 


99 


1423 


ABB95593 


Homo sapiens 


Human angiogenesis related protein 
PRO6006 SEQ ID NO: 342. 


718 


99 


1424 


gil 8539467 


Homo sapiens 


nucleolar RNA-associated protein 
beta 


3851 


98 


1424 


gil 8539465 


Homo sapiens 


nucleolar RNA-associated protein 
alpha 


3851 


98 


14Z4 


gizuyoco / / 


Homo sapiens 


1 _ 1 T~» X T A _ j t j • 

nucleolar RNA-associated protein 


3823 


98 


1425 


AAE 15635 


Homo sapiens 


Human G-protein coupled receptor-5 
(OCREC-5) protein. 


499 


92 


1425 


AAB66272 


Homo sapiens 


Human TANGO 378 SEQ ID NO: 

on 

29. 


499 


92 


14ZD 


AAJbUj>4Uo 


Homo sapiens 


Human secreted protein fragment, 
d.bv^ ID 1N(J:04. 


499 


92 


1426 


gil 8676704 


Homo sapiens 


FLJ00251 protein 


4114 


99 


14Z0 


rrIA ILIQI 

gl4JJooo 


Tripneustes 
gratilla 


dynein heavy chain isotype 5 A 


406 


24 


1426 


ei6706264 


T cVimsiniJi 

JLs&LOLLLUalLla. 

major 


uyiicin iicavy viidin 


41D 


Zd 


1427 


AAE23980 


Homo sapiens 


Human LP220 secreted protein. 


2964 


99 


1427 


AAB70072 


Homo sapiens 


Human secreted protein #11. 


2475 


99 


1427 


AAE17484 


Homo sapiens 


Human leucine-rich repeat- 8 
(ZLRR8) protein #2. 


2407 


100 


1428 


ABB97996 


Homo sapiens 


Human severing stimulating factor 
Kda subunit 86.35. 


415 


39 
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S 
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% 
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1 A1Q 

14Zo 


glZlolo3 14 


Homo sapiens 


Similar to oxidation resistance 1 


413 


o a 

39 


1428 


gil3540300 


Mus musculus 


nucleolar protein C7B 


408 


34 


1429 


ABB90382 


Homo sapiens 


Human polypeptide SEQ ID NO 
2758. 


352 


63 


1 /ion 
1429 


A "T>T"»/4 1 HOC 

ABP41785 


Homo sapiens 


Human ovarian antigen HCFCS40, 
bbQ ID JNU:2917. 


352 


63 


14zy 


AAM4U391 


; 

Homo sapiens 


Human polypeptide SEQ ID NO 
3536. 


131 


32 


143U 


AAYU//J1 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 8. 


293 


1 A A 

100 


i /ion 
1430 


'1 1 /loroo 

gll 14Z3oo 


Trypanosoma 
brucei 


/-it-j n 


TO 

78 


A n 

43 


i /fin 
143U 


gl3U3 /Uio 


Bo do saltans 


NADH dehydrogenase subunit 5 


76 


33 


1431 


ABB04471 


Homo sapiens 


Human endoprotease 3 1 . 


940 


100 


1431 


AAM93525 


Homo sapiens 


TT 1 j.' J PIT") /~\ TT\ \Tr\ 

Human polypeptide, SEQ ID NO: 
3259. 


243 


36 


1431 


ABB 8 9344 


Homo sapiens 


Human polypeptide SEQ ID NO 
1720. 


219 


29 


1432 


AAB88388 


Homo sapiens 


Human membrane or secretory 
protein clone PSECO 131. 


306 


43 


1432 


'1A01 A A AO 

gi208 10493 


Homo sapiens 


Similar to RIKEN cDNA 
2810417M05 gene 


306 


43 


1432 


AAE21272 


Homo sapiens 


Human gene 16 encoded secreted 
protein fragment, SEQ ID NO: 138. 


207 


55 


1433 


AAB88388 


Homo sapiens 


Human membrane or secretory 
protein clone PSECO 131. 


308 


42 


1433 


AO 1 A /I AO 

gi208 10493 


Homo sapiens 


Similar to RIKEN cDNA 
2810417M05 gene 


308 


42 


1433 


A A T - "^ i 

AAE21272 


Homo sapiens 


Human gene 16 encoded secreted 
protein fragment, SEQ ID NO: 138. 


209 


54 


1434 


ABP43139 


Homo sapiens 


Human ovarian antigen HWBT60, 
bbQ ID NO:4271. 


235 


77 


1434 


A DD/IOO 1 C 

Ah>r4Zol3 


; 

Homo sapiens 


Human ovarian antigen HPC0095, 

oxil^ ID JNL).3y4/. 


235 


77 


1434 


ABP42409 


Homo sapiens 


Human ovarian antigen HOCOC38, 

oc,V«{ ID INU.3341. 


235 


77 


1 

1T-JJ 


gi i y 1 /ll OZ 


— — : 

Homo sapiens 


ventrhoid transmembrane protein 


lo ID 


yy 


1433 


A AT? 1 

A/YJtS 1 43 / 1 


Homo sapiens 


Human rhomboid related protein, 
RRP3. 


looy 


QQ 

yy 


1 AQ^ 
1433 


rri 1 G 1 7 1 1 &f\ 

giiyi / 1 iou 


Mus musculus 


ventrhoid transmembrane protein 




CSA 

y4 


1436 


gi7106866 


Homo sapiens 


HSPC238 


237 


56 


1 A Q< 

1430 


AAMyZU3Z 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 1401. 


Z33 


72 


1430 


gll334Z/U/ 


Mus musculus 


DTI/Tj\T -.TYIST A O C AAA AO T 1 A ~ 

KllvbN cUJNA ZDUUUU2L14 gene 


1 nc 

iy3 


/I T 

47 


i /in 
143 / 


A A 17 1 OO 1 1 

AAiiloZlZ 


Homo sapiens 


Human MOL4 protein. 


6140 


99 


1 /in 
143 / 


A A P/COim 

AAOooZy3 


Homo sapiens 


Human semaphorin G-like NHP 
protein SEQ ID NO: 10. 


£1 1 O A 

6134 


99 


1417 




Homo sapiens 


numan semapnonn ^j-iikc iNxii 
protein SEQ ID NO: 12. 




yo 


1438 


AAU95381 


Homo sapiens 


Human calcium transport protein 
CaTrF2Ell. 


4607 


99 


1438 


AAY96479 


Homo sapiens 


Human vanilloid receptor 3. 


4607 


99 


1438 


gi2 1690507 


unidentified 


CaTrF2Ell 


4607 


99 


1439 


gi 11055322 


Homo sapiens 


vanilloid receptor-related osmotically 
activated channel 


3324 


100 
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ID 


Hit ID 


Species 


Description 


score 


0/ 

Identity 


1439 


AAU74935 


Homo sapiens 


Amino acid sequence of human 
vanilloid receptor-like protein 2a 
(VRL~2a). 


3318 


99 


1439 


ABB79191 


Homo sapiens 


Human YR4 protein bEQ ID JN(J:2. 


33 15 


yy 


1440 


gil4574118 


Caenorhabditis 
elegans 


C. elegans DPY-19 protein 
(corresponding sequence F22B7.10) 


76 


24 


1440 


AAU83621 


Homo sapiens 


Human PRO protein, Seq ID No 60. 


74 


24 


1440 


AAO05826 


Homo sapiens 


Human polypeptide SEQ ID NO 
19718. 


70 


26 


1441 


gi4235228 


Mus musculus 


leucine zipper-EF-hand containing 
transmembrane protein 1 


497 


59 


1441 


gil 5 680275 


Homo sapiens 


<-i • 'lit ' ' T~ * T" 1 1 J 

Similar to leucine zrpper-EF-nand 
containing transmembrane protein 1 


494 


60 


1441 


gi 18204589 


Homo sapiens 


leucine zipper-EF-hand containing 
transmembrane protein 1 


A C\A 


/CA 


1442 


AAE17482 


Homo sapiens 


Human leucine-rich repeat-7 
(ZLRR.7) protein. 


1 It// 


1 AA 


1442 


ABB 11242 


Homo sapiens 


TT. OT IT 1 1 1 „ OCA TT\ 

Human SLIT-2 nomologue, SEQ ID 
NO:1612. 


oj3 


yy 


1442 


AAB07469 


Homo sapiens 


A human leucine-rich repeat protein 
designated Zlrr3. 


A A 1 

443 


JO 


1443 


AAU97218 


Homo sapiens 


Human G protein-coupled receptor, 
TGR25. 


1645 


O / 


1443 


AAE23415 


Homo sapiens 


Human G-protein coupled receptor-7 
(GCREC-7). 


1645 


87 


1444 


gil 5487341 


Escherichia coli 


macrolide-specific ABC-type efflux 
carrier 


697 


n 1 
91 


1444 


gi4062463 


Escherichia coli 


ABC transporter probable ATP- 
binding subunit homolog 


o97 


Q1 

y i 


1445 


gil7978985 


Arabidopsis 
thaliana 


At 1 g43 690/F2J 6_4 


/J 


JLJ. 


1445 


gi4239789 


Treponema 
maltophilum 


major sheath protein 


T A 

74 


23 


1446 


gi21426922 


Homo sapiens 


PELP1 


A 1 1 1 

4113 


y_> 


1446 


AAW31185 


Homo sapiens 


Human pl60 polypeptide 160.L 


3176 


87 


1446 


gi3 168604 


Homo sapiens 


proline and glutamic acid rich 
nuclear protein isoform 


1641 


99 


1447 


AAB94495 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15188. 


68 


36 


1447 


AAG67254 


Homo sapiens 


Amino acid sequence of a human 
liver-associated gene. 


68 


36 


1447 


gi2 14894 12 


Bombus 

hypocrita 

sapporoensis 


cytochrome oxidase 1 


66 


35 


1448 


AAB80418 


Homo sapiens 


Gene #4 associated peptide #1 . 




1 AA 
1UU 


1448 


AAB65162 


Homo sapiens 


Human PRO290 (UNQ253) protein 
sequence SEQ ID NO:33. 


1972 


100 


1 /l A O 

1448 


AAYooo3y 


Homo sapiens 


iviemorane-Douna protein rs\\jzy\j. 


1 Q79 


1 on 


1449 


gil61 17372 


Macropodid 
herpesvirus 1 


ICP4 


95 


27 


1449 


gi4063766 


Aspergillus 
nidulans 


chitinase 


92 


24 


1449 


gil5277229 


Homo sapiens 


Homologue to Drosophila 
photoreceptor protein calphotin 


89 


31 


1450 


ABB09760 


Homo sapiens 


Amino acid sequence of human 


1029 


100 
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ID 
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Species 
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S 

score 


% 
Identity 








hearing defect related protein 38.39. 






1450 


AAU27660 


Homo sapiens 


Human protem AFP671052. 


889 


100 


1450 


AAG74151 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:49l5. 


748 


100 


1452 


AAE21175 


Homo sapiens 


Human TRICH-19 protein. 


1041 


93 


1452 


gil5559050 


Ethmostigmus 
rubripes 


ADP-ATP translocator 


733 


68 


1452 


gi7542476 


Xenopus laevis 


adenine nucleotide translocase 


702 


67 


1454 


gi4929597 


Homo sapiens 


CGI-64 protein 


1879 


96 


1454 


gi6995987 


Homo sapiens 


mitochondrial carrier homolog l 
isoform a 


1818 


99 


1454 


gi6995989 


Homo sapiens 


mitochondrial carrier homolog l 
isoform b 


1790 


95 


1455 


gil7225210 


Podospora 
anserina 


beta transducin-like protein HET- 
D2Y 


247 


25 


1455 


gi886024 


Thermomonospo 
ra curvata 


PkwA 


239 


29 


1455 


gil7131893 


Nostoc sp. PCC 
7120 


WD-repeat protein 


238 


26 


1456 


AAB36840 


Homo sapiens 


Human insulin receptor-related 
receptor protein with signal peptide. 


6736 


98 


1456 


ABB 11702 


Homo sapiens 


Human insulin receptor-related 
protein homologue, SEQ ID 
NO:2072. 


673 1 


99 


1456 


AAB36836 


Homo sapiens 


Human insulin receptor-related 
receptor protein. 


673 1 


99 


1457 


ABB 11803 


Homo sapiens 


Human GPI-122 homologue, SEQ ID 
NO:2l73. 


6420 


99 


1457 


AAY50125 


Homo sapiens 


Human glycophosphatidylinositol- 
anchored protein GPI-122. 


6323 


100 


1457 


AAB94751 


Homo sapiens 


Human protein sequence SEQ ID 
NO:l5805. 


5872 


99 


1458 


AAU00023 


Homo sapiens 


Human activated T-lymphocyte 
associated sequence 2, ATLAS-2. 


3623 


99 


1458 


AAE04546 


Homo sapiens 


Human G-protein coupled receptor-2 
(GCREC-2) protein. 


3325 


89 


1458 


gil9387136 


Homo sapiens 


PYRIN-containing APAFl-like 
protein 5 


3316 


89 


1459 


AAE14719 


Homo sapiens 


Human carbohydrate-associated 
protein (CARBAP)-l. 


3132 


99 


1459 


AAB92508 


Homo sapiens 


Human protein sequence SEQ ID 
NO:l063l. 


2742 


100 


1459 


AAB95542 


Homo sapiens 


Human protein sequence SEQ ID 
NO:l8l55. 


2687 


99 


1460 


gi2960194 


Human T-cell 
lymphotropic 
virus type 2b 


rex 


81 


24 


1460 


gi404041 


Human T- 
lymphotropic 
virus 2 


rex protein 


81 


24 


1460 


gi348140 


Human T- 
lymphotropic 
virus 2 


rex 


81 


24 


1461 


gi 1103953 


Homo sapiens 


MHC class I HLA-B*l523 


71 


28 


1461 


gi!815616 


Homo sapiens 


HLA-B-1521 


71 


28 
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S 
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% 
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1461 


gil7467060 


Homo sapiens 


MHC class I 


71 


28 


1462 


gil 1066090 


Homo sapiens 


matrix metalloprotease MMP-27 


1188 


85 


1462 


AAU00469 


Homo sapiens 


Human TANGO 210 protein. 


1185 


85 


1462 


AAU12267 


Homo sapiens 


Human PR05992 polypeptide 
sequence. 


1185 


85 


1463 


gi601948 


Drosophila 
melanogaster 


Inscuteable 


132 


23 


1463 


gi7291194 


Drosophila 
melanogaster 


CG11312-PA 


132 


23 


1463 


gil657962 


Drosophila 
melanogaster 


Nem 


123 


24 


1464 


AAB45378 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 38 SEQ ID 
NO:130. 


433 


96 


1464 


gi406058 


Mus museums 


protein kinase 


216 


53 


1464 


AAE16277 


Homo sapiens 


Human kinase PKIN-23 protein. 


213 


53 


1465 


AAG00866 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4947. 


123 


54 


1466 


AAY97293 


Homo sapiens 


Lipid associated protein (LIPAP) 
3335404CD1. 


2178 


68 


1466 


AAM39997 


Homo sapiens 


Human polypeptide SEQ ID NO 
3142. 


1214 


43 


1466 


AAB24231 


Homo sapiens 


Human vesicle associated protein 10 
SEQ ID NO: 10. 


1214 


43 , 


1467 


ABP41928 


Homo sapiens 


Human ovarian antigen H6EEO05, 
SEQ ID NO:3060. 


922 


96 


1467 


AAB94535 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 15273. 


922 


96 


1467 


AAM79530 


Homo sapiens 


Human protein SEQ ID NO 3176. 


922 


96 


1468 


AAB94336 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14836. 


694 


77 


1468 


AAB97250 


Homo sapiens 


HOMO RNA cyclase 41 protein. 


694 


77 


1468 


gil2654401 


Homo sapiens 


Similar to RNA cyclase homolog 


694 


77 


1469 


ABB05645 


Homo sapiens 


Human thyroglobulin 38 protein SEQ 
ID NO:2. 


273 


98 


1469 


AAM89857 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:17450. 


81 


43 


1470 


AAO04234 


Homo sapiens 


Human polypeptide SEQ ID NO 
18126. 


185 


76 


1470 


AAM92673 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 2022. 


184 


97 


1470 


gi65265 


Xenopus laevis 


a xenopus upstream binding factor 


130 


30 


1471 


ABB 12219 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:2589. 


182 


97 


1471 


ABG60080 


Homo sapiens 


Human DITHP polypeptide #138. 


153 


82 


1471 


ABB90307 


Homo sapiens 


Human polypeptide SEQ ID NO 
2683. 


148 


96 


1472 


AAO 12550 


Homo sapiens 


Human polypeptide SEQ ID NO 
26442. 


ADD 


oz 


1472 


AAG03600 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7681. 


245 


86 


1472 


gi7770239 


Homo sapiens 


PR02831 


136 


71 


1473 


AAW49698 


Homo sapiens 


Human Notch3 protein. 


99 


55 


1473 


gi3065951 


Homo sapiens 


Notch3 


99 


55 


1473 


AAR67242 


Homo sapiens 


Huaman glial cell growth factor 


96 


42 
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S 

score 
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Identity 








heregulin segment E. 






1474 


gi22002433 


Homo sapiens 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


323 


54 


1474 . 


gi2 1979456 


Homo sapiens 


raptor 






1474 


gi22002435 


Mus musculus 


pi 50 target of rapamycin (TOR)- 
scaffold protein containing WD- 
repeats 


318 


53 


1475 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2U bJbv il> 

SSKJ.L JO. 


289 


96 


1476 


gi7io0y 15 


— — ; 

Homo sapiens 


VNN3 protein 


2254 


87 


1476 


AAM40410 


Homo sapiens 


Human polypeptide SEQ ID NO 


2249 


87 


1 A 

147o 


gioiuzyyo 


— — 

Mus musculus 


V aHIII J 


1857 


71 


1477 


gi 16877231 


Homo sapiens 


Similar to RIKEN cDNA 

970001QFI07 cre*rt/> 

z/uuuiyuvj/ gene 


997 


100 


1478 


gi340201 


Homo sapiens 


voltage-dependent anion channel 


479 


74 


1478 




Homo sapiens 


voltage- dependent anion channel 2 


470 


74 


1478 


gi511426l 


Homo sapiens 


voltage-dependent anion channel 
isoform 2 


479 


74 


1479 


ABB085 1 1 


Homo sapiens 


Human protein r riz- 1 d . 


1 0? 


4S 


1479 


A A V*C\A /TOO 

AAB94622 


Homo sapiens 


riuman proiem sequence dev^ iu 

IN KJ. 13*+ /O. 


i J i 


69 


1479 


ABJU3 /41 


— _ . 

Homo sapiens 




128 


37 


1480 


AAB93446 


Homo sapiens 


Human protein sequence SEQ ID 

]\TfV 1 ?^Q0 
1NU. IZOi7l/. 


204 


100 


1480 


AAM80227 


Homo sapiens 


Human protein SEQ ID NO 3873. 


204 


100 


1480 


AAA >TT/^0 /I 1 

AAM79243 


Homo sapiens 


riuman protein oxiv,/ iu in^ x^ud. 




100 
1 w 


1482 


AAW42095 


Homo sapiens 


riuman rvao protein /v ^iijxadaj. 




80 


1482 


gi 1457954 


Homo sapiens 


Rab22b 


192 


80 


1482 


gil 01 79679 


Homo sapiens 


small vjirase KA.r>zzt> 


1 09 


OV/ 


1483 


ABG62142 


Homo sapiens 


Human prostate specific polypeptide 
#75. 


170 


65 


1483 


AAE24060 


Homo sapiens 


Human prostate specific protein 
(PSP)#3. 


1 70 

X /u 


oo 


1483 


gi7959778 


Homo sapiens 


THT> f~\ 1 Z.AC 

PK01j4o 


XOj 




1484 


AAU69407 


Homo sapiens 


Lung small cell carcinoma antigen 

ffl. 


J 50 


OJ 


1484 


ABB03602 


Homo sapiens 


Human musculoskeletal system 
related polypeptide o&y^i xu in 
1549. 


215 


69 


1484 


gi6979921 


Drosophila 
melanogaster 


Xviiovj x x^ase 


X Ji* 




1485 


A A T THrtAn O 

AAU99098 


Homo sapiens 


Human glycosyl transferase 47169. 




fn 

D / 


1485 


gil8676612 


Homo sapiens 


FLJ00205 protein 


391 


67 


14cO 


mi at <;oa<;o 

gll^lDU^DU 


XV allUS 

norvegicus 


KJxJx -Wa.llNy-VC'.pUiypcpLlUw IN - 

acetylgalactosaminyltransferase T9 


343 


60 


1486 


AAO 11830 


Homo sapiens 


Human polypeptide SEQ ID NO 
25722. 


283 


82 


1486 


AA012132 


Homo sapiens 


Human polypeptide SEQ ID NO 
26024. 


280 


81 


1487 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:121. 


1249 


85 
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1488 


AAY27572 


Homo sapiens 


Human secreted protein encoded by 
gene No. 6. 


ion 




1488 


gil3560707 


Homo sapiens 


group XIII secreted phospholipase 
A2 


ICQ 


D\) 


1488 


gi 12698926 


Mus musculus 


group XIII secreted phospholipase 
A2 


jll 




1489 


gi2995442 


Homo sapiens 


UDPGal:GIcNAc bl,4 
galactosyltran sferase 


1 QOA 

i.yzu 


on 
yu 


1489 


gi4520136 


Homo sapiens 


beta~l,4-galactosyltransferase II 


1 GO ft 


on 
yu 


1489 


gi3 132896 


Homo sapiens 


beta- 1 ,4-galactosyltransferase 


1877 


88 


1490 


AAOH284 


Homo sapiens 


Human polypeptide SEQ ID NO 
25176. 


1 tic 

165 


69 


1490 


AAO06074 


Homo sapiens 


Human polypeptide SEQ ID NO 
19966. 


1 CA 

lo4 


oi 


1490 


AAO02002 


Homo sapiens 


Human polypeptide SEQ ID NO 
15894. 


1 en 


o4 


1491 


AAB93371 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12521. 


a on 

487 


no 

98 


1491 


AAM41334 


Homo sapiens 


T-r "I j ♦ J r-i T->/""V TF\ \TA 

Human polypeptide SEQ ID NO 
6265. 


a on 

4o / 


no 

y© 


1491 


AAM41333 


Homo sapiens 


Human polypeptide SEQ ID NO 
6264. 


A on 

4o7 


no 


1492 


AAE 19435 


Homo sapiens 


Human 27803 (a member of human 
adenylate kinase family). 


341 




1492 


AAU29224 


Homo sapiens 


Human PRO polypeptide sequence 
#201. 


1 C\n 

19/ 




1493 


gi21410587 


Homo sapiens 


similar to RIKEN cDN A 
2310041H06 


Q1 O 

o lo 


1 


1493 


gi6855513 


Gallus gallus 


syndesmos 




oU 


1493 


gil8034388 


Mus musculus 


syndesmos 


537 


58 


1494 


gi20987486 


Homo sapiens 


similar to B cell phosphoinositide 3- 
kinase adaptor 


805 


92 


1494 


gil2082725 


Mus musculus 


B cell phosphoinositide 3-kinase 
adaptor 


331 


62 


1494 


gil2082811 


Gallus gallus 


B cell phosphoinositide 3-kinase 
adaptor 


205 


48 


1495 


AAB64482 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 13 SEQ ID 
NO: 120. 


273 


100 


1495 


AAB65921 


Homo sapiens 


TT j J * * OT?/~\ TT"\ "NT f~\ . 

Human secreted protem SEQ ID NO. 
61. 


Lid 


1 nn 


1495 


AAB95728 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18606. 


273 


100 


1496 


gi56463 


Rattus 
norvegicus 


gp210 (AA 1-1886) 


*71 11 
/ 1 /Z 


/Z 


1496 


gi6650678 


Mus musculus 


■ ; 

nuclear pore membrane glycoprotein 

JPOM21U 


'711/1 
/ 1 14 


1 1 


1 AQ& 

i^fyo 


giioo /ojju 


nomo s*ipicxio 


FT TOO 1 72 nrotein 


5038 


95 


1497 


AAY40432 


Homo sapiens 


Amino acid sequence of vk65.15, a 
human VK gene fragment. 


607 


100 


1497 


AAW62185 


Homo sapiens 


Human DNA vkappa65.15 fragment. 


607 


100 


1497 


AAW41147 


Homo sapiens 


Human vkappa65.15 fragment. 


607 


100 


1498 


gi431857 


Homo sapiens 


delta 4-3-oxosteroid 5 beta-reductase 


460 


76 


1498 


gil 1640835 


Homo sapiens 


5-beta steroid reductase 


460 


76 


1498 


gi5689216 


Oryctolagus 


delta4-3-oxosteroid 5beta-reductase 


442 


70 
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SEQ 
ID 


Hit ID 
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S 

score 


% 

Identity 






cuniculus 








1500 


gi2 1748478 


Homo sapiens 


FLJ00264 protein 


3399 


99 


1500 


AAB93159 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12081. 


2209 


64 


1500 


AAB58796 


Homo sapiens 


Breast and ovarian cancer associated 

t ■ , • flirt /a tt\ 

antigen protein sequence SEQ ID 
504. 


1456 


64 


1502 


AAB58202 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 540. 


427 


100 


1502 


AAU79340 


Homo sapiens 


Human caveolin-1 (Cav-1). 


345 


100 


1502 


AAE22092 


Homo sapiens 


Human caveolin-1 protein. 


345 


100 


1503 


gi3 115996 


Homo sapiens 


dJ79C4.1.2 (Homeobox protein 
PMX-1 (PHOX1) isoform 2) 


410 


97 


1503 


gi3 115995 


Homo sapiens 


dJ79C4.1.1 (Homeobox protein 
PMX-1 (PHOX1) isoform 1) 


410 


97 


1503 


gi460125 


Mus musculus 


homeobox protein 


405 


96 


1504 


gi8 163762 


Homo sapiens 


membrane cofactor protein CD46 
variant 


639 


83 


1504 


AAG75528 


Homo sapiens 


Human colon cancer antigen protein 
SEQ IDNO.6292. 


635 


82 


1504 


AAB58394 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 732. 


635 


82 


1505 


AAO21802 


Homo sapiens 


Lung-specific amino acid sequence 
SEQ ID No 113. 


301 


61 


1505 


AAO07848 


Homo sapiens 


Human polypeptide SEQ ID NO 
21740. 


161 


89 


1505 


AAO09394 


Homo sapiens 


Human polypeptide SEQ ID NO 
23286. 


115 


51 


1506 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 

"XT/™\ 1 /Ton i 

NO: 16231. 


214 


65 


1506 


gi 1335205 


Homo sapiens 


ORFII 


209 


69 


1507 


ABB445 66 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 23. 


1058 


100 


1507 


gil 1414896 


Homo sapiens 


nucleoporin 


1058 


100 


1507 


gil 184173 


Homo sapiens 


nucleoporin 98 


1058 


100 


1508 


AAM79518 


Homo sapiens 


Human protem SEQ ID NO 3164. 


1062 


96 


1508 


AAM78534 


Homo sapiens 


Human protein SEQ ID NO 1 196. 


1062 


96 


1508 


ABB11951 


Homo sapiens 


Human K/Cl cotransporter 
homologue, SEQ ID NO:2321. 


1062 


96 


1509 


AAB21033 


Homo sapiens 


Human nucleic acid-binding protein, 
NuABP-37. 


293 


81 


1509 


gi6691968 


Homo sapiens 


dJ148M19.1 (zinc finger protein) 


293 


81 


1509 


gi4096339 


Homo sapiens 


zinc ringer protein 


293 


81 


1511 


gi8809808 


Mus musculus 


KRAB zinc ringer protein 


329 


39 


1511 


AAM40475 


Homo sapiens 


Human polypeptide SEQ ID NO 
5406. 


322 


79 


1511 


AAM38689 


Homo sapiens 


Human polypeptide SEQ ID NO 

1 Q1A 


322 


79 


1512 


gil 5029737 


Mus musculus 


complement component 2 (within H- 
2S) 


89 


26 


1512 


gi3 986766 


Mus musculus 


C2 


89 


26 


1512 


gil 92435 


Mus musculus 


complement component C2 


86 


26 


1513 


AAB60489 


Homo sapiens 


Human cell cycle and proliferation 
protein CCYPR-37, SEQ ID NO:37. 


1039 


100 


1513 


AAB38417 


Homo sapiens 


Fragment of human secreted protein 


1039 


100 
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S 

score 


% 
Identity 








encoded by gene 5 clone HCGMF16. 






1513 


gi4337460 


Homo sapiens 


neuroblastoma-amplified protein 


1039 


100 


1514 


gil621611 


Homo sapiens 


TRAP family member-associated 
NF-kB activator TANK 


2026 


87 


1514 


AAY88237 


Homo sapiens 


Human I-TRAF protein. 


1996 


86 


1514 


AAW27163 


Homo sapiens 


Human TRAF inhibitor protein I- 
TRAF. 


1996 


86 


1515 


ABP47963 


Homo sapiens 


Human polypeptide SEQ ID NO 393. 


239 


100 


1515 


AAG67216 


Homo sapiens 


Amino acid sequence of human 
Parkin- Associated Protein 1 (PAP1). 


239 


100 


1515 


gil 8490728 


Mus musculus 


synaptotagmin-like 3 


142 


60 


1516 


AAB93432 


Homo sapiens 


Human protein sequence SEQ ID 
NO:1266L 


1322 


86 


1516 


AAW88438 


Homo sapiens 


Disease associated protein kinase 
DAPK-7. 


1322 


86 


1516 


gi5931569 


Homo sapiens 


H91620p 


1322 


86 


1517 


gi5931821 


Homo sapiens 


dJ228H13.3 (zinc finger protein) 


2360 


100 


1517 


ABP51457 


Homo sapiens 


Human MDDT SEQ ID NO 479. 


1497 


100 


1517 


ABP51362 


Homo sapiens 


Human MDDT SEQ ID NO 384. 


1497 


100 


1518 


AAU80035 


Homo sapiens 


Beta 2 microglobulin (beta2M)/HFE 
monochain. 


602 


100 


1518 


ABP43154 


Homo sapiens 


Human ovarian antigen HWCG93, 
SEQ ID NO:4286. 


602 


100 


1518 


AAY44412 


Homo sapiens 


Wild type human beta-2 
microglobulin. 


602 


100 


1520 


AAO07974 


Homo sapiens 


Human polypeptide SEQ ID NO 
21866. 


158 


68 


1520 


AAO07413 


Homo sapiens 


Human polypeptide SEQ ID NO 
21305. 


149 


65 


1520 


AAO05994 


Homo sapiens 


Human polypeptide SEQ ID NO 
19886. 


121 


56 


1521 


gil 13861 13 


Homo sapiens 


FKSG25 


2457 


94 


1521 


gil3548673 


Homo sapiens 


SCOT-t 


2451 


93 


1521 


gi20988313 


Homo sapiens 


3-oxoacid CoA transferase 2 


2448 


94 


1522 


AAM42400 


Homo sapiens 


Human polypeptide SEQ ID NO 133. 


215 


56 


1522 


AAM92851 


Homo sapiens 


Human digestive system antigen 
SEQ ID NO: 2200. 


215 


56 


1522 


gil7862572 


Drosophila 
melanogaster 


LD38503p 


179 


31 


1523 


gi3419880 


Homo sapiens 


MDC/ADAM1 1 


3883 


91 


1523 


AAR75352 


Homo sapiens 


Human fetal brain MDC protein. 


3878 


91 


1523 


gi5736634 


Mus musculus 


AD AM 11 


3658 


87 


1524 


gil 109782 


Homo sapiens 


protein-tyrosine phosphatase 


2517 


98 


1524 


gil781037 


Mus musculus 


neuronal tyrosine threonine 
phosphatase 1 


2083 


79 


1524 


AAM25744 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 1259. 


987 


45 


1525 


AAG75396 


Homo sapiens 


Human colon cancer antigen protein 
SEQIDNO:6160. 


442 


100 


1525 


AAB53529 


Homo sapiens 


Human colon cancer antigen protein 
sequence SEQ ID NO: 1069. 


442 


100 


1525 


AAR84194 


Homo sapiens 


Human A2b adenosine receptor. 


442 


100 


1526 


ABB44587 


Homo sapiens 


Human wound healing related 
polypeptide SEQ ID NO 44. 


336 


75 


1526 


gil 84448 


Homo sapiens 


transcription factor 


336 


75 
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S 
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% 
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1526 


gi 183930 


Homo sapiens 


helix-loop-helix protein 


336 


75 


1527 


AAB43940 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 13 85. 


1241 


78 


1527 


gi2 198557 


Homo sapiens 


spermidine aminopropyltransferase 


1190 


78 


1527 


gil4602778 


Homo sapiens 


spermine synthase 


1190 


78 


1528 


ABP41860 


Homo sapiens 


Human ovarian antigen HTPH072, 
SEQ IDNO:2992. 


1189 


62 


1528 


AAW21949 


Homo sapiens 


E6-binding protein E6-BPSD7. 


1189 


62 


1528 


gi469885 


Homo sapiens 


EF-hand protein 


1189 


62 


1529 


AAU83711 


Homo sapiens 


Human PRO protein, Seq ID No 240. 


606 


100 


1529 


AAB84327 


Homo sapiens 


Amino acid sequence of a human 
lyase and associated protein HLYAP- 
2. 


606 


100 


1529 


AAU12301 


Homo sapiens 


Human PRO6079 polypeptide 
sequence. 


606 


100 


1530 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:121. 


3094 


92 


1530 


AAB38012 


Homo sapiens 


Human secreted protein encoded by 
gene 3 clone HNHCT15. 


3094 


92 


1530 


ABJ03747 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 189. 


3085 


91 


1531 


gil3477353 


Homo sapiens 


Similar to rabphilin 3A-like (without 
C2 domains) 


428 


51 


1531 


ABB06134 


Homo sapiens 


Human NS protein sequence SEQ ID 
NO:226. 


427 


51 


1531 


gi5596433 


Homo sapiens 


candidate tumor suppressor protein 
NOC2 


330 


87 


1532 


AAU81224 


Homo sapiens 


Human lung cancer protein, Seq ID 
No 57. 


2576 


86 


1532 


gil531645 


Rattus 
norvegicus 


C2-HC type zinc finger protein r- 
MyT3 


1913 


66 


1532 


gi2914751 


Rattus 
norvegicus 


neural zinc finger factor 3; NZF-3 


1908 


66 


1533 


AAE23388 


Homo sapiens 


Human intracellular signalling 
(INTSIG- 15) protein. 


643 


39 


1533 


ABB78745 


Homo sapiens 


Human Tigger 1 transposase protein 
sequence. 


438 


71 


1533 


AAO01214 


Homo sapiens 


Human polypeptide SEQ ID NO 
15106. 


366 


65 


1534 


AAU27733 


Homo sapiens 


Human full-length polypeptide 
sequence #58. 


3947 


90 


1534 


AAM78731 


Homo sapiens 


Human protein SEQ ID NO 1393. 


3943 


90 


1534 


AAB95860 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18924. 


3934 


89 


1535 


gil5928572 


Mus musculus 


Similar to leucine rich repeat (in 
FLU) interacting protein 2 


103 


66 


1535 


gil7389307 


Homo sapiens 


Similar to leucine rich repeat (in 
FLU) interacting protein 2 


97 


63 


1535 


gi5257201 


Homo sapiens 


LRR FLI-I interacting protein 2 


97 


63 


1536 


AAG02639 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6720. 


160 


71 


1536 


AAG02753 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6834. 


141 


60 


1536 


gi7959778 


Homo sapiens 


PR01546 


140 


60 


1537 


ABG47195 


Homo sapiens 


Human peptide encoded by genome- 


507 


100 
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ID 
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S 

score 


% 
Identity 








derived single exon probe SEQ ID 
36860. 






1537 


AAM38436 


Homo sapiens 


Peptide #12473 encoded by probe for 
measuring placental gene expression. 


507 


100 


1537 


AAM22055 


Homo sapiens 


Peptide #8489 encoded by probe for 
measuring cervical gene expression. 


507 


i r\f\ 

100 


1538 


gi3 133291 


Homo sapiens 


mitogen activated protein kinase 
activated protein kinase 


853 


85 


1538 


gil75 12453 


Mus musculus 


MAP kinase-activated protein kinase 
5 


850 


84 


1538 


gi2911813 


Mus musculus 


mitogen-activated protein kinase- 
activated protein kinase 


850 


84 


1539 


ABG38678 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
28343. 


Tin 

339 


i r\f\ 


1539 


AAM04612 


Homo sapiens 


Peptide #3294 encoded by probe for 
measuring breast gene expression. 


339 


100 


1539 


AAM29402 


Homo sapiens 


Peptide #3439 encoded by probe for 
measuring placental gene expression. 


Tin 
339 


1 HA 


1540 


gil3 162677 


Homo sapiens 


GLUT4 enhancer factor 


1499 


65 


1540 


gil4627122 


Homo sapiens 


dJ583Pl 5.5.1 (Glut4 enhancer factor 
(isoform 3) ) 


1466 


65 


1540 


AAB58934 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
642. 


1395 


99 


1541 


ABB90747 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 226. 


1606 


98 


1541 


ABB50291 


Homo sapiens 


Collagen type III alpha- 1 ovarian 
tumour marker protein, SEQ ID 
NO: 72. 


1606 


98 


1541 


ABG60248 


Homo sapiens 


Human ovarian antigen #10. 


1606 


98 


1542 


AAE01436 


Homo sapiens 


Human gene 1 encoded secreted 
protein HWLFJ10, oEQ ID N 0:91. 


1622 




1542 


AAE01515 


Homo sapiens 


XT 1 1 1 . .- J _ Jf 

Human gene 1 encoded secreted 
protein fragment, bEQ ID NO. 1 12.. 




no 


1542 


AAE01464 


Homo sapiens 


Human gene 1 encoded secreted 
protein HWErJIO, bEQ ID JNU: 11 y. 


1618 


98 


1543 


gi 186043 


Homo sapiens 


immunoglobulin light chain variable 
region 


546 


89 


1543 


gi219886 


Homo sapiens 


Ig kappa light chain 


543 


89 


1543 


AAY96298 


Homo sapiens 


Human IGFAM-10 immunoglobulin. 


!>4U 


CO 


1544 


AAE19183 


Homo sapiens 


Human protease, PRTS-20 protein. 


3530 


100 


1544 


AAU74761 


Homo sapiens 


Human protease PRTS-21 protein 
sequence. 


2219 


o3 


1544 


AAO20514 


Homo sapiens 


Protein of APP related human 
nomologue nCP2Ql588. 


ZUd 1 


9o 


1545 


gi6693836 


Rattus 
norvegicus 


SNIP-b 


A A 1 1 

441 d 


HQ 

fo 


1545 


gi6693834 


Rattus 
norvegicus 


SNIP-a 


4410 


81 


1545 


gi3098418 


Mus musculus 


P140 


4130 


79 


1546 


gi 179433 


Homo sapiens 


biglycan 


2043 


95 


1546 


AAB85043 


Homo sapiens 


Human biglycan protein sequence. 


1894 


89 


1546 


gi6960459 


Homo sapiens 


biglycan 


1894 


89 


1547 


gi20987689 


Homo sapiens 


Similar to allantoicase 


1332 


64 
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1547 


gi9255889 


Mus musculus 


allantoicase 




Do 


1547 


gil47 18648 


Homo sapiens 


allantoicase 


1 (Y7 1 


A£ 

oo 


1548 


AAM79289 


Homo sapiens 


Human protein SEQ ID NO 195 1 . 


O AO 


/o 


1548 


AAG03927 


Homo sapiens 


Human secreted protein, SEQ ID 

\t/-v. o r\r\ o 

NO: 8008. 


Zlo 


/o 


1548 


AAB57110 


Homo sapiens 


Human prostate cancer antigen 
protein sequence a.bQ ID MO. looo. 


ZIL5 


HQ 
IO 


1549 


gil838956 


Bos taurus 


capping protein, beta3 isoform 


100 


1 c\c\ 


1549 


gi2239063 


Bos taurus 


actin-binding protein CP3 


156 


100 


1549 


gi595257 


Homo sapiens 


F-actin capping protein beta subunit 




1 C\C\ 

1UU 


1550 


gi3370998 


Homo sapiens 


B All -associated protein 1 


526 


91 


1550 


gi 15278 193 


Homo sapiens 


MAGI- 1C beta 


ZT 1 /C 

526 


91 


1550 


gil5278186 


Homo sapiens 


MAGI- 1 A 


526 


91 


1551 


ABG61604 


Homo sapiens 


Human DPRP-2 splice variant #3. 


2850 


95 


1551 


ABG61602 


Homo sapiens 


Human DPRP-2 splice variant #1. 


2850 


95 


1551 


ABG61592 


Homo sapiens 


Human DPPIV related serine 
protease DPRP-2. 


2850 


95 


1552 


AAG73867 


Homo sapiens 


Human colon cancer antigen protein 
SEQIDNO:4631. 


322 


36 


1552 


AAB58391 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 729. 


322 


36 


1552 


gi296667 


Homo sapiens 


ornithine decarboxylase (ODC) 


322 


36 


1553 


AAW96153 


Homo sapiens 


Human FADD-interacting protein 
(FIP). 


168 


100 


1553 


gil61 18555 


Homo sapiens 


ELMOl 


168 


100 


1553 


gi21594742 


Mus musculus 


Similar to engulfment and cell 
motility 1, ced-12 homolog (C. 
elegans) 


168 


100 


1554 


ABB89950 


Homo sapiens 


Human polypeptide SEQ ID NO 
2326. 


244 


81 


1554 


AAU86135 


Homo sapiens 


Human PR0274 polypeptide. 1 


244 


81 


1554 


AAB93352 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12476. 


244 


81 


1556 


AAM38652 


Homo sapiens 


Human polypeptide SEQ ID NO . 
1797. 


1838 ! 


85 


1556 


gi4530437 


Homo sapiens 


thyroid hormone receptor-associated 
protein complex component 
TRAP240 


1645 


78 


1556 


AAM40438 


Homo sapiens 


Human polypeptide SEQ ID NO 
5369. 


1611 


77 


1557 


AAB64943 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO: 121 . 


2327 


89 


1557 


AAB38012 


Homo sapiens 


Human secreted protein encoded by 
gene 3 clone HNHCT15. 


2322 


89 


1558 


gi 11066463 


Rattus 
norvegicus 


RhoGEF glutamate transport 
modulator GTRAP48 


3 /94 


11 


1558 


gil9387126 


Mus musculus 


guanine nucleotide exchange factor 


1051 


37 


1 ceo 


AAr>yU /43 


Homo sapiens 


xiuman v^w^-zu z protein sequence 
SEQ ID 186. 


1 040 




1559 


gil8033747 


Homo sapiens 


myosin IIIB 


1032 


66 


1559 


AAE24138 


Homo sapiens 


Human kinase (PKIN)-9 protein. 


964 


65 J 


1559 


AAU03552 


Homo sapiens 


Human protein kinase #52. 


964 


65 


1560 


AAM79228 


Homo sapiens 


Human protein SEQ ID NO 1 890. 


3488 


64 


1560 


AAM39300 


Homo sapiens 


Human polypeptide SEQ ID NO 
2445. 


3467 


64 
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1560 


AAM41086 


Homo sapiens 


Human polypeptide SEQ ID NO 
6017. 


345 o 


63 


1561 


AAB59019 


Homo sapiens 


Breast and ovarian cancer associated 
antigen protein sequence SEQ ID 
727. 


158 


47 


1561 


ABG35360 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
25U25. 


131 


6U 


1561 


AAM01340 


Homo sapiens 


Peptide #22 encoded by probe for 
measuring human breast gene 
expression. 


131 


60 


1562 


gi 1504002 


Homo sapiens 


similar to a human major CRK- 

^f^+Qi'n T^f^f^Tfl QC\ 

binding protein jjla^jvIou. 


ZD4y 


oZ 


1562 


gil3195147 




Mus museums 






70 


1562 


AAW03515 


Homo sapiens 


Human DOCK180 protein. 


1636 


53 


1563 


AAB52017 


Homo sapiens 


Human secreted protein sequence 
encoded by gene o oJtsQ ID JNU.oo. 


Zoo 


i ah 


1563 


AAM91576 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:19169. 


77 
1 1 


^7 


1564 


AAU10690 


Homo sapiens 


Human agouti polypeptide. 




Q7 

y i 


1564 


gi540073 


Homo sapiens 


agouti precursor 


362 


97 


1564 


gi8953446 


Homo sapiens 


dJ785G19.3 (agouti (mouse)- 
signaling protein) 


30Z 


y I 


1565 


AAM79853 


Homo sapiens 


Human protein SEQ ID NO 3499. 


Z3UZ 


oZ 


1565 


AAM78869 


Homo sapiens 


Human protein SEQ ID NO 153 1 . 


2302 


82 


1565 


gi6l9877 


Homo sapiens 


hydroxymethylglutaryl-CoA 
synthase 


ZiUZ 


oZ 


1566 


AAM41373 


Homo sapiens 


Human polypeptide SEQ ID NO 
6304. 


J5o 


1 df\ 


1566 


AAM39587 


Homo sapiens 


Human polypeptide SEQ ID NO 
2732. 


JJO 


lUU 


1566 


AAU12276 


Homo sapiens 


Human PRO6001 polypeptide 
sequence. 


1 ZA 

154 


A 1 


1567 


AAU27723 


Homo sapiens 


Human full-length polypeptide 
sequence #48. 


ZoZ 


on 
oU 


1567 


AAO102H 


Homo sapiens 


Human polypeptide SEQ ID NO 
24103. 


219 


84 


1567 


AAO01262 


Homo sapiens 


Human polypeptide SEQ ID NO 
15154. 


218 


74 


1568 


gil 1595428 


Homo sapiens 


dJ702J19.1 (glycine cleavage system 
protein H (aminomethyl carrier)) 


oOZ 




1568 


gi2 19671 


Homo sapiens 


hydrogen carrier protein precursor 


OJO 


01 


1568 


gil2653985 


Homo sapiens 


glycine cleavage system protein H 
(aminomethyl carrier) 


856 


91 


1569 


AA014211 


Homo sapiens 


Human transporter and ion channel 
TRICH-28. 


228 


59 


1569 


AAG77968 


Homo sapiens 


Human ion channel protein IC23949. 


217 


57 


1569 


A A X jTO A O C 1 

AAM3485 1 


Homo sapiens 


reptide tFoooo encoded by prooe tor 
measuring placental gene expression. 




1 00 


1570 


gi532505 


Homo sapiens 


bile acid CoA: Amino acid N- 
acyltransferase 


2049 


99 


1570 


gil 6306985 


Homo sapiens 


bile acid Coenzyme A: amino acid 
N-acyltransferase (glycine N- 
choloyltransferase) 


2049 


99 


1570 


gil5215152 


Mus musculus 


Similar to bile acid-Coenzyme A 


1410 


68 
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dehydrogenase: amino acid n- 
acyltransferase 






1571 


gi 15487674 


Homo sapiens 


OoBr-relatea protein l, UKri 




i nn 
iuu 


1571 


gil7529999 


Homo sapiens 


oxysterol-binding protein-like protein 
OSBPL1B 


204 


100 


1571 


AAG89290 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 410. 


198 


97 


1572 


AA017114 


Homo sapiens 


Human Gli3 protein SEQ ID NO: 21. 


1105 


62 


1572 


gi6102812 


Homo sapiens 


GLI3 protein 




/CO 

oz 


1572 


gil 83248 


Homo sapiens 


DNA-binding protein 


1105 


62 


1573 


g il 8447471 


Drosophila 
melanogaster 


RE41571p 


177 


47 


1573 


gi9965400 


murine 

herpesvirus 72 


membrane virion glycoprotein 150 


128 


30 


1573 


gil019435 


Trypanosoma 
cruzi 


mucin-like protein 


127 


36 


1574 


ABG38560 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
28225. 


Z5y 


i nn 


1574 


AAM04517 


Homo sapiens 


Peptide #3199 encoded by probe for 
measuring breast gene expression. 


239 


100 


1574 


AAM29283 


Homo sapiens 


Peptide #3320 encoded by probe tor 
measuring placental gene expression. 


Tin 

Z5y 


i 

IUU 


1575 


AAU83597 


Homo sapiens 


Human PRO protein, beq ID JNo lz. 


1 OO/I 


1UU 


1575 


AAB74709 


Homo sapiens 


Human membrane associated protein 
MEMAP-15. 


1 OO/l 


1 nn 


1575 


AAU96185 


Homo sapiens 


Human secreted protein, otsv^ 1JJ jno 
87. 


1 1 01 


an 

oU 


1576 


AAM79494 


Homo sapiens 


Human protein bEQ ID NO 3140. 


Z4oZ 


OU 


1576 


gil77338l 


Homo sapiens 


APXL 


2432 


86 


1576 


AAM78510 


Homo sapiens 


Human protein SEQ ID NO 1 172. 


2428 


86 


1577 


AAU07829 


Homo sapiens 


Human ARTS-1 polypeptide. 


3395 


76 


1577 


gi2l3l5078 


Homo sapiens 


type 1 tumor necrosis factor receptor 
shedding aminopeptidase regulator 


3395 


76 


1577 


gi638l989 


Homo sapiens 


adipocyte-derived leucine 
aminopeptidase 


3388 


76 


1578 


ABG35337 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
25002. 


150 


75 


1578 


AAM01319 


Homo sapiens 


Peptide #1 encoded by probe for 
measuring human breast gene 
expression. 


150 


75 


1578 


AAM25964 


Homo sapiens 


Peptide #1 encoded by probe for 
measuring placental gene expression. 


150 


75 


1579 


gi7657864 


Homo sapiens 


match to nuclear protein, NP220; 
note: sequence difference at residue 
58 


4034 


98 


1 ^"7Q 

1 j /y 




riomo sapiens 


precursor sequence. 


4023 


98 


1579 


gi!374698 


Homo sapiens 


nuclear protein, NP220 


4023 


98 


1580 


ABB09439 


Homo sapiens 


Serpin domain protein Zserpl5 
associated amino acid sequence. 


1172 


98 


1580 


AAW51933 


Homo sapiens 


Human protease nexin I type alpha. 


1172 


98 


1580 


AAW51934 


Homo sapiens 


Human protease nexin I type beta. 


1172 


98 


1581 


gi499184 


Felis catus 


neuronal protein 


305 


93 
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1581 


AAB95041 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16804. 


284 


73 


1581 


gil3960126 


Homo sapiens 


Similar to leucine-rich neuronal 
protein 


230 


75 


1582 


AAB80253 


Homo sapiens 


Human PR0293 protein. 


3181 


99 


1582 


AAY13385 


Homo sapiens 


Amino acid sequence of protein 
PR0293. 


3181 


99 


1582 


AAU83686 


Homo sapiens 


Human PRO protein, Seq ID No 190. 


1669 


55 


1583 


AAB73858 


Homo sapiens 


Human NPM/ALK fusion protein. 


268 


75 


1583 


AAB28184 


Homo sapiens 


Human Nucleophosmin. 


268 


75 


1584 


AAB93492 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12796. 


697 


100 


1584 


gi7302187 


Drosophila 
melanogaster 


CG7849-PA 


274 


40 


1584 


gi21626800 


Drosophila 
melanogaster 


CG7849-PB 


274 


40 


1585 


gi8572061 


Bombyx mori 


fibroin heavy chain Fib-H 


156 


34 


1585 


gi765323 


Bombyx mori 


silk fibroin heavy chain 


155 


39 


1585 


gil55999 


Bombyx mori 


silk fibroin 


147 


39 


1586 


gil2002682 


Homo sapiens 


FERM-containing protein 


2019 


83 


1586 


ABP37983 


Homo sapiens 


Human GS93382 protein. 


531 


99 


1586 


AAB12318 


Homo sapiens 


Human secreted protein encoded by 
gene 18 clone HE2FL70. 


189 


100 


1587 


ABP41887 


Homo sapiens 


Human ovarian antigen HKZBB48, 
SEQ IDNO:3019. 


1005 


90 


1587 


AAB95585 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18250. 


1005 


90 


1587 


AAY86208 


Homo sapiens 


Nuclear transport protein clone 
hfb030 protein sequence. 


1005 


90 


1590 


AAG02851 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 6932. 


226 


88 


1590 


AAB38280 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 20 SEQ ID 
NO:136. 


226 


88 


1591 


AAB95593 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 18273. 


2718 


82 


1591 


AAU27637 


Homo sapiens 


Human protein AFP39158. 


609 


75 


1591 


ABG45908 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
35573. 


311 


100 


1592 


ABB05746 


Homo sapiens 


Human GASC1 protein SEQ ID 
NO:l. 


780 


33 


1592 


gil0567164 


Homo sapiens 


gene amplified in squamous cell 
carcinoma- 1 


780 


33 


1592 


AAM39339 


Homo sapiens 


Human polypeptide SEQ ID NO 
2484. 


777 


33 


1593 


AAY69069 


Homo sapiens 


Amino acid sequence of a human 
reduced tropoelastin derivative. 


2887 


80 


1593 


gil 82020 


Homo sapiens 


elastin 


2887 


80 


1593 


AAO17360 


Homo sapiens 


Human elastin. 


2870 


79 


1595 


gi291854 


Homo sapiens 


aminopeptidase A 


1289 


91 


1595 


gil518865 


Sus scrofa 


aminopeptidase A 


1127 


80 


1595 


gi7673021 


Rattus 
norvegicus 


aminopeptidase A 


1103 


77 


1596 


AAU71952 


Homo sapiens 


Human bone marrow tissue 


356 


58 
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polypeptide #30. 






1597 


gi556769 


Homo sapiens 


inositol 1,4,5-triphophate 5- 
phosphatase 


D*\Z 


yz 


1597 


ABB77441 


Homo sapiens 


Human tumour marker protein Li9- 1 . 


HQ 


oi 
y i 


1597 


gi21262190 


Homo sapiens 


CTCL tumor antigen Li9-1 


339 


91 


1598 


ABB08405 


Homo sapiens 


Alpha 1,6-fucosyl transferase amino 
acid sequence. 


367 


1 AA 


1598 


AAW22125 


Homo sapiens 


Human alpha 1-6 fucosyltransferase. 


367 


100 


1598 


gi7638407 


Bos taurus 


6-alpha-L-fucosyltransferase 


367 


100 


1599 


AAM90773 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 18366. 


905 


AT 

97 


1599 


AAU74354 


Homo sapiens 


Human cytoskeleton-associated 
protein (CYSKP) #25. 


740 


43 


1599 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 
NO.12300. 


149 


46 


1600 


AAB48140 


Homo sapiens 


Human TANGO 209 variant I 
polypeptide. 


D 1 Z 


oU 


1600 


AAB48139 


Homo sapiens 


Human TANGO 209 variant 1 
polypeptide. 


372 


80 


1600 


AAB48107 


Homo sapiens 


Human TANGO 209 polypeptide. 


372 


80 


1601 


gi6941623 


Homo sapiens 


dJ469D22.1 (Rhesus blood group, 
CcEe antigens) 


268 


/o 


1601 


AAB03339 


Homo sapiens 


Human RhCe protein. 


252 


1 AA 
100 


1601 


AAB03338 


Homo sapiens 


Human RhCE protein. 


252 


1 AA 
100 


1602 


AAB94196 


Homo sapiens 


Human protein sequence SEQ ID 
NO:14530. 


784 


79 


1602 


AAB93449 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12696. 


784 


79 


1602 


AAB97178 


Homo sapiens 


Human EF-chiral calcium-binding 
protein 96. 


784 


79 


1603 


gil3096814 


Mus musculus 


RIKEN cDNA 4932442K08 gene 


165 


72 


1603 


gil 1493928 


Homo sapiens 


homeo domain-interacting protein 
kinase 2 


74 


OA 

20 


1604 


gil321596 


Homo sapiens 


immunoglobulin lambda-chain 
subgroup II 


378 


63 


1604 


AAG76014 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6778. 


362 


97 


1604 


gi2 16 19848 


Homo sapiens 


Similar to immunoglobulin lambda 
joining 3 


1 A A 

344 


no 

yz 


1605 


AAB95234 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17375. 


2336 


92 


1605 


AAB27239 


Homo sapiens 


Human EXMAD-17 SEQ ID NO: 17. 


2334 


99 


1605 


gil 9263740 


Mus musculus 


axotrophin 


1935 


79 


1606 


AAB92702 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 102. 


3250 


AO 

yz 


1606 


gi20451682 


Homo sapiens 


JEMMA protein 


3250 


92 


1606 


AAB93188 


Homo sapiens 


Human protein sequence SEQ ID 

\TA. 1 n 1 A(\ 


3088 


91 


1607 


AAB31693 


Homo sapiens 


Amino acid sequence of viral 
encoded semaphorin protein receptor. 


1014 


79 


1607 


AAB70131 


Homo sapiens 


Human VESPR. 


1014 


79 


1607 


AAY 13462 


Homo sapiens 


Viral-encoded semaphorin protein 
receptor (VESPR) polypeptide. 


1014 


79 


1608 


AAW29683 


Homo sapiens 


Human Na-K-2C1 cotransporter 
NKCC2. 


1295 


99 



J 
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1608 


gi 1373425 


Homo sapiens 


bumetamde-sensitive Na-K-2C1 
cotransporter 


1295 


99 


1608 


gi5 16001 


Oryctolagus 
cuniculus 


bumetanide-sensitive Na-K-Cl 
cotransport protein splice isoform F 


1244 


95 


1610 


AAB88388 


Homo sapiens 


Human membrane or secretory 
protein clone PSEC0131. 


923 


100 


1610 


gi208 10493 


Homo sapiens 


Similar to RIKEN cDNA 
2810417M05 gene 


920 


99 


1610 


AAE21272 


Homo sapiens 


Human gene 16 encoded secreted 
protein fragment, SEQ ID NO: 13 8. 


432 


100 


1611 


AAY53753 


Homo sapiens 


Amino acid sequence of the MMSC2 
protein. 


269 


98 


1611 


AAY04732 


Homo sapiens 


Protein containing PDZ domain from 
clone 3 8-2- la. 


269 


no i 

y© 


1611 


AAY04734 


Homo sapiens 


Protein containing PDZ domain from 
clone 38-2-lc. 


269 


no 
yo 


1612 


AAR85912 


Homo sapiens 


Oncostatin M receptor-beta subunit. 


1 1 o O 

1 138 


92 


1612 


gi 17942 11 


Homo sapiens 


oncostatin-M specific receptor beta 
subunit 


1138 


92 


1612 


gil5012082 


Homo sapiens 


Similar to oncostatin M receptor 


113o 


yz 


1613 


ABP42361 


Homo sapiens 


Human ovarian antigen HNOKG34, 
SEQ ID NO:3493. 


406 


82 


1613 


AAB48966 


Homo sapiens 


Human heterogeneous nuclear 
ribonucleoprotein Al (hnRNP Al). 


406 


82 


1613 


AAW55828 


Homo sapiens 


Human heterogeneous nuclear 
ribonucleoprotein core protein Al . 


406 


82 


1614 


ABB07649 


Homo sapiens 


Human LOR-1 protein. 


898 


99 


1614 


AAB00077 


Homo sapiens 


Human lysyl oxidase related protein 
(Lor). 


898 


99 


1614 


gil890108 


Homo sapiens 


lysyl oxidase-related protein 


898 


99 


1615 


gil2803157 


Homo sapiens 


COX 15 (yeast) homolog, cytochrome 
c oxidase assembly protein 


163 


48 


1615 


gi3603230 


Homo sapiens 


cytochrome oxidase assembly factor 


163 


A O 

48 


1615 


gi76 19702 


Homo sapiens 


bA483Fl 1.2.1 (COX15 (yeast) 
homolog, cytochrome c oxidase 
assembly protein (isoform 1)) 


163 


48 


1616 


gi7672979 ! 


Homo sapiens 


glucosidase II beta subunit 


2553 


O A 

84 


1616 


ABB50266 


Homo sapiens 


Protein kinase C substrate 80K-H 
ovarian tumour marker protein, #20. 


2536 


84 


1616 


gil293640 


Homo sapiens 


protein kinase C substrate 80K-H 


2536 


84 


1617 


AAY07076 


Homo sapiens 


Renal cancer associated antigen 
precursor sequence. 


355 


100 


1617 


gil022888 


Oryctolagus 
cuniculus 


protein phosphatase 2 AO B* subunit 
beta4 isoform 


355 


100 


1617 


gil022886 


Oryctolagus 
cuniculus 


protein phosphatase 2A0 B' 
regulatory subunit beta3 isoform 


355 


100 


1618 


gil4010930 


Haemophilus 
influenzae Rd; 
similar to 
AAC22226.1 
(PID:gl 573555) 
[Homo sapiens] 


similar to transcription accessory 
protein (tex) 


1868 


100 


1618 


AAB92758 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11220. 


1863 


99 


1618 


ABB89105 


Homo sapiens 


Human polypeptide SEQ ID NO 


1231 


95 



WO 03/054152 PCT/US02/39555 



Table 2B 
287 



SEQ 
ID 


Hit ID 


Species 


Description 


e 

score 


/o 

Identity 








1 A Q 1 

14ol. 






1619 


AAW64262 


Homo sapiens 


Human neutrophil elastase. 


628 


100 


1619 


AAP80335 


Homo sapiens 


Sequence of serine protease (SP) of 
human myeloid cellorigin and leader 
peptide. 


628 


100 


1619 


gi296665 


Homo sapiens 


serine protease 




1 nn 


1620 


gi9965989 


Homo sapiens 


calcineurin A catalytic subunit 
gamma isoform 


1648 


97 


1620 


gi258001 


Homo sapiens 


calcineurin A catalytic subunit; 

1 11* 1 1 i „ J_ * 

calmoduhn-dependent protein 
phosphatase catalytic subunit; CaM- 
PrP catalytic subunit 


1637 


96 


1620 


gi200466 


Mus musculus 


phosphoprotein phosphatase 


14/1 


QK 
OJ 


1621 


gi!778538 


Escherichia coli 


anaerobic carrier for c4, 
dicarboxylates 


739 


100 


1621 


gil786839 


Escherichia coli 
K12 


transport of dicarboxylates 


739 


100 


1621 


gil25 13520 


Escherichia coli 

0157-.H7 

EDL933 


transport of dicarboxylates 


739 


100 


1622 


AAE15635 


Homo sapiens 


Human G-protein coupled receptor-5 
(GCREC-5) protein. 


271 


QA 


1622 


AAB66272 


Homo sapiens 


Human TANGO 378 SEQ ID NO: 
29. 


271 


y4 


1622 


AAB82487 


Homo sapiens 


Human secretin-like receptor Zgprl 
splice variant. 


271 


94 


1623 


gi 1787320 


Escherichia coli 
K12 


homolog of Salmonella P-ring of 
flagella basal body 


749 


87 


1623 


gi4062658 


Escherichia coli 


Flagellar basal body P-ring protein 
precursor 




QH 
O / 


1623 


gil25 14621 


Escherichia coli 

0157:H7 

EDL933 


homolog of Salmonella P-ring of 
flagella basal body 


748 


86 


1624 


gil3360031 


Escherichia coli 
0157:H7 


allantoinase 




AO 


1624 


gi2735238 


Escherichia coli 


GlxB3 


A 01 


/V 


1624 


gil773192 


Escherichia coli 


similar to S. cerevisiae dall 


491 


70 


1625 


gi606368 


Escherichia coli 


aspartate semialdehyde 
dehydrogenase 


n /no 

1478 


93 


1625 


gi3859587 


Shigella sonnei 


aspartate semialdehyde 
dehydrogenase 


1478 


93 


1625 


gil789841 


Escherichia coli 
K12 


aspartate-semialdehyde 
dehydrogenase 


1478 


93 


1626 


gil788006 


Escherichia coli 
K12 


phenylalanine tRNA synthetase, 
beta-subunit 


735 


94 


1626 


gi 125 15724 


Escherichia coli 
0157:H7 

T7F*T Gil 


phenylalanine tRNA synthetase, 
beta-subunit 


735 


94 


1626 


gil742806 


Escherichia coli 


Phenylalanine—tRNA ligase (EC 
6. LI. 20) b chain 


735 


94 


1627 


AAB49502 


Homo sapiens 


Clone HYASC03. 


310 


98 


1627 


gi22 137373 


Mus musculus 


similar to RIKEN cDNA 
28 1005 1A14 gene 


240 


95 


1627 


gi20071228 


Mus musculus 


RIKEN cDNA 2810051A14 gene 


151 


51 


1628 


gi!7431382 


Ralstonia 


PROBABLE TRANSCRIPTION 


357 


37 
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solanacearum 


REGULATOR PROTEIN 






1628 


gi2 11 14950 


Xanthomonas 
campestris pv. 
campestris str. 
ATCC 33913 


transcriptional regulator 


331 


38 


1629 


gi3 11422 


Escherichia coli 


ORF-2 


282 


100 


1630 


gi732874 


Neisseria 
meningitidis 


IgAl protease 


101 


33 


1630 


gi600255 


Gallus gallus 


caldesmon 


99 


23 


1631 


gil788756 


Escherichia coli 
K12 


PEP-protein phosphotransferase 
system enzyme I 


785 


96 


1631 


gil2516792 


Escherichia coli 

0157:H7 

EDL933 


PEP-protein phosphotransferase 
system enzyme I 


785 


96 


1631 


gil799835 


Escherichia coli 


PHOSPHOENOLPYRUVATE- 
PROTEIN 

PHOSPHOTRANSFERASE (EC 
2.7.3.9) 

(PHOSPHOTRANSFERASE 
SYSTEM, ENZYME I). 


785 


96 


1632 


gi887820 


Escherichia coli 


UUG start; possible frameshift at 
end? 


810 


72 


1632 


gi466651 


Escherichia coli 


No definition line found 


343 


82 


1632 


gi2 1957797 


Yersinia pestis 
KIM 


acridine efflux pump 


250 


56 


1633 


AAB92950 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11629. 


290 


100 


1633 


gi2 1627 153 


Drosophila 
melanogaster 


CG8155-PA 


82 


39 


1633 


gi3850257 


Coltivirus JKT- 
7043 


Vp9 


78 


40 


1634 


gil2580933 


Homo sapiens 


dJ505P2.1.1 (ribonuclease 6 
precursor) 


442 


62 


1634 


AAY21852 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 2652271). 


437 


61 


1634 


AAW75103 


Homo sapiens 


Human secreted protein encoded by 
gene 47 clone HMCBP63. 


437 


61 


1635 


gi 18 146756 


Homo sapiens 


limkain beta 2 


1620 


100 


1635 


AAY86509 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:424. 


696 


100 


1635 


AAY86510 


Homo sapiens 


Human gene 70-encoded protein 
fragment, SEQ ID NO:425. 


436 


100 


1636 


gi8096340 


Homo sapiens 


RERE 


513 


71 


1636 


gi4689163 


Caenorhabditis 
elegans 


EGL-27 


83 


23 


1636 


gi21913127 


Caenorhabditis 
elegans 


C. elegans EGL-27 protein 
(corresponding sequence C04A2.3b) 


83 


23 


1637 


AAU69417 


Homo sapiens 


Lung small cell carcinoma antigen 
#11. 


514 


94 


1637 


AAB85481 


Homo sapiens 


Human 23553 sulfatase polypeptide. 


514 


94 


1637 


AAM25714 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 1229. 


514 


94 


1638 


AAM43540 


Homo sapiens 


Human polypeptide SEQ ID NO 218. 


741 


100 


1638 


AAM38682 


Homo sapiens 


Human polypeptide SEQ ID NO 
1827. 


741 


100 
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ID 
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Description 


S 

score 


% 
Identity 


1638 


AAY42750 


Homo sapiens 


Human calcium binding protein 1 
(CaBP-1). 


741 


100 


1639 


AAB94175 


Homo sapiens 


Human protein sequence SEQ ID 
NO-.14484. 


908 


86 


1639 


AAM40361 


Homo sapiens 


Human polypeptide SEQ ID NO 
3506. 


908 


86 


1639 


gi 152 15085 


Mus musculus 


Similar to COP9 (constitutive 
photomorphogenic), subunit 7b 
(Arabidopsis) 


900 


85 


1640 


gil529 


Oryctolagus 
cuniculus 


calcium channel BI-2 


95 


30 


1640 


gi2653364 


Bovine 

herpesvirus type 
LI 


immediate-early transactivator 
protein (cell nucleus) 


91 


25 


1640 


gi291536 


Bovine 
herpesvirus 1 


BICP4 


91 


25 


1641 


AAB26105 


Homo sapiens 


Human DAN/Cerberus-related 
protein 6 (hDCR6) #1. 


2012 


100 


1641 


AAE17089 


Homo sapiens 


Human osteolevin protein. 


758 


100 


1641 


ABB07209 


Homo sapiens 


Human cloaked-2 polypeptide 
sequence. 


758 


100 


1642 


AAU29026 


Homo sapiens 


Human PRO polypeptide sequence 
#3. 


279 


94 


1642 


AAY99458 


Homo sapiens 


Human PRO 193 amino acid 
sequence SEQ ID NO:410. 


279 


94 


1642 


AAY36125 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 510. 


279 


94 


1643 


ABG60166 


Homo sapiens 


Human DITHP polypeptide #224. 


426 


76 


1643 


AAG76121 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6885. 


426 


76 


1643 


AAM38668 


Homo sapiens 


Human polypeptide SEQ ID NO 
1813. 


426 


76 


1644 


AAU84339 


Homo sapiens 


Protein HPV16ElBind differentially 
expressed in breast cancer tissue. 


514 


82 


1644 


gi2232019 


Homo sapiens 


HPV16 El protein binding protein 


514 


82 


1644 


gil2653271 


Homo sapiens 


thyroid hormone receptor interactor 
13 


514 


82 


1645 


AAU74752 


Homo sapiens 


Human protease PRTS-12 protein 
sequence. 


3063 


99 


1645 


ABG34074 


Homo sapiens 


Human Pro peptide #45. 


2954 


98 


1645 


AAB50936 


Homo sapiens 


ADAM protein #2. 


2814 


100 


1646 


AAO20516 


Homo sapiens 


Protein of APP related human 
homologue hCP50592. 


273 


98 


1646 


gil232077 


Homo sapiens 


huMCM2 


273 


98 


1646 


gil3544066 


Homo sapiens 


Similar to mini chromosome 
maintenance deficient 2 (S. 
cerevisiae) 


273 


98 


1647 


AAG75416 


Homo sapiens 


Human colon cancer antigen protein 
SEQIDNO:6180. 


655 


74 


1647 


AAB57016 


Homo sapiens 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1594. 


655 


74 


1647 


gil4328059 


Homo sapiens 


argininosuccinate synthetase 


655 


74 


1648 


gi7960207 


Oncorhynchus 
mykiss 


vitelline envelope protein alpha 


252 


43 


1648 


gil5384295 


Mycoplasma 


variable surface lipoprotein Vsp422- 


177 


34 
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ID 


Hit ID 
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Description 


!S 

score 


0/ 
/o 

Identity 






bovis 


8 






1648 


gi2281750 


Salmo salar 


eggshell protein 


164 


3o 


1649 


AAM79049 


Homo sapiens 


Human protein SEQ ID NO 171 1. 


1 A C 

345 


no 


1650 


gi7018384 


Homo sapiens 


&T193N13.1 (mannosidase, alpha, 
class 1A, member 1) 


1026 


no 


1650 


gil 5 929672 


Mus musculus 


mannosidase 1, alpha 


909 


88 


1650 


gi474280 


Mus musculus 


mannosyl-oligosaccharide alpha- 1,2- 
mannosidase 


909 


88 


1651 


AAU10551 


Homo sapiens 


Human A259 polypeptide. 


835 


71 


1651 


ABP47883 


Homo sapiens 


Human polypeptide SEQ ID NO 313. 


835 


71 


1651 


AAB50085 


Homo sapiens 


Human A259. 


835 


71 


1652 


gi45 12295 


Homo sapiens 


immunogloblin heavy chain variable 
region 


619 


100 


1652 


AAR66320 


Homo sapiens 


Human immunoglobulin variable 
heavy chain #26. 


613 


99 


1652 


gi296657 


Homo sapiens 


Ig heavy chain gene variable region 
V(12G-1) 


613 


99 


1653 


AAE24079 


Homo sapiens 


Human MDPK protein. 


185 


85 


1653 


AAE24150 


Homo sapiens 


Human kinase (PKIN)-21 protein. 


181 


97 


1653 


AAU03501 


Homo sapiens 


Human protein kinase #1. 


181 


97 


1654 


AAU75784 


Homo sapiens 


Human protein phosphatase 2 (PP2) 
protein sequence. 


667 


92 


1654 


AAE04841 


Homo sapiens 


Human SGP039 phosphatase 
polypeptide. 


667 


92 


1654 


gi8954030 


117 


Contains similarity to protein 
phosphatase 2C from Arabidopsis 
thaliana gb|AF085279. It contains a 
protein phosphatase 2C domain 
PF|00481 


42 




1655 


AAU96179 


Homo sapiens 


Human secreted protein, SEQ ID No 
81. 


86 


27 


1655 


gil 7942 19 


Homo sapiens 


150 lvDa oxygen-regulated protein 
ORP150 


82 


25 


1655 


AAB01381 


Homo sapiens 


Neuron-associated protein. 


76 


41 


1656 


ABB84869 


Homo sapiens 


Human PRO1079 protein sequence 
SEQ ID NO: 106. 


404 


78 


1656 


ABB95475 


Homo sapiens 


Human angiogenesis related protein 
PRO1079 SEQ ID NO: 106. 


404 


78 


1656 


AAB75372 


Homo sapiens 


Human secreted protein #3 1 . 


404 


78 


1657 


AAM41200 


Homo sapiens 


Human polypeptide SEQ ID NO 
6131. 


332 


97 


1657 


AAM39414 


Homo sapiens 


Human polypeptide SEQ ID NO 
2559. 


332 


97 


1657 


gi31867 


Homo sapiens 


N-acetylglucosamine-6-sulphatase 


332 


97 


1658 


AAW25154 


Homo sapiens 


Human disulphide epimerase like 
enzyme, EP52. 


2281 


100 


1658 


gil 136743 


Homo sapiens 


human P5 


2281 


100 


1658 


gil 2654931 


Homo sapiens 


protein disulfide isomerase-related 
protein 


2281 


1 AA 

100 


1659 


AAM41859 


Homo sapiens 


Human polypeptide SEQ ID NO 
6790. 


357 


71 


1659 


gi6694278 


Homo sapiens 


cell recognition molecule Caspr2 


357 


71 


1659 


gi!3624214 


Homo sapiens 


contactin-associated protein 2 


357 


71 


1660 


AAOH734 


Homo sapiens 


Human polypeptide SEQ ID NO 
25626. 


141 


100 



WO 03/054152 PCT/US02/39555 



Table 2B 
291 



SEQ 
ID 
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S 

score 


% 
Identity 


1661 


AAG74841 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID N 0:5605. 


DOo 


O / 


1661 


gi4588087 


Homo sapiens 


PTH-responsive osteosarcoma Bl 
protein 


568 


87 


1661 


gi20987880 


Mus museums 


Similar to PTH-responsive 
osteosarcoma Bl protein 


536 


86 


1662 


g i4754907 


Homo sapiens 


histone deacetylase 4 


2891 


77 


1662 


gil4495171 


Gallus gallus 


histone deacetylase-4 


2525 


oo 


1662 


AAB49957 


Homo sapiens 


Human histone deacetylase HDAC-4. 


2508 


79 


1663 


ABB94075 


Homo sapiens 


Human secreted protein SEQ ID NO: 
118. 


183 


1 f\r\ 
100 


1663 


ABB94056 


Homo sapiens 


Human secreted protein SEQ ID NO: 
99. J 


183 


100 


1663 


ABB94030 


Homo sapiens 


Human secreted protein SEQ ID NO: 
73. 


183 


1 A Pi 

100 


1664 


ABB37187 


Homo sapiens 


Peptide #4693 encoded by human 
foetal liver single exon probe. 


196 


1 C\C\ 

lUu 


1664 


gil 33 10486 


Homo sapiens 


C2H2 zinc finger protein 


91 


1 1 

31 


1664 


gil 5 159543 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_L_2143p 


88 


25 


1665 


AAB32388 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 bbQ ID JNO. /4. 


359 


1 aa 


1665 


AAY91419 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 6 bbQ iu JNU: I4U. 


82 


36 


1665 


gi21708132 


Homo sapiens 


Similar to LOC 1494 15 


oZ 


j i 


1666 


gi219615 


Homo sapiens 


preproelafm 


621 


100 


1666 


gi299841 


Homo sapiens 


pre-elafin 




1 A A 


1666 


gil 90338 


Homo sapiens 


elafin precursor 




1 A A 


1667 


gil8072031 


Homo sapiens 


zinc finger protein 328 


397 


78 


1667 


AAU27687 


Homo sapiens 


Human full-length polypeptide 
sequence #12. 


270 


00 


1667 


AAM79885 


Homo sapiens 


Human protein SEQ ID NO 353 1 . 


z59 


<Q 
DO 


1669 


AAG03136 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 7217. 


179 


75 


1669 


AAU85309 


Homo sapiens 


G-coupled olfactory receptor #170. 


176 


68 


1669 


AAU24689 


Homo sapiens 


Human olfactory receptor 
AOLFR188. 


176 


68 


1670 


AAY21852 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 2652271). 


375 


83 


1670 


AAY48563 


Homo sapiens 


Human breast tumour-associated 
protein 24. 


375 




1670 


AAW75103 


Homo sapiens 


Human secreted protein encoded by 
gene 47 clone HMCBP63. 


375 


83 


1671 


gi6572310 


Homo sapiens 


cB13C9.1 (protein 239AB) 


395 


no 


1671 


AAU69508 


Homo sapiens 


Human purified secretory 
polypeptide #77. 


348 


60 


1671 


gl73 /41 12 


Homo sapiens 




^47 


80 


1672 


AAG63163 


Homo sapiens 


Amino acid sequence of 
carcinoembryonic antigen-like 
polypeptide. 


2187 


99 


1672 


AAR54714 


Homo sapiens 


Carcinoembryonic antigen CEA-(c). 


456 


30 


1672 


AAB43688 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1133. 


455 


31 


1673 


AAU74354 


Homo sapiens 


Human cytoskeleton-associated 


1163 


34 
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SEQ 
ID 
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Description 


S 

score 


% 
i a entity 








protein (CYSKP) #25. 






1673 


AAB93267 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 12300. 


40.3 


55 


1675 


AAM06070 


Homo sapiens 


Peptide #4752 encoded by probe for 
measuring breast gene expression. 


259 


i r\r\ 


1676 


ABP43066 


Homo sapiens 


Human ovarian antigen HVCAE01, 
SEQ IDNO:4198. 


292 


Do 


1676 


ABP42762 


Homo sapiens 


Human ovarian antigen HOVJU75, 
SEQ ID NO:3894. 


292 


58 


1676 


AAG01285 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 5366. 


292 


58 


1677 


gi458226 


Homo sapiens 


a gene isolated from a CpG island 

1 i _ _____ O'T'C? _ J T/ - K T 

between STS and KAL 


1249 


97 


1677 


AAG00737 


Homo sapiens 


Human secreted protein, SEQ ID 
NO: 4818. 


531 


94 


1677 


gi9663151 


Homo sapiens 


transport-secretion protein 2.1 (TTS- 
2.1) 


421 


35 


1678 


gil806102 


Homo sapiens 


T cell receptor beta chain 


444 


100 


1678 


AAB68370 


Homo sapiens 


Human beta-chain variable region of 
T cell receptors (Vbetal7). 


440 


98 


1678 


AAW76993 


Homo sapiens 


Human T cell receptor beta chain 
variable region protein V-beta 17. 


440 


98 


1679 


AAB92624 


Homo sapiens 


tt i • riTi/^ TTA 

Human protein sequence SEQ ID 
NO:10919. 


455 


100 


1679 


ABG39897 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
29562. 


155 


69 


1679 i 


AAM05720 


Homo sapiens 


Peptide #4402 encoded by probe for 
measuring breast gene expression. 


155 


69 


1680 


gil0186503 


Homo sapiens 


sialic acid-specific acetylesterase II 


907 


68 


1680 


gi 10242345 


Homo sapiens 


sialic acid-specific 9-0- 
acetylesterase I 


907 


68 


1680 


gi5917657 


Mus musculus 


cytosolic sialic acid 9-0- 
acetylesterase 


732 


55 


1681 


gi46540 


Staphylococcus 
aureus 


coagulase precursor 


82 


25 


1681 


gi20338653 


Staphylococcus 
aureus 


coagulase 


82 


25 


1681 


gi21328339 


Caenorhabditis 
elegans 


C. elegans RBF-1 protein 
(corresponding sequence F37A4.7d) 


80 


31 


1682 ; 


AAO07844 


Homo sapiens 


XX 1 a* % riTl/\ X T~"V "V Tif~\ 

Human polypeptide SEQ ID NO 
21736. 


100 


70 


1682 


AAO07413 


Homo sapiens 


Human polypeptide SEQ ID NO 
21305. 


91 


46 j 


1682 


gil786l952 


Drosophila 
melanogaster 


LD01947p 


O / 


Di 


1683 


AAW99574 


Homo sapiens 


Human early placental insulin-like 

\i-jr puiypcpiiuc 




1 f\C\ 


1683 


AAY26926 


Homo sapiens 


Human insulin-like 4 protein. 


349 


100 


1683 


AAW69168 


Homo sapiens 


Zinsl protein. 


349 


100 


1684 


gi23365 


Homo sapiens 


17-HSD protein (AA 1 - 328) 


1082 


70 


1684 


gil8l95l 


Homo sapiens 


estradiol 17 beta-dehydrogenase 


1082 


70 


1684 


gil77l27 ; 


Homo sapiens 


1 7-beta-hydr oxysteroid 
dehydrogenase 


1082 


70 


1685 


ABB84963 


Homo sapiens 


Human PR04356 protein sequence 


1336 


99 
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ID 


Hit ID 
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S 

score 


% 
Identity 








SEQ ID NO:294. 






1685 


ABB95569 


Homo sapiens 


Human angiogenesis related protein 
PR04356 SEQ ID NO: 294. 


1336 


99 


1685 


AAB31206 


Homo sapiens 


Amino acid sequence of human 
polypeptide PR04356. 


1336 


99 


1686 


ABB80870 


Homo sapiens 


Type II TGFbeta receptor (RII) 
sequence. 


2130 


88 


1686 


AAB82996 


Homo sapiens 


Human transforming growth factor- 
beta receptor RII. 


2130 


88 


1686 


AAY59453 


Homo sapiens 


Human Transforming growth factor- 
beta II protein sequence. 


2130 


88 


1687 


ABB79162 


Homo sapiens 


Human VPAC1 receptor protein SEQ 
ID NO:2. 


386 


76 


1687 


ABB79161 


Homo sapiens 


Human VPAC1 receptor protein SEQ 
ID NO:l. 


386 


76 


1687 


AAB71878 


Homo sapiens 


Human VIPR seven transmembrane 
domain. 


386 


76 


1688 


AAB84663 


Homo sapiens 


Amino acid sequence of human 
tyrosine kinase protein Yes. 


520 


90 


1688 


AAY24421 


Homo sapiens 


Human yesl protein. 


520 


90 


1688 


gil81268 


Homo sapiens 


cellular yes-1 protein 


520 


90 


1690 


AA014938 


Homo sapiens 


Human UFD2-associated protein 1 . 


1386 


76 


1690 


AAM00761 


Homo sapiens 


Human bone marrow protein, SEQ 
ID NO: 124. 


1386 


76 


1690 


gil35 16467 


Homo sapiens 


homzygously deleted in 
neuroblastoma- 1/UFD2 


1386 


76 


1691 


AAB47977 


Homo sapiens 


BCY5. 


334 


98 


1691 


gi2463632 


Homo sapiens 


monocarboxylate transporter 
homologue MCT6 


326 


96 


1691 


gi9246437 


Staphylococcus 
aureus 


fmtA-like protein 


81 


24 


1692 


AAB95143 


Homo sapiens 


Human protein sequence SEQ ID j 
NO:17163. 


394 


98 


1692 


AAB93175 


Homo sapiens 


Human protein sequence SEQ ID 
NO:12114. 


394 


98 


1692 


AAB58175 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 5 13. 


394 


98 


1693 


AAG65906 


Homo sapiens 


Amino acid sequence of GSK gene 
Id 37967. i 


3025 


92 


1693 


AAY67598 


Homo sapiens 


Human adipose tissue protein # 1 . 


501 


40 


1693 


gi3786312 


Homo sapiens 


extracellular matrix protein ! 


501 


40 


1694 


ABB83488 


Homo sapiens 


Human cytoskeleton-associated 
protein, CSAP-17. 


216 


93 


1694 


gi3002588 


Mus musculus 


Plenty ofSH3s; POSH 


216 


93 


1694 


gil 8676610 


Homo sapiens 


FLJ00204 protein \ 


206 


86 


1695 


gil2018147 


Chlamydomonas 
reinhardtii 


vegetative cell wall protein gpl 


132 


30 


1695 


gil 79453 82 


Drosophila 
melanogaster 


RE17165p 


130 


28 


1695 


gil209103 


Rattus 
norvegicus 


atrophin-1 related protein 


130 


25 


1696 


AAB43791 


Homo sapiens 


Human cancer associated protein 
sequence SEQ ID NO: 1236. 


353 


98 


1696 


AAM79791 


Homo sapiens 


Human protein SEQ ID NO 3437. 


152 


77 


1696 


AAM78807 


Homo sapiens 


Human protein SEQ ID NO 1469. 


152 


77 * 
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1697 


AAE22224 


Homo sapiens 


Human 5685C6 protein. 


542 


98 


1697 


ABG40551 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
30216. 


516 


98 


1697 


AAM18652 


Homo sapiens 


Peptide #5086 encoded by probe for 
measuring cervical gene expression. 


516 


98 


1698 


AAM39274 


Homo sapiens 


Human polypeptide SEQ ID NO 
2419. 


1596 


79 


1698 


AAB29653 


Homo sapiens 


Human membrane-associated protein 
HUMAP-10. 


1596 


79 


1698 


gi9858855 


Homo sapiens 


HPT protein 


1596 


79 


1699 


gi6841138 


Homo sapiens 


HSPC099 


275 


100 


1699 


AAB93037 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 11 816. 


130 


23 


1699 


AAB61308 


Homo sapiens 


Human transcriptional regulator 
protein #8. 


125 


22 


1700 


AAM93959 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4164. 


1182 


99 


1700 


AAB36587 


Homo sapiens 


Human FLEXHT-9 protein sequence 
SEQ ID NO:9. 


590 


53 


1700 


AAB93652 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13161. 


586 


54 


1701 


gi30264 


Homo sapiens 


cystatin D 


392 


100 


1701 


gil 61 16526 


Homo sapiens 


yX60D10.1 (cystatin D) 


387 


100 


1701 


AA015149 


Homo sapiens 


Human cystatin D protein sequence. 


380 


98 


1702 


gil374754 


Staphylothermus 
marinus 


tetrabrachion 


72 


27 


1703 


AAE17127 


Homo sapiens 


Human GnT-V protein. 


628 


77 


1703 


AAB48911 


Homo sapiens 


Human beta-l,6-N- 
acetylglucosamine transferase (GnT- 
V). 


628 


77 


1703 


AAB83010 


Homo sapiens 


Human GnT-V. 


628 


77 


1704 


AAY68736 


Homo sapiens 


Short chain alcohol dehydrogenase- 
related molecule ScRM-2 cDNA. 


1249 


81 


1704 


AAY69295 


Homo sapiens 


A human human protein, designated 
HSPC021 (CBFAIG06). 


1249 


81 


1704 


AAB58463 


Homo sapiens 


Lung cancer associated polypeptide 
sequence SEQ ID 801. 


1249 


81 


1705 


gil 5294065 


Ictalurus 
punctatus 


40S ribosomal protein S26-2 


212 


58 


1705 


ABP42978 


Homo sapiens 


Human ovarian antigen HPDWD69, 
SEQ ID NO:41 10. 


208 


58 


1705 


AAG76127 


Homo sapiens 


Human colon cancer antigen protein 
SEQIDNO:6891. 


208 


58 


1706 


gi65 80428 


Homo sapiens 


IkappaBR 


2759 


95 


1706 


gi 14250636 


Homo sapiens 


nuclear factor of kappa light 
polypeptide gene enhancer in B-cells 
inhibitor-like 2 


2606 


86 


1706 


gi746415 


Homo sapiens 


I kappa BR 


1655 


71 


1707 


AAM79219 


Homo sapiens 


Human protein SEQ ID NO 1881. 


325 


64 


1708 


gi5901529 


Homo sapiens 


C2H2 type Kruppel-like zinc finger 
protein splice variant b 


565 


99 


1708 


gi5901527 


Homo sapiens 


C2H2 type Kruppel-like zinc finger 
protein 


317 


33 


1708 


gil5029916 


Mus musculus 


Similar to Zinc finger protein 118 


296 


31 
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S 
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1709 


AAU85415 


Homo sapiens 


Human protein NOV13. 


953 


98 


1709 


AAU72895 


Homo sapiens 


Human metal loprotease partial 
protein sequence #7. 


925 


100 


1709 


AAU74750 


Homo sapiens 


Human protease PRTS-10 protein 
sequence. 


925 


100 


1710 


gil2862392 


Mus musculus 


D86 


2863 


64 


1710 


ABG37531 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
27196. 


1687 


100 


1710 


AAM03553 


Homo sapiens 


Peptide #2235 encoded by probe for 
measuring breast gene expression. 


1687 


100 


1711 


gil787337 


acyl-carrier- 
protein synthase 
II [Escherichia 
coli K12] 


3-oxoacyl- 


315 


91 


1711 


gil25 14639 


acyl-carrier- 
protein synthase 
II [Escherichia 
coliG157:H7 
EDL933] 


3-oxoacyl- 


315 


91 


1711 


gi664870 


Escherichia coli 


beta-ketoacyl-acyl carrier protein 
synthase II 


315 


91 


1712 


AAG75407 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6171. 


1258 


82 


1712 


AAR89952 


Homo sapiens 


Insulin-like growth factor binding 
protein-3. 


1232 


80 


1712 


AAU85512 


Homo sapiens 


Clone #19095 (L549S) of lung 
tumour protein. 


1228 


80 


1713 


AAB94696 


Homo sapiens 


Human protein sequence SEQ ID 
NO:15673. 


710 


97 


1713 


AAB99892 


Homo sapiens 


Human RNA helicase gene helicain 
C protein sequence SEQ ID NO:6. 


710 


97 


1713 


AAB99891 


Homo sapiens 


Human RNA helicase gene helicain 
B protein sequence SEQ ID NO:4. 


710 


97 


1714 


gil5384740 


Homo sapiens 


paralemmin-2 


1652 


100 


1714 


gil5384742 


Homo sapiens 


Palm2-AKAP2 fusion protein 


1577 


100 


1714 


gil4041780 


Homo sapiens 


AKAP-2 protein 


410 


73 


1715 


gil3021825 


Homo sapiens 


polymerase 


575 


48 


1715 


gi3600067 


Homo sapiens 


polymerase 


572 


48 


1715 


gil780973 


Human 
endogenous 
retrovirus K 


pol protein 


572 


48 


1717 


gi35825 


Homo sapiens 


pregnancy zone protein 


6592 


88 


1717 


gi579594 


Homo sapiens 


alpha 2-macroglobulin 690-740 


4985 


66 


1717 


AAR11749 


Homo sapiens 


Human alpha-2 macroglobulin bait 
region mutant. 


4976 


66 


1718 


gil3366277 


Homo sapiens 


dJ998H6.1 (ortholog of rat PB- 
Cadherin) 


1167 


93 


1718 


gi4760578 


Mus musculus 


PB-Cadherin 


1028 


84 


1718 


gil398912 


Rattus 
norvegicus 


short type PB-cadherin 


1022 


84 


1720 


AAE06588 


Homo sapiens 


Human protein having hydrophobic 
domain, HP 10778. 


687 


100 


1720 


AAM40979 


Homo sapiens 


Human polypeptide SEQ ID NO 
5910. 


687 


100 
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1720 


gil5072402 


Raja erinacea 


organic solute transporter alpha 


357 


45 


1721 


AAG81345 


Homo sapiens 


Human AFP protein sequence SEQ 
ID NO:208. 


839 


62 


1721 


gi 16359082 


Homo sapiens 


Similar to RIKEN cDNA 
2810049G06 gene 


839 


62 


1721 


AAB93797 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13560. 


836 


62 


1722 


gi871883 


Homo sapiens 


lanosterol 14-demethylase 


2180 


99 


1722 


gi 1809225 


Homo sapiens 


lanosterol 14-demethylase 
(cytochrome p450) 


2180 


99 


1722 


gil698396 


Homo sapiens 


lanosterol 14-demethylase 
cytochrome P450 


2180 


99 


1723 


ABG41541 


Homo sapiens 


Human peptide encoded by genome- 
derived single exon probe SEQ ID 
31206. ! 


232 


100 


1723 


AAM32019 


Homo sapiens 


Peptide #6056 encoded by probe for 
measuring placental gene expression. 


232 


100 


1723 


AAM71727 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
32033. 


232 


100 


1724 


AAY53040 


Homo sapiens 


Human secreted protein clone 
kj320__l protein sequence SEQ ID 
NO:86. 


2480 


100 


1724 


gi35 10639 


Rattus 
norvegicus 


UDP-GaINAc:poIypeptide N- 
acetylgalactosaminyltransferase T5 


1345 


59 


1724 


gi6688167 


Homo sapiens 


GalNAc-T5 


1082 


100 


1725 


ABP41917 


Homo sapiens 


Human ovarian antigen H6EDF71, 
SEQ ID NO:3049. ; 


1329 


97 


1725 


AAG75406 


Homo sapiens 


Human colon cancer antigen protein 
SEQ ID NO:6170. 


1329 


97 


1725 


gi396176 


Homo sapiens 


antigenic surface determinant OA3 


1329 


97 


1726 


AAU79036 


Homo sapiens 


Human SHPS-1 (not defined) 
receptor. 


oro 
ODO 


no 
yo 


1726 


AAW40481 


Homo sapiens 


Human SH2 binding protein. 


OC Q 
OJO 




1726 


AAW49909 


Homo sapiens 


Signal regulatory protein 4 (SIRP4). 


858 


98 


1727 


gi2707601 


Homo sapiens 


synaptophysin 


1656 


1 r\n 
100 


1727 


gil5928723 


Mus museums 


synaptophysin 


1585 


94 


1727 


gi57326 


Rattus 
norvegicus 


synaptophysin (AA 1-307) 


1577 


95 


1728 


gil5590682 


Homo sapiens 


histone deacetylase 9a 


628 


96 


1728 


gil5590680 


Homo sapiens 


histone deacetylase 9 


628 


96 


1728 


gil2060992 


Mus musculus 


MEF2-interacting transcription 
repressor MITR 


625 


95 


1729 


gi2 1430596 


Drosophila 
melanogaster 


RE16431p 


891 


39 


1729 


gil4164377 


Mus musculus 


Type II membrane protein of 
ER~mouse gene similar to alpha- 
mannosidase 


350 


33 


1729 


gil504008 


Homo sapiens 


Containing ATP/GTP-binding site 
motif A(P-loop): Similar to 
C.elegans 

protein(Pl:CEC47E128);Similar to 
Mouse alpha- 
mannosidase(Pl :B54407) 


346 


33 


1730 


gi7861753 


Mus musculus 


GABA-A receptor epsilon-like 


412 


32 
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subunit 






1730 


gil 1610622 


Rattus 
norvegicus 


GABA-A epsilon subunit splice 
variant 


A 1 1 

411 


32 


1730 


gi7861743 


Rattus 
norvegicus 


GABA-A receptor epsilon-like 
subunit 


411 


32 


1731 


AAW80135 


Homo sapiens 


Human recombinant neurokinin-2 
(NK-2) receptor protein. 


1743 


95 


1731 


gil89222 


Homo sapiens 


neurokinin-2 receptor 


1743 


95 


1731 


gil89135 


Homo sapiens 


neurokinin A receptor 


1742 


95 


1732 


AAB75594 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 37 SEQ ID 
NO:148. 


678 


99 


1732 


AAB80437 


Homo sapiens 


Gene #20 associated peptide #1. 


381 


98 


1732 


AAM78175 


Homo sapiens 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
38481. 


365 


100 


1733 


ABB84853 


Homo sapiens 


Human PRO 1120 protein sequence 
SEO ID NO:74. 


293 


100 


1733 


ABB90378 


Homo sapiens 


Human polypeptide SEQ ID NO 
2754. 


293 


100 


1733 


ABB95459 


Homo sapiens 


Human angiogenesis related protein 
PROl 120 SEQ ID NO: 74. 


293 


100 


1734 


ABB07527 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 5643401CD1). 


1652 


79 


1734 


ABB07515 


Homo sapiens 


Human drug metabolizing enzyme 
(DME) (ID: 8097779CD1). 


867 


73 


1734 


gil3161409 


Mus musculus 


family 4 cytochrome P450 


718 


58 


1735 


AAM40183 


Homo sapiens 


Human polypeptide SEQ ID NO 
3328. 


1322 


67 


1735 


AAY71159 


Homo sapiens 


Human phosphodiesterase interacting 
protein, myomegalin. 


1322 


67 


1735 


gi4761644 


Rattus 
norvegicus 


myomegalin 


886 


44 


1736 


AAM94312 


Homo sapiens 


Human reproductive system related 
antigen SEQ ID NO: 2970. 


500 


71 


1736 


ABJ03726 


Homo sapiens 


Human ovary specific protein SEQ 
ID NO: 168. 


298 


41 


1736 


gi8439396 


HERV-H/env62 


envelope protein 


292 


A A 

40 


1737 


AAY76177 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


288 


100 


1738 


AAY92075 


Homo sapiens 


Human DKR-4. 


759 


100 


1738 


AAB08875 


Homo sapiens 


Amino acid sequence of a human 
Dickkopf (Dkk)-4 protein. 


759 


100 


1738 


AAW73017 


Homo sapiens 


Human cysteine-rieh secreted protein 
CRSP-2. 


759 


100 


1739 


ABB97828 


Homo sapiens 


Human secretory polypeptide 
(SPTM) 80. 


2007 


86 


1739 


ABB90159 


Homo sapiens 


Human polypeptide kxiQ 1JJ JN<j 
2535. 


1 *\A1 
1 D^t I 


OR 


1739 


gil 8642980 


Homo sapiens 


GTPase 


1524 


61 


1740 


AAB49278 


Homo sapiens 


Protein encoded by zsig8 1 cDNA 
fragment. 


755 


94 


1740 


AAU29276 


Homo sapiens 


Human PRO polypeptide sequence 
#253. 


755 


94 


1740 


gil2003127 


Eremothecium 


GTPase activating protein BEM2 


83 


28 
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gossypii 








1741 


gu 1707232 


Homo sapiens 


similar to cnoime transporter-iiKe 
proiem 


1 fi'M 


/ u 


1741 


AAB65196 


Homo sapiens 


Human PROl 1 15 (UNQ558) protein 
sequence SEQ ID NO: 177. 


1636 


76 


1741 


AAB87541 


Homo sapiens 


u 1Tlvl0ri pDni 11^ 
riuman Jr KU 1 1 1 D . 




76 


1742 


AAG66142 


Homo sapiens 


Human PAS Kinase (PASK) 
polypeptide. 


2227 


99 


1742 


AAM79231 


Homo sapiens 


Human protein oisQ id jnu loyo. 


zzz / 


QQ 

yy 


1742 


AAM79230 


Homo sapiens 


Human protein SEQ ID NO 1892. 


2227 


99 


1743 


gil3879899 


Mycobacterium 

tuberculosis 

CDC1551 


PPE family protein 


1 AO 

1UZ 


z/ 


1743 


gi2653311 


Bovine 

herpesvirus type 
l.l 


very large virion protein (tegument) 


99 


27 


1743 


gil491621 


Bovine 
herpesvirus l 


T IT 1 £L 

UL36 


yy 


z / 


1744 


gi593l7l8 


Chlamydomonas 
reinhardtii 


1 -alpha dynein heavy chain 


Z1ZO 


^6 

jO 


1744 


gi9409781 


Chlamydomonas 
reinhardtii 


1 beta dynein heavy chain 


1121 


34 


1744 


gi5142l5 


Chlamydomonas 
reinhardtii 


dynein beta heavy chain 


1070 


33 


1745 


AAB53088 


Homo sapiens 


Human angiogenesis-associated 
protein rKUozo, oEKl 11J INtJ.UZ. 


1972 


89 


1745 


AAB80260 


Homo sapiens 


Human PR0328 protein. 


1972 


89 


1745 


AAU12351 


Homo sapiens 


Human rK0328 polypeptide 
sequence. 


iy /z 


so 
oy 


1746 


AAU29172 


Homo sapiens 


Human PRO polypeptide sequence 
#149. 


/jU 


6S 


1746 


AAY99398 


Homo sapiens 


Human PRO1301 (UNQ667) amino 
acid sequence bbQ 1JJ JNtJ.ZiZ. 


730 


68 


1746 


AAM38651 


Homo sapiens 


Human polypeptide SEQ ID NO 
1796. 


728 


68 


1747 


AAE21056 


Homo sapiens 


Human drug metabolising enzyme 
(DME-14) protein. 


111 


59 


1748 


AAE03560 


Homo sapiens 


Human differentially expressed 
kidney cDlNA 22360 encoded 
protein. 


1140 


88 


1748 


AAM42434 


Homo sapiens 


Human kidney related polypeptide 
oca jri mo 10^ 


Af,f, 


Q& 


1 "1 A O 

1748 


AAA /fflO/CI A 


— — : 

Homo sapiens 


nurnan excretory reiaieu poiypeptiue 
SEQ ID NO 356. 


H'UU 




1749 


A A "croon 1 1 
AAB2Z9 1 1 


Homo sapiens 


riuman transporter ana ion cnannei 
(TRICH) 10. 






1 'HA A 

1749 


gll350oo05 


Homo sapiens 


thymic stromal co-transporter 




1 00 

X \J\J 


1 74.Q 


cril ^06R0R 

1 D u \J\J OVJO 


A./fno tvin colli HQ 


thvmic stromal co-Iran snorter 


1782 


77 


1750 


gil 8 157547 


Mus musculus 


pecanex-like 3 


7005 


89 


1750 


gil 5076843 


Homo sapiens 


pecanex-like protein 1 


4359 


51 


1750 


gil3171105 


Takifugu 
rubripes 


pecanex 


4118 


58 


1751 


AAY06603 


Homo sapiens 


Retinoblastoma interacting protein 
GluTl* 


411 


100 


1751 


gi825504 


Homo sapiens 


glutamate transporter 


411 


100 
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c 

SPUTA 


/o 

TH**ri'f'ii"v 
iucu li i y 


1751 


gi825663 


Homo sapiens 


GLAST1 


411 


100 


1752 


gi2078518 


Homo sapiens 


neogenin 


Oy D 


100 


1752 


gi641966 


Gallus gallus 


neogenin 


591 


98 


1752 


* "1 ^1 t~t tT A/AO 

gil785999 


Rattus 
norvegicus 


neogenin 


JOU 


Q7 
y / 


1753 


ABB53278 


Homo sapiens 


Human polypeptide #18. 


315 


26 


1753 


ABB53277 


Homo sapiens 


Human polypeptide #17. 


315 


26 


1753 


AAP94014 


Homo sapiens 


Carcinoembryonic cell surface 
antigen. 


oo< 
ZZj 


ZD 


1754 


AA021681 


Homo sapiens 


Human secreted protein SEQ ID No 
23. 


AA'X 

443 




1754 


AAB75375 


Homo sapiens 


Human secreted protein #34. 


A A1 

443 


yj 


1754 


AAB88603 


Homo sapiens 


Human hydrophobic domain 
containing protein clone HP10770 
#127. 


A A1 

443 


yj 


1755 


AAE22906 


Homo sapiens 


Human transporter and ion channel 
(TRICH) 5. 


1973 


90 


1755 


AAE16350 


Homo sapiens 


Human tetracycline transporter like- 
like protein, POLY 14. 


1969 


90 


1755 


gi2506078 


Mus musculus 


tetracycline transporter-like protein 


1961 


90 


1756 


AAB 88469 


Homo sapiens 


Human membrane or secretory 
protein clone PbECuUz/. 


IOoj 


no 


1756 


AAE06608 


Homo sapiens 


Human protein having hydrophobic 
domain, HP 10798. 


1065 


98 


1756 


AAM40347 


Homo sapiens 


Human polypeptide JSEQ ID JNU 

o a no 
34yZ. 


1 

1U0D 


QC 
yo 


1757 


gi8925284 


Homo sapiens 


phosphatidylinositol polyphosphate 
5 -phosphatase type IV 




01 
y i 


1757 


gi9295353 




Mus musculus 


inositol poiypnospnaxe j-piiobpiid.id.ise 


i yoy 


78 


1757 


gi5360761 


Rattus 
norvegicus 


pharbin 


1928 


77 


1758 


gi395207 


Bos taurus 


potassium channel (BGK5) 


753 


82 


1758 


gil 86669 


Homo sapiens 


potassium channel 


/ JU 


oZ, 


1758 


gi3 04652 


Canis familiaris 


delayed rectifier K+ channel 


f H-KJ 


O 1 


1759 


AAB65058 


Homo sapiens 


Gene #7 associated peptide #12. 


908 


93 


1759 


AAB64999 


Homo sapiens 


Human secreted protein #7. 






1759 


AAU01099 


Homo sapiens 


Gene 35 Human secreted protein 
homologous amino acid sequence. 


814 


91 


1760 


AAU11384 


Homo sapiens 


Human T2R61 (hT2R61) 
polypeptide. 


802 


100 


1760 


gi20336531 


Homo sapiens 


candidate taste receptor T2RP1 


794 


99 


1760 


ABB06836 


Homo sapiens 


Human nGPCR-Seql048 protein 
sequence SEQ ID NO: 105. 


718 


79 


1761 


AAY36115 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 500. 


514 


87 


1761 


AAW64556 


Homo sapiens 


T-f i_ * 11 1 • TT r~\ 

Human osterocarcinoma cell line U-z 
OS clone HP 1 0305 protein. 


<1 A 

j 14 


of 


1 /Ol 


rril AO ^01 00 

gll4ZDUlZZ 


it. Ohio Sapiens 


stem/progenitor cells protein 
MDS029 


514 


87 


1762 


AAG78575 


Homo sapiens 


Human SLC5A3 amino acid 
sequence. 


3621 


100 


1762 


AAB47976 


Homo sapiens 


BCW2. 


3621 


100 


1762 


gi2739094 


Homo sapiens 


sodium/myo-inositol cotransporter 


3621 


100 


1763 


ABB 80578 


Homo sapiens 


Human sbg6 1 8069LRR protein #2. 


3076 | 99 
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1763 


ABB 8 0577 


Homo sapiens 


Human sDg61oUo9LKK protein wl. 


LD IZ 




1763 


AAW84596 


Homo sapiens 


Amino acid sequence of the human 
Tango-79 protein. 


lzuy 


AA 

44 


1764 


ABB 81 460 


Homo sapiens 


Human aggrecanase MDTS8 protein 

oiSV^ iU JN\J:Z. 




csy 


1764 


gu917115U 


— ; 

Homo sapiens 


J\U am. ioio protein 






1764 


AAU72893 


Homo sapiens 


Human metalloprotease partial 
protein sequence #5. 


2762 


90 


1765 


gil695682 


Homo sapiens 


hepatic triglyceride lipase 


152 


64 


1765 


gi32498 


Homo sapiens 


-Arm, mm r-t. s-m-m -m mmt<-l -mm f A A OO ■#**-». A T m£\ 

precursor (AA -zJ to 4/oj 


1 

IJZ 


04 


1765 


gi339593 


Homo sapiens 


triglyceride lipase 


LdZ 


04 


1766 


AAM79459 


Homo sapiens 


Human protein SEQ ID NO 3 105. 


1493 


100 


1766 


AAM78475 


Homo sapiens 


Human protein SEQ ID NO 113 7. 




1 f\f\ 

1UU 


1766 


ABB 11930 


Homo sapiens 


Human secreted protein homologue, 
SEQ ID NO:23UU. 


1493 


100 


1767 


AAM47914 


Homo sapiens 


Human lysophosphatidic aminoacyl 
transferase 42. 


3Z4 


yu 


1767 


AAE15296 


Homo sapiens 


Human LrAA 1 delta protein. 


lO A 


yu 


1767 


AAB65188 


Homo sapiens 


Human PRO1016 (UNQ499) protein 
sequence oriQ ID NU: loo. 


324 


90 


1768 


AAE23757 


Homo sapiens 


Human metabotropic glutamate 
(mGluR4) receptor protein. 


926 


99 


1768 


AAR82658 


Homo sapiens 


Human mGluR4. 


926 


99 


1768 


gill60183 


Homo sapiens 


metabotropic glutamate receptor type 
4 


926 


99 


1769 


AAM41363 


Homo sapiens 


Human polypeptide SEQ ID NO 
6294. 


2370 


98 


1769 


AAE17500 


Homo sapiens 


Human secretion and trafficking 
protein-9 (SAT-9). 


2207 


99 


1769 


AAM39577 


Homo sapiens 


Human polypeptide SEQ ID NO 
2722. 


2207 


AA 

99 


1770 


AAB73512 


Homo sapiens 


Human transferase HTrS-19, bbQ 


zu4 


A^ 

4D 


1770 


AAB56487 


Homo sapiens 


Human prostate cancer antigen 
protein sequence isiiQ IJJ jnu. lUOD. 


264 


45 


1770 


gil4249942 


Homo sapiens 


Similar to RIKEN cDNA 
UolUOUbrlo gene 


264 


45 


1771 


gi7678873 


Homo sapiens 


vascular cadherin-2 


5369 


99 


1771 


gi7407150 


Homo sapiens 


protocadherin 12 


Djoy 


nn 

yy 


1771 


gi8 164037 


Homo sapiens 


vascular endothelial cadherin 2 




yy 


1772 


AAM93947 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4138. 


231 


93 


1772 


AAM93886 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4013. 


231 


93 


1772 


AAM93857 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3949. 


231 


93 


1773 


gi 1335205 


Homo sapiens 


ORFII j 


270 


48 


1773 


AAB94891 


Homo sapiens 


Human protein sequence SEQ ID 
NO:16231. 




OZ 


1773 


AAG73650 


Homo sapiens 


Human colon cancer antigen protein 
SEQIDNO:4414. 


263 


58 


1774 


AAB93885 


Homo sapiens 


Human protein sequence SEQ ID 
NO:13815. 


1088 


85 


1774 


AAM93980 


Homo sapiens 


Human stomach cancer expressed 
polypeptide SEQ ID NO 29. 


1088 


85 
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1774 


gi3603459 


Homo sapiens 


tetraspan NET-5 


1088 


85 


1775 


ABB06607 


Homo sapiens 


G protein-coupled receptor GPCR5 
protein SEQ ID NO:24. 


982 


77 


1775 


AAG71597 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1278. 


982 


77 


1775 


gi2 1928553 


Homo sapiens 


seven transmembrane helix receptor 


982 


77 


1776 


AAB64888 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 7 SEQ ID NO:66. 


252 


85 


1776 


AAB38011 


Homo sapiens 


Human secreted protein encoded by 
gene 3 clone HPJCX13. 


252 


85 


1776 


AAB94917 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 16357, 


191 


62 


1777 


gi 182851 


Homo sapiens 


G0S2 protein 


497 


100 


1777 


gil213013 


Mus musculus 


G0S2-like protein 


377 


77 


1777 


AAM79519 


Homo sapiens 


Human protein SEQ ID NO 3 165. 


81 


26 


1778 


ABB89432 


Homo sapiens 


Human polypeptide SEQ ID NO 
1808. 


473 


100 


1778 


ABP61790 


Homo sapiens 


Human polypeptide SEQ ID NO 144. 


473 


100 


1778 


AAY80991 


Homo sapiens 


Human VAMP-2 homologue, 
CBCBMH06. 


473 


100 


1779 


gi5264503 


Mus musculus 


sif and Tiaml-like exchange factor 


82 


30 


1779 


gi9295309 


frog adenovirus 
1 


pllla protein 


78 


40 


1779 


gil6415263 


Listeria innocua 


similar to two-component sensor 
histidine kinase 


78 


27 


1780 


gi2 1693020 


Homo sapiens 


MHC class I antigen 


1311 


70 


1780 


gil 399321 


Macaca mulatta 


MHC class I antigen Mamu B*08 


1307 


71 


1780 


gi8117799 


Pan troglodytes 


MHC class I antigen 


1302 


70 


1781 


ABB75677 


Homo sapiens 


Breast protein-eukaryotic conserved 
gene 1 (BSTP-ECG1) protein. 


754 


40 


1781 


AAU29191 


Homo sapiens 


Human PRO polypeptide sequence 
#168. 


754 


40 


1781 


AAY99421 


Homo sapiens 


Human PR01433 (UNQ738) amino 
acid sequence SEQ ID NO:292. 


754 


40 


1782 


gi2343157 


Homo sapiens 


peroxisomal membrane protein 69 


2796 


89 


1782 


gi2706518 


Homo sapiens 


peroxisomal ABC-transporter 


2796 


89 


1782 


gil5215442 


Homo sapiens 


Similar to ATP -binding cassette, sub- 
family D (ALD), member 4 


2788 


88 


1783 


AAB35235 


Homo sapiens 


Human neurotransmitter transporter 
protein GC42. 


3573 


98 


1783 


AAB35236 


Homo sapiens 


Human glycine transporter type lc. 


3559 


98 


1783 


gi546769 


Homo sapiens 


glycine transporter type lb; GlyT-lb 


3559 


98 


1784 


AAU00017 


Homo sapiens 


Human Plexin-Dl. 


7512 


90 


1784 


ABB 11709 


Homo sapiens 


Human plexin-Bl/SEP receptor 
homologue, SEQ ID NO:2079. 


7467 


90 


1784 


gi5918167 


Homo sapiens 


plexin-Bl/SEP receptor 


2120 . 


33 


1785 


ABB44591 


Homo sapiens 


Human wound healing related 
polypeptide SbQ ID NO 48. 


953 


99 


1785 


gi6 19730 


Homo sapiens 


nuclear factor I 


953 


99 


1785 


gi9653290 


Mus sp. 


NFI-X3 


953 


99 


1786 


gil3810568 


Homo sapiens 


Toll-like receptor 5 


4482 


100 


1786 


ABB11795 


Homo sapiens 


Human Toll/IL-lR-like protein 
homologue, SEQ ID NO:21 65. 


4478 


99 


1786 


gi3 132526 


Homo sapiens 


Toll/interleukin-1 receptor-like 
protein 3 


4464 


99 
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1787 


AAB88597 


Homo sapiens 


Human hydrophobic domain 
containing protein clone HP03670 

JLZ1 r> t 

#121. 


o on 

8S7 


82 


1787 


AAB56473 


Homo sapiens 


Human prostate cancer antigen 
protein sequence Sbv m N\J: ium. 


887 


82 


1787 


AAB60119 


Homo sapiens 


Human transport protein TPPT-39. 


564 


75 


1788 


AAB80300 


Homo sapiens 


Human prostate cancer antigen #28. 


n A 1 
/41 


on 

ay 


1788 


AAB80276 


Homo sapiens 


Human prostate cancer antigen #4. 


741 


89 


1788 


gi4929765 


Homo sapiens 


CGI- 148 protein 


741 


89 


1789 


gi22038189 


Escherichia coli 


multidrug transporter 


2265 


100 


1789 


gil736785 


Escherichia coli 


Acriflavin resistance protein F (EnvD 
protein). 


2265 


100 


1789 


gil5980819 


Yersinia pestis 


AcrB/AcrD/AcrF family membrane 
protein 


1854 


79 


1790 


gil7741602 


ribose 

[Agrobacterium 
tumefaciens str. 
C58 (U. 
Washington)] 


ABC transporter, membrane 
spanning protein 


549 


59 


1790 


gil5160166 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_L_3181p 


549 


59 


1790 


gi20515187 


Thermoanaeroba 
cter 

tengcongensis 


Ribose/xylose/arabinose/galactoside 
ABC-type transport systems, 
permease components 


340 


38 


1791 


gil788573 


Escherichia coli 
K12 


sn-glycerol-3 -phosphate permease 


1073 


100 


1791 


gil799587 


Escherichia coli 


glycerol-3-phosphate transport 
protein 


1073 


100 


1791 


gi41587 


Escherichia coli 


glycerol-3 -phosphatase transporter 
(AA 1 - 452, glpT) 


1073 


100 


1792 


gi 1790233 


Escherichia coli 
K12 


arylsulfatase 


679 


98 


1792 


gil25 18665 


Escherichia coli 

0157:H7 

EDL933 


arylsulfatase 


679 


98 


1792 


gi 13364207 


Escherichia coli 
0157:H7 


arylsulfatase 


679 


98 


1793 


gi21959134 


Yersinia pestis 
KIM 


sulfate transporter 


378 


80 


1793 


gi9657461 


Vibrio cholerae 


sulfate permease family protein 


267 


51 


1793 


gil8145142 


Clostridium 
perfringens str. 
13 


probable sulfate permease 


264 


53 


1794 


gil799719 


PIR Accession 
Number S08346 
[Escherichia 
coli] 


similar to 


922 


100 


1794 


gil5156677 


Agrobacterium 
tumefaciens str. 
C58 (Cereon) 


AGR_C_2926p 


452 


50 


1795 


gilll77166 


Mus musculus 


adhesion molecule ninjurin 


228 


81 


1795 


gi3 077901 


Mus musculus 


ninjurin 


228 


81 


1795 


gil 644366 


Rattus 
norvegicus 


ninjurin 1 


228 


81 
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l.UClllllj' 


1796 


AAU74824 


Homo sapiens 


Human REPTR 7 protein. 


6196 


92 


1796 


ABB90740 


Homo sapiens 


Human Tumour Endothelial Marker 
polypeptide SEQ ID NO 212. 


OlyO 


yj* 


1796 


ABB90725 


Homo sapiens 


Human Tumour Endothelial Marker 

— — .1- ~*~~4.Z A~ CT7A TT\ ICC 

polypeptide JSbQ ID JN(J loo. 


oxyo 


GO 

yz 


1797 


ABP61451 


Homo sapiens 


Human NF-kB activating protein 
SEQ ID NO 55. 


1 QQ 1 
LDD 1 


i no 


1797 


AAY94343 


Homo.sapiens 


Human cell surface receptor protein 
#10. 


1331 


100 


1797 


gil3938575 


Homo sapiens 


Similar to RIKENcDNA 
26105 11E22 gene 


1331 


100 


1798 


gi606234 


Escherichia coli 


secY 




i on 


1798 


gi42989 


Escherichia coli 


SecY (PrlA) polypeptide (aa 1-443) 


953 


100 


1798 


gil 642 1976 


Salmonella 

typhimurium 

LT2 


preprotein translocase of IISP family 




yy 


1799 


gi 18255305 


Mus musculus 


p53 apoptosis effector related to 
Pmp22 


440 


53 


1799 


gi7582391 


Mus musculus 


p53 apoptosis-associated target 


440 


53 


1799 


AAM50572 


Homo sapiens 


Human tumour suppressor protein 
THW. 


oy 1 


i fin 


1800 


AAU11433 


Homo sapiens 


Human short-chain dehydrogenase, 
SCDR. 


ddi 


^c 

DO 


1800 


AAU18368 


Homo sapiens 


Human endocrine polypeptide SEQ 
ID No 323. 


DDL 


JO 


1800 


AAU18369 


Homo sapiens 


Human endocrine polypeptide SEQ 
ID No 324. 


DDK) 


oy 


1801 


ABB12425 


Homo sapiens 


Human bone marrow expressed 
protein SEQ ID NO: 264. 


17cl 


on 

yu 


1801 


AAM50318 


Homo sapiens 


Human membrane transporter (MTP) 
33556. 




yu 


1801 


AAM83805 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO:l ljyo. 


1499 


90 


1802 


gi208 10074 


Homo sapiens 


Similar to pepsinogen 5, group 1 
(pepsinogen A) 


670 


86 


1802 


AAB61351 


Homo sapiens 


Pepsin protein. 


661 


85 


1802 


AAB66589 


Homo sapiens 


Human pepsin. 


DO I 


OJ 


1803 


ABB04707 


Homo sapiens 


Human SPoz protein biiQ ID JNU.z. 


L L ID 


00 
yy 


1803 


ABB84912 


Homo sapiens 


Human PRO 1356 protein sequence 
SEQ ID NO: 192. 


1117 
1 I 1 J 


00 

yy 


1803 


AAU76534 


Homo sapiens 


Tumour-associated antigenic target 
protein, TAT134. 


i 1 1 -a 

1 1 1 D 


QQ 

yy 


1804 


gi2641217 


Oryctolagus 
cuniculus 


anion exchanger 3 brain isoform 


645 


66 


1804 


gi476222 


Homo sapiens 


anion exchanger 3 brain isoform 


645 


66 


1804 


gi8 86256 


Homo sapiens 


anion exchange protein 




DO 


1805 


AAM25789 


Homo sapiens 


Human protein sequence SEQ ID 

XTfVI ^fiA 
INt-/. LDKJ**. 


2659 


56 


1805 


AAB29632 


Homo sapiens 


Human pollinosis-associated gene 
581-encoded protein, SEQ ID 
NO: 12. 


2659 


56 


1805 


gil 8698435 


Homo sapiens 


pVHL-interacting deubiquitinating 
enzyme 1 type II 


2659 


56 


1806 


AAU29153 


Homo sapiens 


Human PRO polypeptide sequence 
#130. 


1771 


98 
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1806 


AAY99363 


Homo sapiens 


Human PRO 1380 (UNQ717) ammo 
acid sequence SEQ ID NO:79. 


1771 


98 


1806 


AAG67487 


Homo sapiens 


Amino acid sequence of a human 
transporter protein. 


1765 


98 


1807 


ABB76315 


Homo sapiens 


Human protein phosphatase PP-8 
Incyte ID No. 4022502CD1. 


f- /CI 

667 


73 


1807 


gi 15778670 


Mus musculus 


sphingosine- 1 -phosphate 
phosphatase 


151 


30 


1807 


gi9623190 


Mus musculus 


sphingosine- 1 -phosphate 
phosphohydrol ase 


151 


30 


1808 


AAM93947 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4138. 


231 


93 


1808 


AAM93886 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
4013. 


231 


93 


1808 


AAM93857 


Homo sapiens 


Human polypeptide, SEQ ID NO: 
3949. 


231 


93 


1809 


gi 145 75 679 


Homo sapiens 


hemicentin 


597 


95 


1809 


gi3328186 


Caenorhabditis 
elegans 


hemicentin precursor 


354 


57 


1809 


AAU75886 


Homo sapiens 


Human adhesion molecule protein 

A T\A I A A T\'"> 1 Olfl 1 

AD4/ AAD2 1 o20. 1 . 


185 


33 


1810 


gi3581982 


Homo sapiens 


extraneuronal monoamine transporter 


775 


99 


1810 


gi 142705 13 


Homo sapiens 


organic cation transporter 3 


775 


99 


1810 


gi 13699874 


Mus musculus 


organic cation transporter 3 


720 


89 


1811 


AAB74762 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO: 71 . 


266 


88 


1811 


AAB74760 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO:69. 


266 


91 


1811 


AAB74759 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 18 SEQ ID NO:68. 


266 


91 


1812 


gil5082375 


Homo sapiens 


Similar to transmembrane 7 
superfamily member 1 (upregulated 
in kidney) 


912 


82 


1812 


gil3096836 


Mus musculus 


Similar to transmembrane 7 
superfamily member 1 (upregulated 
in kidney) 


885 


80 


1812 


ABB97817 


Homo sapiens 


Human secretory polypeptide 
(SPTM) 69. 


432 


68 


1813 


gi 15 04024 


Homo sapiens 


similar to Mouse finger protein(clone 
mkr3)(o03677): 


901 


37 


1813 


gil4549186 


Mus musculus 


zinc finger protein 219 


636 


31 


1813 


AAM39029 


Homo sapiens 


Human polypeptide SEQ ID NO 

o i n a 
2174. 


619 


32 


1814 


A A VC3 £ZA A 

AAY53644 


Homo sapiens 


Protein encoded by the human 
longevity assurance gene 1 (LAG1). 






1814 


AAR20230 


Homo sapiens 


hUOG-1. 


1382 


93 


1814 


• A *t A A s~ a 

gi4324468 


Homo sapiens 


LAG1 protein 


1382 


93 


1 O 1 £ 


A AT T'7^0n'7 

aau oyu / 


Homo sapiens 


Human epidermis-specific serine 
protease #2. 




*+I 


1815 


gi6009515 


Xenopus laevis 


epidermis specific serine protease 


329 


41 


1815 


gi 19353 198 


Mus musculus 


RIKEN cDNA 2010001P08 gene 


323 


41 


1816 


AAM42401 


Homo sapiens 


Human polypeptide SEQ ID NO 134. 


706 


100 


1816 


gil 786232 


Escherichia coli 
K12 


K+ efflux antiporter, glutathione- 
regulated 


706 


100 


1816 


gi21321928 


Escherichia coli 


Glutathione-regulated potassium- 


706 


100 
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efflux system protein KefC 
(K(+)/H(+) antiporter). 






1817 


gil657563 


Escherichia coli 


dioxygenase 


1510 


100 


1817 


gil786565 


Escherichia coli 
K12 


taurine dioxygenase, 2-oxoglutarate- 
dependent 


1510 


100 


1817 


gil 054578 


Escherichia coli 


dioxygenase 


1510 


100 


1818 


gil787550 


Escherichia coli 
K12 


homolog of Salmonella peptide 
transport permease protein 


928 


98 


1818 


gil25 15484 


Escherichia coli 

0157:H7 

EDL933 


homolog of Salmonella peptide 
transport' permease protein 


928 


98 


1818 


gil3361335 


Escherichia coli 
0157:H7 


homolog of Salmonella peptide 
transport permease protein 


928 


98 


1819 


gi9652147 


Homo sapiens 


transmembrane-type protein tyrosine 
phosphatase H 


5771 


98 


1819 


gi475004 


Homo sapiens 


protein tyrosine phosphatase 
precursor 


5235 


93 


1819 


gi!321659 


Rattus 
norvegicus 


brain-enriched membrane-associated 
protein tyrosine phosphatase (BEM)- 
2 


1487 


71 


1820 


gi7141127 


Homo sapiens 


Ellis- van Creveld syndrome protein 


1277 


100 


1820 


gi7271903 


Homo sapiens 


DWF-1 


1271 


99 


1820 


gil3506715 


Bos taurus 


Ellis-van Creveld syndrome protein 


1034 


81 


1821 


AAG72370 


Homo sapiens 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2051. 


568 


98 


1821 


AAG71453 


Homo sapiens 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1 134. 


568 


98 


1821 


AAE04556 


Homo sapiens 


Human G-protein coupled receptor- 
12 (GCREC-12) protein. 


558 


100 


1822 


ABB89189 


Homo sapiens 


Human polypeptide SEQ ID NO 
1565. 


388 


63 


1822 


AAM90349 


Homo sapiens 


Human immune/haematopoietic 
antigen SEQ ID NO: 17942. 


126 


50 


1822 


AAB95094 


Homo sapiens 


Human protein sequence SEQ ID 
NO: 17042. 


94 


41 
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912 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158A 21.81 4.971e-37 124- 
169 BL01158B 17.07 4.343e-23 
169-196 


912 


PF00043 


Glutathione S-transferases. 


PF00043 21.83 5.333e-14 377-407 


913 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 8.200e-17 180- 
208 PD01719A 12.89 2.364e-13 
123-151 


913 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 2.091e-10 817- 
828 


913 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 6.538e-16 812- 
828 BL01187B 12.04 5.696e-13 
569-585 BL01187B 12.04 7.261e- 
13 696-712 BL01187A9.98 
1.429e-10 508-520 BL01187B 
12.04 2.286e- 10 484-500 
BL01187A 9.98 1.750e-09 796-808 


913 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 1.900e-09 821-828 


913 


BL01177 


Anaphylatoxin domain proteins. 


BL01177D 17.50 5.167e-09 503- 
521 


913 


BL00281 


Bowman-Birk serine protease inhibitors 
family proteins. 


BL00281A 14.18 6.754e-09 479- 
496 


913 


BL00799 


Granulins proteins. 


BL00799B 11.02 7.429e-09 475- 
511 


913 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 2.479e-ll 808- 
825 PR00907G 11.63 9.660e-10 
812-839 PR00907G 11.63 9.745e- 
10 696-723 PR00907G 11.63 
9.027e-09 569-596 


914 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158A 21.81 4.073e-33 2-47 
BL01 158B 17.07 2.884e-20 47-74 


915 


BL01158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158B 17.07 4.343e-23 60-87 
BL01158A 21.81 3.656e-22 2-47 


916 


BL00023 


Type II fibronectin collagen-binding 
domain proteins. 


BL00023 24.31 5.091e-27 101-138 


916 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134B 15.99 8.560e-17 626- 
650 BL00134A 11.96 1.321e-16 
469-486 BL00134C 13.45 4.462e- 
16 662-676 


916 


BL01253 


Type I fibronectin domain proteins. 


BL01253C 15.89 9.027e-40 327- 
366 BL01253B 15.21 5.071e-38 
272-316 BL01253H 13.15 7.070e- 
36 644-679 BL01253E 16.01 
1.000e-34 543-580 BL01253F 
14.35 2.846e-34 581-620 
BL01253A 20.33 7.097e-25 201- 
230 BL01253G 11.34 8.085e-18 
625-639 BL01253D 4.84 7.070e-16 
469-483 


916 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 1 


PR00722A 12.27 3.793e-14 470- 
486 PR00722C 10.87 2.059e-13 
625-638 


916 


BL00021 


Kringle domain proteins. 


BL00021D 24.56 3.000e-32 634- 
676 BL00021B 13.33 9.217e-17 
469-487 BL00021C 22.21 4.293e- 
12 554-576 


916 


PR00013 


FIBRONECTIN TYPE II REPEAT 


PR00013C 12.29 4.273e-19 132- 
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SIGNATURE 


148 PR00013A 12.26 6.595e-ll 
105-115 PR00013B 14.75 1.409e- 
10 116-129 


916 


BL00495 


Apple domain proteins. 


BL00495N 11.04 7.987e-21 618- 
653 BL00495O 13.75 3.311e-17 
653-682 BL00495M 8.50 6.243e- 
10 545-580 


916 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 1.900e-09 267-274 


916 


PR00018 


KRINGLE DOMAIN SIGNATURE 


PR00018C 14.30 7.750e-22 331- 
352 PR00018A 14.52 2.286e-12 
286-302 PR00018B 17.75 7.818e- 
10 302-315 PR00018D 13.51 
3.531e-09 356-368 


916 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010A 11.79 4.000e-09 160- 
172 PR00010C 11.16 8.071e-09 
182-193 


916 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 8.826e-09 179- 
198 


916 


PD00919 


CALCIUM-BINDING PRECURSOR 
SIGNAL R. 


PD00919A 11.53 1.000e-08 164- 
176 


917 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 4.429e-15 314-327 
BL01248 11.02 1.375e-14 379-392 
BL01248 11.02 7.975e-ll 1011- 
1024 BL01248 11.02 5.245e-09 
1069-1082 BL01248 11.02 5.585e- 
09 899-912 


917 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011A 14.06 8.579e-19 1060- 
1079 PR00011D 14.03 9.250e-17 
1107-1126 PR00011B 13.08 
7.938e-16 846-865 PR00011A 
14.06 1.340e-15 535-554 
PR00011D 14.03 3.800e-15 846- 
865 PR00011A 14.06 5.755e-15 
846-865 PR00011B 13.08 5. 846e- 
15 1107-1126 PR00011D 14.03 
2.286e-14 1060-1079 PR00011B 
13.08 2.333e-14 485-504 
PR00011B 13.08 6.333e-14 1060- 
1079 PR00011D 14.03 7.429e-14 
485-504 PR00011B 13.08 1.458e- 
13 535-554 PR00011D 14.03 
1.849e-13 535-554 PR00011A 
14.06 3.593e-13 1107-1126 
PR00011A 14.06 7.254e-13 485- 
504 PR00011B 13.08 9.847e-13 
798-817 PR00011A 14.06 1.581e- 
12 798-817 PR00011C 24.25 
7.623e-12 817-846 PR00011D 
14.03 1.148e-ll 798-817 
PR00011A 14.06 3.492e-ll 433- 
452 PR00011D 14.03 9.262e-ll 
433-452 PR0001 1C 24.25 8.800e- 
10 440-469 PR00011C 24.25 
3. 143e-09 455-484 PR00011B 
13.08 6.548e-09 433-452 


917 


PD00320 


LAMININ CHAIN EGF-LIKE 


PD00320A 14.49 4.115e-10 813- 
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DOMAIN P. 


827 PD00320A 14.49 8.071e-09 
1075-1089 


918 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 2.636e-10 132- 
143 


918 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790E 29.58 4.621e-10 666- 
714 


918 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177E 20.64 6.203e-12 377- 
404 BL01177D 17.50 3.000e-09 
277-295 BL01177C 17.39 4.000e- 
09 121-140 


918 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 8.412e-15 379- 
395 BL01187B 12.04 5.304e-13 
299-315 BL01187B 12.04 7.652e- 
13 339-355 BL01187B 12.04 
8.826e-13 127-143 BL01187B 
12.04 6.000e-12 46-62 BL01187B 
12.04 5.200e-ll 258-274 
BL01187B 12.04 6.400e-10 86-102 
BL01187A9.98 8.286e-10 110-122 
BL01187A 9.98 5.125e-09 282-294 
BL01187A 9.98 6.250e-09 68-80 
BL01187A 9.98 8.125e-09 323-335 


918 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907G 11.63 6.774e- 10 46-73 
PR00907B 11.29 7.545e-10 215- 
232 PR00907G 11.63 9.321e-10 
379-406 PR00907B 11.29 7.805e- 
09 82-99 PR00907B 11.29 9.232e- 
09 335-352 PR00907D 5.59 
9.427e-09 344-370 


919 


BL0O335 


Parathyroid hormone family proteins. 


BL00335B 24.23 1.000e-40 133- 
182 BL00335A 17.98 8.393e-39 
80-115 


920 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177C 17.39 4.508e-10 92-1 1 1 


920 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 8.043e-13 12-28 
BL01187B 12.04 1.000e-ll 139- 
155 BL01187A9.98 1.474e-ll 82- 
94 BL01187B 12.04 4.900e-l 1 98- 
114 BL01187A9.98 7.429e-10 
122-134 


920 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011A 14.06 2.964e-14 288- 
307 PR0001 IB 13.08 4.356e-13 
288-307 PR00011D 14.03 6.434e- 
13 376-395 PR00011B 13.08 
2.887e-12 376-395 PR00011D 
14.03 5.421e-12 288-307 
PR00011D 14.03 5.721e-ll 331- 
350 PR0001 IB 13.08 5.826e-10 
331-350 PR00011A14.06 8.957e- 
10 376-395 


920 


DM00864 


EGF-LIKE DOMAIN. 


DM00864A 15.21 3.314e-09 232- 
253 


920 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 5.675e-09 236- 
257 


920 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 7.300e-09 67-74 


920 


BL00799 


Granulins proteins. 


BL00799H 14.15 9.083e-09 227- 
268 
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920 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 8.714e-10 54-71 
PR00907B 11.29 9.561e-09 135- 
152 


920 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL002431 31.77 1.000e-08 387-430 


921 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472B 14.67 2.000e-16 35-53 
BL00472A 7.45 4.724e-09 1-13 


922 


BL00132 


Zinc carboxypeptidases, zinc-binding 
region 1 proteins. 


BL00132C 21.35 L837e-25 227- 
268 BL00132A 26.07 9.791e-24 
149-190 BL00132E 17.72 2.350e- 
22 297-324 BL00132F 13.26 
5.3 13e-l 8 325-347 BL00132B 
15.93 5.065e-16 197-211 
BL00132G 10.94 6.318e-14 382- 
400 BL00132D 12.70 5.3 13e-12 
271-286 


922 


PR00765 


C ARB OX YPEPTIDASE A 
METALLOPROTEASE (M14) 
FAMILY SIGNATURE 


PR00765B 15.57 2.875e-15 197- 
212 PR00765D 14.16 8.412e-15 
330-344 PR00765C 12.55 2.432e- 
09 277-286 


923 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514C 17.41 3.618e-25 93-130 
BL00514E 14.28 8.286e-14 153- 
170 BL00514D 15.35 2.915e-12 
134-147 BL00514G 15.98 4.444e- 
12 223-253 


927 


BL00453 


FKBP-type peptidyl -prolyl cis -trans 
isomerase proteins. 


BL00453B 23.86 8.500e-20 80-1 14 
BL00453A 15.57 1.000e-15 55-70 
BL00453C 9.72 8.650e-l 1 109-122 


937 


BL00796 


14-3-3 proteins. 


BL00796C 17.44 6.250e-38 143- 
193 BL00796B 10.67 1.514e-36 
77-110 BL00796D 17.39 5.696e-34 
194-240 BL00796E 14.15 7.353e- 
29 242-278 


937 


PR00305 


14-3-3 PROTEIN ZETA SIGNATURE 


PR00305F 15.95 3.250e-37 248- 
278 PR00305A9.33 L500e-33 77- 
107 PR00305D 16.34 6.400e-29 
194-221 PR00305C8.68 1.000e-28 
159-182 PR00305B 9.99 4.375e-23 
126-151 PR00305E 13.01 3.571e- 
10 221-248 


938 


BL00303 


S-100/ICaBP type calcium binding 
protein. 


BL00303A 21.77 9.526e-31 64-101 
BL00303B 26.15 5.737e-30 111- 
148 


938 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 9.471e-ll 123-136 


940 


PD02365 


CHAIN FACTOR INTERLEUKIN- 1 2 
BETA PRECURSOR IL-1. 


PD02365C 7.89 6.680e-10 342-372 


940 


DM00202 


w T-CELL IG HEAVY ALPHA. 


DM00202A 9.44 9.813e-09 34-44 


944 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.400e-25 319- 
361 BL00284E 19.15 1.000e-15 
508-533 BL00284A 15.64 2.742e- 
15 200-224 BL00284B 17.99 
6. 182e-12 292-313 BL00284D 
16.34 7.070e- 12 430-457 


944 


PD02080 


T-CELL GLYCOPROTEIN CDS 
CHAIN SURFACE ALPHA PRE. 


PD02080A 10.03 9.750e-10 35-51 
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944 


PR00743 


GLYCOSYL HYDROLASE FAMILY 
36 SIGNATURE 


PR00743B 14.95 8.831e-09 240- 
261 


947 


DM01970 


0 kw ZK632 . 1 2 YDR3 1 3 C 
ENDOSOMAL III. 


DM01970B 8.60 4.750e-17 552- 
565 


947 


BL01160 


Kinesin light chain repeat proteins. 


BL01160B 19.54 2.373e-09 203- 
257 


947 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 4.000e-09 559-568 


947 


BL00422 


Granins proteins. 


BL00422E 26.86 8.615e-09 462- 
498 


950 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356D 13.09 8.038e-09 174- 
191 


951 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 4.316e-13 209- 
227 


951 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356B 14.85 6.294e-10 209- 
227 PR00356A 12.90 7.188e-10 
197-210 


952 


BL00570 


Bacterial ring hydroxylating 
dioxygenases alpha-subunit signa. 


BL00570B 19.03 9.357e-09 271- 
303 


953 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 1.931e-09 72-93 


953 


PR00910 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 3.571e-09 107-120 


953 


BL00415 


Synapsins proteins. 


BL00415N 4.29 5.730e-09 13-57 


954 


PR00901 


PHEROMONE B ALPHA- 1 
RECEPTOR SIGNATURE 


PR00901H 14.99 4.706e-09 56-67 


958 


PR00138 


MATRIXIN SIGNATURE 


PR00138C 16.41 6.478e-32 178- 
207 PR00138D 16.56 1.360e-28 
236-262 PR00138B 15.82 8.071e- 
18 155-171 PR00138A 15.14 
5.091e-16 108-122 PR00138E6.01 
9.250e-15 269-283 


958 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.000e-40 177- 
226 .BL00024B 21.53 2.636e-33 
129-163 BL00024D 17.28 4.086e- 
31 230-262 BL00024F 11.30 
2.73 le-22 305-326 BL00024H 
11.35 1.947e-14 41 1-423 
BL00024E 7.58 5.500e-14 269-283 
BL00024G 13.31 4.115e-13 342- 
355 BL00024A 11.49 5.050e- 13 
108-119 


958 


BL00142 


Neutral zinc metallopeptidases, zinc- | 
binding region proteins. 


BL00142 8.38 3.455e-ll 236-247 


958 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 1.643e-09 231- 
250 


958 


BL00546 


Matrixins cysteine switch. 


BL00546B 20.11 1.000e-40 178- 
222 BL00546C 16.41 1.771e-31 
230-262 BL00546A 19.62 5.304e- 
28 88-118 BL00546E 10.23 
3.323e-22 305-326 BL00546G 
16.84 7.300e-19 363-383 
BL00546D 10.34 1.486e-14 269- 
283 BL00546F 12.40 2.800e-13 
342-355 BL00546H 10.76 6.625e- 
12 458-469 BL00546H 10.76 
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4.5l2e-09 41 1-422 


959 


PD02043 


RIBULOSE BISPHOSPHATE 
CARBOXYLAS. 


PD02043A 12.92 5.800e-09 137- 
171 


962 


BL00598 


Chromo domain proteins. 


BL00598 14.45 5.781e~16 63-85 


962 


PR00504 


CHROMODOMAIN SIGNATURE 


PR00504C 11.19 5.186e-10 72-85 
PR00504B 9.12 3.250e-09 57-72 


967 


PF00420 


NADH-ubiquinone/plastoquinone 
oxidoreductase chain 4L. 


PF00420A 16.63 9.526e-10 89-120 


967 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 5.000e-16 185-201 
PR00245A 18.03 6.878e-16 59-81 
PR00245D 10.47 8.500e-15 221- 
233 PR00245E 12.40 1.000e-09 
238-253 


967 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237A 11.48 5.065e-09 26-51 


967 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.184e-12 90-130 
BL00237D 11.23 5.909e-09 229- 
246 


968 


BL00059 


Zinc-containing alcohol 
dehydrogenases proteins. 


BL00059B 16.08 5.705e-14 223- 
251 


968 


BL01162 


Quinone oxidoreductase / zeta- 
crystallin proteins. 


BL01 162C 22.80 5.846e-14 284- 
328 BL01162B 18.11 4.205e-ll 
223-251 BL01162A 15.38 3.805e- 
09 194-217 


970 


PF00676 


Dehydrogenase El component. 


PF00676B 24.71 2.800e-39 257- 
295 PF00676D 14.40 7.545e-24 
341-361 PF00676C 16.88 5.737e- 
23 309-333 PF00676A 12.85 
5.050e-12 131-144 


970 


BL00801 


Transketolase proteins. 


BL00801D 22.48 7.750e-ll 250- 
290 


977 


PR00457 


ANIMAL HAEM PEROXIDASE 
SIGNATURE 


PR00457E 20.67 9.591e-26 409- 
436 PR00457D 16.81 5.667e-22 
384-405 PR00457G 17.45 l.OOOe- 
15 590-611 PR00457B 13.29 
4.343e-15 216-232 PR00457C 
19.25 3.250e-12 366-385 
PR00457A 15.80 5.645e-12 162- 
174 PR00457H 15.90 1.900e-10 
684-699 PR00457F 13.69 6.800e- 
09 461-472 


978 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 3.942e-17 95-124 
BL00269B 19.17 4.122e-15 57-86 
BL00269A 8.53 3.250e-13 31-51 


982 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 8.297e-10 79- 
113 


985 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 9.663e-09 78-106 


987 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 1.000e-09 247- 
261 PR00019B 11.36 2.080e-09 
316-330 PR00019B 11.36 3.160e- 
09 244-258 PR00019B 11.36 
4.240e-09 100-114 PR00019B 
11.36 5.680e-09 148-162 


988 


BL01215 


Mrp family proteins. 


BL01215C 18.97 6.447e-36 191- 
233 BL01215D 30.07 1.000e-33 
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250-300 BL01215A 9.75 4.400e-22 
101-128 BLOIzIjB 9.34 z.oDoe-13 
134-147 


988 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364A 8.19 2.023e-09 105-lzl 


988 


BL00746 


NifH/frxC family proteins. 


BL00746A 24.43 2.819e-09 107- 
152 


988 


PR00091 


NITROGENASE COMPONENT II 
SIGNATURE 


PR00091A 8.10 4.329e-09 109-123 


988 


BL01128 


Shikirnate kinase proteins. 


BL0H28A 18.84 8.221e-09 106- 
140 


989 


PF00094 


von Willebrand factor type D domain 
proteins. 


PF00094B 10.43 6.400e-17 491- 
509 


989 


PF00054 


Laminin G domain proteins. 


PF00054B 16.61 7.300e-09 658- 
670 


989 


BL00779 


Glycoprotein hormones alpha chain 
proteins. 


BL00779A 14.01 7.840e-09 329- 
356 


989 


BL01185 


C-terminal cystine knot proteins. 


BL01185B 21.14 9.153e-09 238- 
287 


992 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 8.986e-ll 78- 
112 


992 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 2.000e-09 154- 
177 


994 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240B 24.70 4.692e-ll 148- 
172 


994 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 5.304e-09 101-111 


994 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 9.234e-09 274- 
309 


995 


BL01009 


Extracellular proteins SCP/Tpx- 
1/Ag5/PR-1/Sc7 proteins. 


BL01009D 14.19 4.300e-20 127- 
148 BL01009A 13.75 6.586e-13 
57-75 BL01009E 13.50 1.439e-ll 
159-175 


995 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00837C 17.21 6.143e-20 126- 
143 PR00837A 14.77 1.973e-13 
57-76 PR00837D 11.12 3.700e-ll 
160-174 


995 


PR00838 


VENOM ALLERGEN 5 SIGNATURE 


PR00838G 16.07 2.033e-17 125- 
145 PR00838D 8.73 4.214e-09 57- 
76 


996 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270B 22.18 5.567e-18 111- 
148 PD01270C 19.54 1.167e-17 
154-183 PD01270A 17.22 4.960e- 
14 57-97 PD01270D 24.66 4.284e- 
09 188-224 


999 


BL01305 


moaA / nifB / pqqE family proteins. 


BL01305D 14.97 7.279e-09 7-22 


1000 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.600e-12 42-65 
BL00290B 13.17 1.474e-ll 98-116 


1002 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PDuUUOO Id. yz D.cS40e-lD ZaJ-Zyo 

PD00066 13.92 1.600e-14 201-214 
PD00066 13.92 2.800e-14 313-326 
PD00066 13.92 2.000e-13 341-354 
PD00066 13.92 5.500e-13 229-242 
PD00066 13.92 8.435e-ll 257-270 


1002 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 9.471e-14 269-286 
BL00028 16.07 3.769e-ll 241-258 
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BL00028 16.07 7.577e-ll 325-342 
BL00028 16.07 7.577e-ll 353-370 
BL00028 16.07 6.700e-10 185-202 
BL00028 16.07 1.257e-09 297-314 



1002 



PR00048 



C2H2-TYPE ZINC FINGER 
SIGNATURE 



PR00048A 10.52 1.000e-12 266- 
280 PR00048A 10.52 3.11 8e- 12 
238-252 PR00048A 10.52 8.94 le- 
12 294-308 PR00048B 6.02 
2.688e-10 310-320 PR00048B 6.02 
3.250e-10 198-208 PR00048A 
10.52 3.348e-10 322-336 
PR00048A 10.52 3.739e-10 210- 
224 PR00048A 10.52 5.696e-10 
350-364 PR00048A 10.52 l.OOOe- 
09 182-196 PR00048B6.02 
1.947e-09 282-292 PR00048B 6.02 
5.737e-09 338-348 



1003 



BL00028 



Zinc finger, C2H2 type, domain 
proteins. 



BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 



16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 



4.706e-14 
6.400e-13 
9.550e-13 
1.000e-12 
1.391e-12 
1.783e-12 
7.652e-12 
9.217e-12 
9.217e-12 
5.846e-ll 
6.538e-ll 
2.800e-10 
6.143e-09 



607-624 
411-428 
579-596 
439-456 
495-512 
523-540 
383-400 
355-372 
663-680 
467-484 
635-652 
327-344 
262-279 



1003 



PD00066 



PROTEIN ZINC-FINGER METAL- 
BINDI. 



PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 



13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 



4.462e-15 
8.615e-15 
1.600e-14 
2.800e-14 
2.800e-14 
5.200e-14 
6.400e-14 
8.800e-14 
7.000e-13 
9.000e-13 
6.870e-ll 
7.600e-09 
7.900e-09 



399-412 
343-356 
427-440 
511-524 
623-636 
595-608 
567-580 
371-384 
315-328 
483-496 
455-468 
539-552 
651-664 



1003 



PR00048 



C2H2-TYPE ZINC FINGER 
SIGNATURE 



PR00048A 10.52 9.182e-15 492- 
506 PR00048A 10.52 1.750e-14 
408-422 PR00048A 10.52 7.000e- 
14 576-590 PR00048A 10.52 
1.000e-13 604-618 PR00048A 
10.52 3.571e-13 380-394 
PR00048A 10.52 4.214e-13 464- 
478 PR00048A 10.52 5.235e-12 
352-366 PR00048B6.02 1.000e-ll 
424-434 PR00048B6.02 1.692e-ll 
340-350 PR00048B 6.02 7.23 le-11 
536-546 PR00048B 6.02 7.23 le- 11 
620-630 PR00048A 10.52 8.579e- 
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11 259-273 PR00048A 10.52 
9.053e-l 1 436-450 PR00048B 6.02 
9.308e-ll 508-518 PR00048B 6.02 
1.563e- 10 592-602 PR00048B 6.02 
2. 125e- 10 396-406 PR00048A 
10.52 4.130e-10 660-674 
PR00048A 10.52 4.522e-10 520- 
534 PR00048A 10.52 6.478e-10 
324-338 PR00048B6.02 1.474e-09 
564-574 PR00048B 6.02 2.421e-09 
676-686 PR00048A 10.52 2.800e- 
09 548-562 PR00048A 10.52 
2.800e-09 632-646 PR00048B 6.02 
2.895e-09 368-378 PR00048B 6.02 
1.000e-08 648-658 


1007 


BL00478 


LIM domain proteins. 


BL00478B 14.79 3.739e-14 469- 
484 BL00478B 14.79 3.500e-12 
411-426 BL00478B 14.79 6.000e- 
12 536-551 


1007 


DM00984 


w MYOD MYOBLAST 
DETERMINATION SHORT. 


DM00984B 15.18 4.822e-25 426- 
481 DM00984C7.66 8.036e-ll 
481-495 


1007 


BL00115 


Eukaryotic RNA polymerase II 
heptapeptide repeat proteins. 


BL00115Z 3.12 7.022e-09 260-309 


1008 


DM00475 


w LOW TRANSPOSASE SAPA 12K. 


DM00475B 12.12 8.269e-09 115- 
135 


1011 


PR00456 


RIBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456F 5.86 6.400e-12 76-88 


1012 


PF00756 


Putative esterase. 


PF00756C 14.12 7.692e-10 103- 
133 


1016 


PR00310 


ANTIPROLIFERATIVE PROTEIN 
BTG1 FAMILY SIGNATURE 


PR00310D 9.10 1.540e-37 183-213 
PR00310C 12.74 5.286e-35 153- 
183 PR00310A 11.17 7.000e-27 
16-41 PR00310E 13.58 6.914e-24 
229-249 PR00310B 10.59 3.687e- 
23 123-153 


1016 


BL00960 


BTG1 family proteins. 


BL00960B 24.47 3.288e-26 1 16- 
161 BL00960C 12.68 3.647e-26 
180-202 BL00960A 10.98 5.304e- 
12 14-26 


1017 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NIL 


PD01066 19.43 6.143e-32 44-83 


1017 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 6.063e-15 523-540 
BL00028 16.07 7.188e-15 383-400 
BL00028 16.07 3.700e-13 467-484 
BL00028 16.07 5.950e-13 439-456 
BL00028 16.07 8.650e-13 271-288 
BL00028 16.07 4.115e-ll 355-372 
BL00028 16.07 6.885e-ll 327-344 
BL00028 16.07 2.800e-10 411-428 
BL00028 16.07 3.100e-10 495-512 


1017 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 6.625e-18 399-412 
PD00066 13.92 2.385e-15 371-384 
PD00066 13.92 8.615e-15 343-356 
PD00066 13.92 3.500e-13 511-524 
PD00066 13.92 l.OOOe- 12 483-496 
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PD00066 13.92 1.000e-ll 427-440 
PD00066 13.92 3.769e-10 455-468 
PD00066 13.92 5.800e-09 539-552 


1017 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 6.727e-15 520- 
534 PR00048A 10.52 1.750e-14 
380-394 PR00048A 10.52 3.250e- 
14 408-422 PR00048A 10.52 
4.857e-13 464-478 PR00048A 
10.52 1.529e-12 436-450 
PR00048A 10.52 2.059e-12 352- 
366 PR00048B6.02 3.769e-ll 
368-378 PR00048B6.02 5.846e-ll 
396-406 PR00048B 6.02 5.846e-ll 
508-518 PR00048A 10.52 1.783e- 
10 492-506 PR00048B 6.02 
8.3 13e-l 0 480-490 PR00048A 
10.52 4.240e-09 324-338 
PR00048B 6.02 5.737e-09 340-350 
PR00048A 10.52 8.200e-09 146- 
160 


1020 


BL00478 


LIM domain proteins. 


BL00478B 14.79 6.000e-15 219- 
234 BL00478B 14.79 8.250e-12 
99-114 BL00478B 14.79 8.250e-12 
160-175 BL00478B 14.79 2.800e- 
11 282-297 


1024 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 9.400e-38 17-56 


1024 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 1.600e-14 230-243 
PD00066 13.92 8.200e-14 286-299 
PD00066 13.92 8.200e-14 398-411 
PD00066 13.92 1.000e-13 342-355 
PD00066 13.92 6.478e-ll 314-327 
PD00066 13.92 6.478e-ll 370-383 
PD00066 13.92 6.870e-ll 258-271 


1024 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 6.850e-13 354-371 
BL00028 16.07 4.522e-12 242-259 
BL00028 16.07 8.826e-12 214-231 
BL00028 16.07 9.609e-12 270-287 
BL00028 16.07 2.731e-ll 326-343 
BL00028 16.07 1.900e-10 382-399 
BL00028 16.07 8.500e-10 298-315 
BL00028 16.07 5.629e-09 410-427 


1024 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479A 19.86 7.319e-09 201- 
224 


1024 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 4.273e-15 351- 
365 PR00048A 10.52 5.500e-13 
239-253 PR00048A 10.52 5.500e- 
13 267-281 PR00048A 10.52 
8.941e-12 323-337 PR00048A 
10.52 5.263e-ll 379-393 
PR00048A 10.52 1.783e-10 407- 
421 PR00048B 6.02 3.250e-10 
339-349 PR00048B 6.02 8.875e-10 
227-237 PR00048B6.02 1.000e-09 
283-293 PR00048B6.02 1.000e-09 
395-405 PR00048A 10.52 1.360e- 
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09 295-309 PR00048B 6.02 
2.895e-09 423-433 PR00048A 
10.52 7.480e-09 21 1-225 



1025 



BL00824 



Elongation factor 1 beta/beta'/delta 
chain proteins. 



BL00824B 9.21 4.892e-09 394-414 



1025 



PR00966 



POTYVIRUS NUCLEAR 
INCLUSION A CYSTEINE 
PROTEASE (C4) SIGNATURE 



PR00966H 13.17 9.727e-09 215- 
235 



1026 



BL01282 



BIR repeat proteins. 



BL01282B 30.49 3.829e-09 12-51 



1026 



PF00992 



Troponin. 



PF00992A 16.67 5.263e-09 136- 
171 



1026 



BL00518 



Zinc finger, C3HC4 type (RING 
finger), proteins. 



BL00518 12.23 8.333e-09 31-40 



1029 



PD02283 



PROTEIN SPORULATION REPEAT 
PRECU. 



PD02283C 17.54 5.613e-09 171- 
199 



1029 



BL00203 



Vertebrate metallothioneins proteins. 



BL00203 13.94 5.213e-10 91-137 
BL00203 13.94 5.041e-09 71-117 
BL00203 13.94 8.898e-Q9 53-99 



1031 



BL00170 



Cyclophilin-type peptidyl-prolyl cis- 
trans isomerase signatur. 



BL00170B 20.97 1.000e-40 48-88 
BL00170C 18.49 1.000e-40 95-140 
BL00170A 17.08 2.452e-16 18-45 



1031 



PR00153 



CYCLOPHILIN PEPTIDYL-PROLYL 
CIS-TRANS ISOMERASE 
SIGNATURE 



PR00153C 11,01 4.375e-20 96-112 
PR00153B 11.57 1.500e-17 53-66 
PR00153E 9.10 7.632e-17 124-140 
PR00153D 11.99 6.400e-16 111- 
124 PR00153A 12.98 3.093e-ll 
24-40 



1031 



PF00638 



RanBPl domain proteins. 



PF00638 11.91 5.569e-09 68-83 



1035 



PD01066 



PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 



PD01066 19.43 2.149e-29 6-45 



1035 



PD00066 



PROTEIN ZINC-FINGER METAL- 
BINDI. 



PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 



13.92 1 
13.92 3 
13.92 3 
13.92 5 
13.92 3 
13.92 5 
13.92 5 
13.92 5 
13.92 8 
13.92 3 
13.92 7 
13.92 8 
13.92 3 
13.92 5 



.692e-15 
,077e-15 
.077e-15 
,846e-15 
.400e-14 
,800e-14 
.800e-14 
.800e-14 
,800e~14 
500e-13 
,500e-13 
714e-12 
700e-09 
500e-09 



234-247 
374-387 
458-471 
486-499 
150-163 
206-219 
290-303 
318-331 
514-527 
346-359 
402-415 
178-191 
430-443 
262-275 



1035 



PR00048 



C2H2-TYPE ZINC FINGER 
SIGNATURE 



PR00048A 10.52 4.750e-14 215- 
229 PR00048A 10.52 4.750e-14 
467-481 PR00048A 10.52 7.000e- 
14 355-369 PR00048A 10.52 
4.857e-13 299-313 PR00048A 
10.52 5.500e-13 99-113 PR00048A 
10.52 5.500e-13 327-341 
PR00048A 10.52 1.000e-12 495- 
509 PR00048A 10.52 2.059e-12 
271-285 PR00048A 10.52 2.588e- 
12 523-537 PR00048A 10.52 
8.412e-12 243-257 PR00048A 
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BL00028 



1038 



1038 



1042 



1042 



BL01130 



DM01292 



PD02327 



1043 



PD02365 



BL00983 



Description 



Zinc finger, C2H2 type, domain 
proteins. 



Sulfate transporters proteins. 



ESICULAR LUMEN DOMAIN. 



GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 



CHAIN FACTOR INTERLEUKIN-12 
BETA PRECURSOR IL-1. 



Ly-6 / u-PAR domain proteins. 



Results* 



10.52 4.316e-ll 131-145 
PR00048B 6.02 4.462e-ll 115-125 
PR00048A 10.52 8.579e-ll 439- 
453 PR00048A 10.52 9.053e-ll 
383-397 PR00048A 10.52 1.391e- 
10 159-173 PR00048B 6.02 
1.563e-10 455-465 PR00048A 
10.52 3.739e-10 411-425 
PR00048B 6.02 6.625e-10 203-213 
PR00048B 6.02 6. 625 e- 10 287-297 
PR00048A 10.52 3.160e-09 551- 
565 PR00048A 10.52 4.600e-09 
187-201 PR00048B 6.02 8.579e-09 
147-157 



BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 



16.07 1.900e- 
16.07 2.350e- 
16.07 8.826e- 
16.07 2.038e- 
16.07 5.500e- 
16.07 5.846e- 
16.07 6.192e- 
16.07 8.269e- 
16.07 1.300e- 
16.07 1.600e- 
16.07 3.700e 
16.07 6.700e- 
16.07 7.000e 
16.07 l.OOOe 
16.07 3.314e 
16.07 l.OOOe 



■13 358-375 
■ 13 102-119 
12 470-487 
•11 554-571 
41 190-207 
■11 274-291 
41 218-235 
41 330-347 
40 526-543 
40 414-431 
40 386-403 
40 442-459 
40 302-319 
-09 134-151 
-09 246-263 
-08 498-515 



BL01130A 21.63 7.407e-25 331- 
385 BL01130B 23.34 2.286e-23 
429-481 



DM01292I 12.82 9.400e-10 148- 
190 DM01292I 12.82 9.400e-10 
591-633 



PD02327B 19.84 5.574e-10 169- 
191 



PD02365C 7.89 4.196e-09 365-395 



BL00983C 12.69 3.500e-15 142- 
158 BL00983B 8.19 1. 643e-12 84- 
94 BL00983A 5.84 7.261e-10 74- 
83 



1047 



BL00290 



1051 



BL01221 



Immunoglobulins and major 
histocompatibility complex proteins. 



BL00290B 13.17 6.400e-22 281- 
299 BL00290A 20.89 4.600e-16 
34-57 BL00290A 20.89 2.080e-10 
224-247 



PMP-22 / EMP / MP20 family proteins. 



BL01221B 13.29 6.745e-09 54-68 



1052 



BL00983 



1052 



BL00272 



1054 



BL00420 



1054 



PR00258 



Ly-6 / u-PAR domain proteins. 



BL00983C 12.69 4.981e-Q9 46-62 



Snake toxins proteins. 



BL00272C 8.27 8.326e-09 50-62 



Speract receptor repeat proteins domain 
proteins. 



BL00420B 22.67 4.627e-30 723- 
778 BL00420C 11.90 9.100e-13 
809-820 



SPERACT RECEPTOR SIGNATURE 



PR00258B 9.63 3.813e-15 738-750 
PR0Q258E 13.33 2.047e-12 808- 
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821 PR00258C9.05 2.837e-10 
753-764 


1054 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514G 15.98 4.326e-09 542- 
572 


1055 


BL01212 


ATP P2X receptors proteins. 


BL01212A 34.89 1.000e-40 41-94 
BL01212E 24.87 1.000e-40 225- 
280 BL01212G 11.86 3.700e-34 
309-337 BL01212D 11.42 9.609e- 
27 182-206 BL01212B 19.25 
8.393e-21 126-151 BL01212F 
10.12 2.421e-15 290-301 
BL01212C 8.40 2.500e-14 158-169 


1056 


PR00920 


SPUMAVIRUS ASPARTIC 
PROTEASE (A9) SIGNATURE 


PR00920C 13.24 7.310e-09 149- 
171 


1057 


BL00682 


ZP domain proteins. 


BL00682C 20.71 1.706e-12 439- 
464 


1057 


BL00025 


P-type 'Trefoil' domain proteins. 


BL00025 17.17 5.645e-09 231-252 


1059 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.273e-14 98-138 


1059 


PR00854 


PROSTAGLANDIN D RECEPTOR 
SIGNATURE 


PR00854E 10.50 4.649e-26 236- 
260 PR00854B 7.30 8.154e-21 41- 
59 PR00854G 10.66 1.783e-18 
341-358 PR00854D9.41 2.500e-18 
185-201 PR00854A 15.24 9.077e- 
18 6-21 PR00854H 14.71 6.203e- 
17 369-390 PR00854C 12.92 
1.643e-12 93-105 PR00854F 12.83 
9.682e-ll 321-333 


1059 


PR00856 


PROSTACYCLIN (PROSTANOID IP) 
RECEPTOR SIGNATURE 


PR00856E 9.82 1.724e-09 178-195 


1060 


BL01271 


Sodium: sulfate symporter family 
proteins. 


BL01271D 25.26 1.000e-40 480- 
535 BL01271B 12.02 6.400e-24 
208-233 BL01271A 8.06 7.955e-23 
132-152 BL01271C 13.62 7.429e- 
20 407-429 


1062 


PF00798 


Arenavirus glycoprotein. 


PF00798I 18.55 8.811e-09 53-90 


1064 


BL01017 


Ergosterol biosynthesis ERG4/ERG24 
family proteins. 


BL01017D 20.82 1.000e-40 232- 
278 BL01017F 23,34 9.196e-35 
291-344 BL01017C 15.91 7.324e- 
23 181-207 BL01017B 12.69 
9.419e-17 166-181 


1065 


BL00874 


Bacterial type II secretion system 
protein F proteins. 


BL00874B 29.89 9.724e-09 414- 
469 


1066 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270C 19.54 2.895e-16 43-72 


1066 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 8.435e-09 183-193 


1067 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 3.455e-14 77-117 


1067 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 1.257e-10 91-114 
PR(JUz3/xi 13. y. luue-iu l/d- 
199 


1067 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 9.581e-18 46-68 
PR00245C 7.84 4.780e-13 214-230 
PR00245E 12.40 6.741e-09 267- 
282 


1067 


PR00534 


MELANOCORTIN RECEPTOR 
FAMILY SIGNATURE 


PR00534A 11.49 9.229e-09 38-51 
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1069 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 5.909e-15 409- 
440 BL00107B 13.31 4.214e-ll 
484-500 


1069 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 4.353e-09 549- 
572 


1071 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.789e-13 222- 
245 


1072 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 8.892e-10 4-14 
PR00308C 3.83 8.892e-10 5-15 
PR00308C 3.83 8.013e-09 3-13 


1072 


PR00698 


C.ELEGANS SRG FAMILY 

integral membrane protein 
signature: 


PR00698E 14.43 8.714e-09 111- 
137 


1075 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 7.000e-ll 69-85 
PF00023B 14.20 2.636e-09 131- 
141 


1075 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 6.087e-09 128- 
141 


1075 


PR00806 


VINCULIN SIGNATURE 


PR00806C 11.07 8.839e-09 350- 
368 


1075 


PF00791 


Domain present in ZO-1 and Unc5-like 
netrin receptors. 


PF00791B 28.49 9.505e-09 135- 
190 PF00791B 28.49 9.835e-09 
69-124 


1076 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.610e-ll 174- 
214 BL00237C 13.19 4.176e-10 
317-344 


1076 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 7.677e-ll 322- 
347 PR00237E 13.03 6.1 OOe- 10 
276-300 PR00237A 11.48 8.839e- 
09 103-128 


1077 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 1.831e-09 139- 
189 


1079 


BL00462 


Gamma- glutamyltr ansp epti das e 
proteins. 


BL00462A 20.89 4.000e-20 108- 
151 BL00462D 23.07 7.256e-12 
356-396 BL00462B 17.88 9.153e- 
12 183-220 


1080 


BL00115 


Eukaryotic RNA polymerase II 
heptapeptide repeat proteins. 


BL00115Z 3.12 2.125e-09 1363- 
1412 BL00115Z3.12 6.096e-09 
1349-1398 


1083 


BL00272 


Snake toxins proteins. 


BL00272C 8.27 9.182e-10 109-121 


1083 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 3.348e-14 366- 
397 BL00107B 13.31 4.176e-09 
441-457 ! 


1083 


PR00653 


ACTIVIN TYPE II RECEPTOR 
SIGNATURE 


PR00653D 13.25 7.200e-09 385- 
407 


1083 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 6.727e-ll 535- 
558 PR00109D 17.04 7.609e-09 
442-465 


1083 


BL00983 


Ly-6 / u-PAR domain proteins. 


BL00983C 12.69 y.liDe-Uy IUj- 
121 


1084 


BL00272 


Snake toxins proteins. 


BL00272C 8.27 9.182e-10 109-121 


1084 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 3.348e-14 428- 
459 BL00107B 13.31 4.176e-09 
503-519 


1084 


PR00653 


ACTIVTN TYPE II RECEPTOR 
SIGNATURE 


PR00653D 13.25 7.200e-09 447- 
469 
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1084 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109E 14.41 6.727e-U 597- 
620 PR00109D 17.04 7.609e-09 
504-527 


1084 


BL00983 


Ly~6 / u-PAR domain proteins. 


BL00983C 12.69 9.135e-09 105- 
121 


1085 


PR00541 


MUSCARINIC M4 RECEPTOR 
SIGNATURE 


PR00541C 8.06 7.726e-09 486-507 


1086 


PR00541 


MUSCARINIC M4 RECEPTOR 
SIGNATURE 


PR00541C 8.06 7.726e-09 755-776 


1087 


PD01101 


INHIBITOR HEAVY CHAIN 
CHANNEL IN. 


PD01101B 21.53 3.318e-22 343- 
396 


1088 


BL00129 


Glycosyl hydrolases family 31 proteins. 


BL00129A 26.21 2.400e-28 114- 
160 BL00129D 16.76 6.806e-26 
364-408 BL00129C 15.12 5.295e- 
24 326-354 BL00129E 22.60 
4.857e-23 428-464 BL00129B 
19.19 4.436e-15 225-252 
BL00129F 26.19 2.500e-13 544- 
582 


1090 


PR00887 


STRUCTURE-SPECIFIC 
RECOGNITION PROTEIN 
SIGNATURE 


PR00887A 11.39 1.643e-22 343- 
360 PR00887F 12.74 2.000e-22 
498-516 PR00887B 9.94 3.250e-22 
365-382 PR00887C 13.16 4.000e- 
22 388-405 PR00887E 10.36 
5.200e-22 480-499 PR00887H 
11.84 8.313e-22 537-556 
PR00887G 14.17 9.438e-20 521- 
538 PR00887D 15.12 8.313e-17 
453-467 


1090 


PR00886 


HIGH MOBILITY GROUP 
(HMG1/HMG2) PROTEIN 
SIGNATURE 


PR00886C 11.84 8.500e-13 696- 
715 PR00886A 10.08 3.192e-10 
710-733 


1090 


PD02448 


TRANSCRIPTION PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 5.576e-10 686-725 


1090 


BL00353 


HMG1/2 proteins. 


BL00353B 11.47 8.244e-24 664- 
714 BL00353A 9.60 2.549e-09 
674-723 


1091 


BL00284 


Serpins proteins. 


BL00284C 28.56 4.000e-25 472- 
514 BL00284D 16.34 5.655e-17 
578-605 BL00284A 15.64 2.742e- 
15 341-365 BL00284E 19.15 
4.818e-15 659-684 BL00284B 
17.99 3.667e-14 445-466 
BL00284A 15.64 2.600e-ll 375- 
399 


1092 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 11.20 8.119e-10 357- 
375 


1097 


BL00605 


ATP synthase c subunit proteins. 


BL00605 27.67 3.172e-33 79-133 


1097 


PR00124 


ATP SYNTHASE C SUBUNIT 
SIGNATURE 


PR00124C YLAZ o.4uue-lo Llj- 
139 PR00124A 8.81 8.054e-14 75- 
95 PR00124B 14.66 6.897e-12 96- 
112 


1098 


BL00450 


Aconitase family proteins. 


BL00450B 42.34 8.393e-30 386- 
441 BL00450D 21.14 2.800e-18 
665-689 BL00450E 16.34 8.875e- 
13 710-725 BL00450B 42.34 
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6.400e-12 446-501 BL00450A 
13.76 2.406e-ll 351-365 
BL00450C 11.95 6.657e-10 612- 
622 


1098 


PR00415 


ACONITASE FAMILY SIGNATURE 


PR00415D 12.72 5.696e-16 390- 
406 PR00415I 13.62 4.1 15e-15 
675-689 PR00415G 14.24 8.105e- 
15 548-563 PR00415C 13.34 
7.828e-14 376-390 PR00415E 
10.04 7.828e-14 452-466 
PR00415F 11.66 7.273e-13 466- 
480 PR00415H 12.39 9.700e-13 
613-625 PR00415A 11.15 1.621e- 
10 323-337 PR00415B 8.14 
9.036e-09 347-356 


1104 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 1.00Ue-ll 13y-14y 


1104 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240B 24.70 4.255e-09 55-iUy 


1106 


BL01002 


Translationally controlled tumor 
protein. 


BL01002C 21.97 6.143e-2o /y-iiu 
BL01002A 13.19 1.360e-24 1-24 

t~it r\ 1 aaid Tin'} i i o_ i A AO /CO 

BLQ1002B V.jy d.l loe-14 4o-oz 


1107 


PF00997 


Kappa casein. 


PF00997D 9.95 8.306e-09 513-548 


1109 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 1.391e-09 42-55 


1110 


BL01310 


ATP1G1 / PLM / MAT8 family 
proteins. 


BL01310 14.74 8.981e-24 99-135 


1112 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764B 13.56 2.250e-ll 122- 
143 


1112 


PR00261 


LOW DENSITY LIPOPROTEIN 
(LDL) RECEPTOR SIGNATURE 


PR00261E 11.08 6.308e-09 127- 
149 PR00261F 11.57 7.152e-09 
127-149 


1115 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.273e-14 188- 
228 


1115 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 3.250e-19 157- 
179 PR00245B 10.38 1.918e-09 
275-290 


1115 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 4.150e-09 202- 
225 


1116 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.658e-12 163- 
203 


1116 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 9.325e-19 132- 
154 PR00245C7.84 4.073e-15 
311-327 PR00245B 10.38 5.500e- 
13 250-265 PR00245E 12.40 
7.618e-13 364-379 PR00245D 
10.47 4.673e-09 347-359 


1116 


PR00237 


RHODOPSIN-LIKE GPCR 

oUirilKrAJYllJL I olvjrlN/Yl UxvCr 


PR00237C 15.69 6.400e-10 177- 
900 PR00237G 19 63 5 814e-09 
345-372 


1119 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 7.955e-13 969- 
997 PD01719A 12.89 8.111e-09 
305-333 


1120 


BL00355 


HMG14 and HMG17 proteins. 


BL00355 5.97 1.692e-37 18-49 


1120 


PR00925 


NONHISTONE CHROMOSOMAL 
PROTEIN HMG17 FAMILY 


PR00925A 5.47 2.800e-19 18-33 
PR00925B 3.73 3.400e-16 34-47 



WO 03/054152 



PCT/US02/39555 



Table 3A 
322 



SEQ ID 


Database entry 
ID 


Description 


Results* 






SIGNATURE 


PR00925D 6.56 2.200e-13 66-77 
PR00925C 5.57 8.235e-09 47-58 


1124 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 4.240e-ll 210- 
228 


1125 


PD02327 ! 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327B 19.84 2.091e-09 191- 
213 


1125 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 7.652e-09 338-348 


1127 


PD02870 1 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 8.755e-09 96-131 


1133 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 8.364e-14 122- 
144 PR00245C 7.84 9.280e-13 
300-316 PR00245B 10.38 4.600e- 
11 240-255 PR00245E 12.40 
7.623e-10 353-368 


1133 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.371e-13 153- 
193 BL00237D 11.23 7.750e-10 
344-361 


1133 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 6.063e-12 334- 
361 PR00237C 15.69 6.175e-09 
167-190 


1134 


BL00221 


MIP family proteins. 


BL00221B 10.22 1.871e-ll 141- 
152 BL00221D 12.33 2.174e-ll 
240-255 BL00221E8.47 9.710e-ll 
307-318 BL00221A6.39 5.935e-09 
92-103 


1134 


PR00783 


MAJOR INTRINSIC PROTEIN 
FAMILY SIGNATURE 


PR00783B 15.98 4.130e-15 127- 
152 PR00783F 12.33 9.156e-14 
308-329 PR00783A 12.72 7.462e- 
12 88-108 PR00783E 16.78 
8.263e-10 128-151 PR00783C 
13.54 1.340e-09 164-184 
PR00783E 16.78 6.754e-09 226- 
249 


1136 


PD02886 


GLYCOPROTEIN PRECURSOR 
IMMUNOGLOBULIN FOL. 


PD02886C 21.92 7.907e-10 112- 
151 


1137 


PR00122 


VACUOLAR ATP SYNTHASE 16 KD 
SUBUNIT SIGNATURE 


PR00122C8.20 1.000e-33 104-131 
PR00122B 8.60 2.125e-28 56-81 
PR00122D 9.97 4.375e-28 131-155 
PR00122A 11.44 6.053e-19 30-55 


1137 


BL00605 


ATP synthase c subimit proteins. 


BL00605 27.67 1.778e-10 94-148 


1137 


PR00124 


ATP SYNTHASE C SUBUNIT 
SIGNATURE 


PR00124C 12.42 2.161e-10 128- 
154 


1138 


BL00665 


Dihydrodipicolinate synthetase 
proteins. 


BL00665B 30.33 8.265e-12 52-105 
BL00665D 14.76 1.000e-ll 164- 
187 BL00665C 25.58 5.832e-ll 
105-156 


1138 


PR00146 


DIHYDRODIPICOLINATE 
SYNTHASE SIGNATURE 


PR00146D 16.26 2.525e-10 163- 


1139 


BL00456 


Sodium: solute symporter family 
proteins. 


BL00456C 24.55 4.886e-28 165- 
220 BL00456A 22.59 3.127e-27 
27-82 BL00456B 18.94 1.220e-17 
103-133 


1139 


BL00415 


Synapsins proteins. 


BL00415O 3.44 6.270e-09 514-552 


1139 


BL00136 


Serine proteases, subtilase family, 
aspartic acid proteins. 


BL00136B 9.63 7.796e-09 773-786 
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1141 


BL00310 


Lysosome-associated membrane 
glycoproteins duplicated domain 
proteins. 


BL00310F 23.26 4.162e-09 194- 
249 


1143 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.115e-18 126- 
166 BL00237C 13.19 7.545e-15 
263-290 BL00237D 11.23 8.962e- 
11 324-341 


1143 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 7.120e-15 314- 
341 PR00237F 13.57 2.565e-14 
268-293 PR00237C 15.69 6.667e- 
12 140-163 PR00237A 11.48 
8.125e-ll 63-88 PR00237B 13.50 
1.563e-10 96-118 PR00237E 13.03 
3,118e-09 226-250 


1144 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 2.068e-09 7-22 


1145 


BL01159 


WW/rsp5/WWP domain proteins. 


BL01159 13.85 1.310e-14 387-402 


1145 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 6.906e-15 387- 
402 PR00403A 16.82 5.200e-ll 
373-387 


1145 


BL01179 


Phosphotyrosine interaction domain 
proteins (PID) profile. 


BL01179A 12.63 8.286e-ll 394- 
406 BL01179B 15.18 7.968e-10 
667-682 


1147 


BL00594 


Aromatic amino acids permeases 
proteins. 


BL00594A 16.75 3.851e-09 107- 
151 


1148 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399B 14.27 1.305e-09 242- 
256 


1148 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 6.318e-09 279- 
293 


1150 


BL00291 


Prion protein. 


BL00291A 4.49 8.241e-09 21-56 


1152 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 6.447e-12 210- 
250 


1152 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 3.512e-09 392- 
419 PR00237C 15.69 4.825e-09 
224-247 


1152 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 1.500e-20 179- 
201 PR00245B 10.38 3.571e-16 
297-312 PR00245E 12.40 l.OOOe- 
12 411-426 PR00245D 10.47 
1.000e-10 394-406 PR00245C 7.84 
6.727e-09 358-374 


1153 


PR00962 


LETHAL(2) GIANT LARVAE 
PROTEIN SIGNATURE 


PR00962B 11.98 2.800e-28 310- 
333 PR00962G 15.71 5.655e-28 
609-634 PR00962D 10.40 1.225e- 
27 451-475 PR00962F 12.39 
6.786e-23 568-588 PR00962H 
13.32 9.710e-23 639-659 
PR00962I 11.68 3.829e-22 708-728 
PR00962C o.UU 4.zjUe-zz doz-joJ 
PR00962A 13.28 7.612e-22 17-36 
PR00962E 8.81 1.628e-20 531-550 


1153 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 9.122e-09 454- 
469 


1155 


BL00218 


Amino acid permeases proteins. 


BL00218D 21.49 2.038e-10 385- 
430 BL00218E 23.30 6.400e-10 
466-506 BL00218B21.44 5.790e- 
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09 217-249 


1155 


BL00341 


Surfactant associated polypeptide SP-C 
palmitoylation site proteins. 


BL00341B 8.70 7.895e-09 54-88 


1158 


PR00783 


MAJOR INTRINSIC PROTEIN 
FAMILY SIGNATURE 


PR00783C 13.54 1.474e-17 31-51 


1158 


BL00221 


MIP family proteins. 


BL00221B 10.22 1.643e-14 8-19 


1158 


PD00302 


PROTEASE POLYPROTEIN 
HYDROLASE ASP. 


PD00302B 9.52 1.360e-14 261-277 
PD00302A 6.33 3.323e-ll 198-209 


1158 


PF00692 


dUTPase. j 


PF00692B 8.14 3.613e-ll 113-124 


1158 


DM00892 


3 RETROVIRAL PROTEINASE. 


DM00892C 23.55 4.818e-13 292- 
326 DM00892B 9.78 1.000e-08 
264-270 


1160 


PD00320 


LAMININ CHAIN EGF-LIKE 
DOMAIN P. 


PD00320A 14.49 8.286e-12 1681- 
1695 


1160 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 1.225e-09 1745- 
1761 


1160 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 9.308e-18 1705- 
1718 BL01248 11.02 4.000e-15 
393-406 BL01248 11.02 1.900e-ll 
321-334 BL01248 11.02 6.094e-09 
445-458 


1160 


PR00877 


PLANT PEC FAMILY 
METALLOTHIONEIN SIGNATURE 


PR00877D 4.18 6.516e-09 1308- 
1316 


1160 


PF00054 


Laminin G domain proteins. 


PF00054B 16.61 8.200e-09 558- 
570 


1160 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011B 13.08 2.841e-17 547- 
566 PR0001 ID 14.03 4.150e-16 
547-566 PR00011B 13.08 7.750e- 
16 1370-1389 PR00011A 14.06 
1.170e-15 1277-1296 PR00011D 
14.03 4.000e-15 501-520 
PR00011A 14.06 4.736e-15 547- 
566 PR00011A 14.06 1.327e-14 
1370-1389 PR00011A 14.06 
5.909e-14 1416-1435 PR00011D 
14.03 7.980e-14 1370-1389 
PR00011D 14.03 3.717e-13 1416- 
1435 PR00011D 14.03 6.434e-13 
1277-1296 PR00011B 13.08 
8.932e-13 1277-1296 PR00011A 
14.06 2.161e-12 436-455 
PR00011D 14.03 6.053e-12 436- 
455 PR00011B 13.08 1.138e-ll 
436-455 PR00011B 13.08 6.954e- 
11 1416-1435 PR00011A 14.06 
8.062e-ll 1318-1337 PR00011A 
14.06 1.913e-10 695-714 
PR00011B 13.08 2.043e-10 644- 
663 PR0001 ID 14.03 3.215e-10 
695-714 PR00011C24.25 4.600e- 
10 651-680 PR00011C 24.25 
9.100e-10 1423-1452 PR00011D 
14.03 1.261e-09 644-663 
PR00011B 13.08 1.370e-09 695- 
714 PR00011A 14.06 1.986e-09 
644-663 PR00011B 13.08 3.096e- 
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09 1318-1337 PR00011D 14.03 
8.435e-09 591-610 PR00011C 
24.25 9.857e-09 1769-1798 


1161 


PD00919 


CALCIUM-BINDING PRECURSOR 
SIGNAL R. 


PD00919A 11.53 8.377e-10 185- 
197 


1161 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 9.486e-10 158- 
174 BL01187B 12.04 2.800e-09 
196-212 


1161 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 4.158e-12 39-58 
PR00011B 13.08 2.973e-09 39-58 


1161 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 5.929e-09 163- 
174 


1161 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243H 17.53 7.276e-09 65-91 


1161 


BL01185 


C-terminal cystine knot proteins. 


BL01185B 21.14 9.047e-09 137- 
186 


1161 


DM00060 


338 lew NEUREXIN ALPHA III 
CYSTEINE. 


DM00060 6.92 9.460e-09 108-118 


1161 


BL01248 


Laminin-type EGF-like (LE) domain 
proteins. 


BL01248 11.02 9.660e-09 48-61 


1162 


BL00252 


Interferon alpha, beta and delta family 
proteins. 


BL00252A 18.49 6.657e-23 35-72 
BL00252B 19.78 9.125e-16 73-124 


1162 


PR00266 


INTERFERON ALPHA AND BETA 
SUBUNIT SIGNATURE 


PR00266A 13.61 1.000e-13 67-80 


1163 


PR00264 


INTERLEUKIN- 1 SIGNATURE 


PR00264B 20.98 8.453e-ll 63-90 
PR00264C 17.77 1.851e-10 103- 
132 


1164 


BL00249 


Platelet-derived growth factor (PDGF) 
family proteins. 


BL00249 18.21 3.250e-34 75-105 


1165 


BL00253 


Interleukin-1 proteins. 


BL00253D 25.67 3.464e-ll 95-135 


1165 


PR00264 


INTERLEUKIN- 1 SIGNATURE 


PR00264C 17.77 3.294e-17 95-124 
PR00264B 20.98 6.250e-09 56-83 


1166 


PR00573 


INTERLEUKIN 8B RECEPTOR 
SIGNATURE 


PR00573D 15.57 L450e-20 114- 
130 


1166 


PR00427 


INTERLEUKIN- 8 RECEPTOR 
SIGNATURE 


PR00427E 7.08 9.446e-18 6-22 
PR00427G 10.85 2.667e-16 98-115 
PR00427F 11.77 9.500e-13 48-65 


1166 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 4.789e-16 23-48 
PR00237G 19.63 1.844e-12 70-97 


1166 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 1.857e-16 18-45 
BL00237D 11.23 6.464e-10 80-97 


1167 


PR00414 


PALMITOYL PROTEIN 
THIOESTERASE SIGNATURE 


PR00414A 10.74 1.000e-31 26-49 
PR00414D 17.19 1.450e-27 151- 
171 PR00414E 16.76 1.600e-27 
185-205 PR00414G 12.38 4.000e- 
25 258-277 PR00414B 12.48 
4.2 14e-25 66-86 PR00414C 13.14 
1.000e-22 106-123 


1168 


BL00284 


Serpins proteins. 


BL00284C 28.56 4.000e-25 204- 
246 BL00284A 15.64 7.750e-22 
73-97 BL00284E 19.15 8.826e-19 
391-416 BL00284D 16.34 5.655e- 
17 310-337 BL00284B 17.99 
3.667e-14 177-198 BL00284A 
15.64 2.600e-ll 107-131 


1170 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 103- 
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132 BL00269A 8.53 2.607e-20 38- 
58 BL00269B 19.17 5.500e-17 65- 
94 


1172 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383A 13.34 5.235e-16 274- 
289 BL00383F 15.51 1.429e-15 
477-493 BL00383E 10.35 7.000e- 
15 439-450 BL00383D 11.92 
4.000e-13 403-416 BL00383C 
10.10 6.344e-10 326-337 


1172 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700B 16.80 5.320e-21 313- 
334 PR00700D 12.47 9.217e-19 
436-455 PR00700C 13.17 3.143e- 
15 399-417 PR00700F 11.18 
8.941e-13 483-494 PR00700A 6.96 
8.714e-10 295-303 PR00700E 
17.57 7.618e-09 467-483 


1174 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472A 7.45 3.483e-09 1-13 


1175 


BL00472 


Small cytokines (intercrine/chemokine) 
C-C subfamily signatur. 


BL00472A 7.45 3.483e-09 1-13 


1187 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 4.273e-09 88-102 


1188 


PR00860 


VERTEBRATE 

METALLOTHIONEIN SIGNATURE 


PR00860B 7.04 6.776e-09 52-66 


1189 


BL00280 


Pancreatic trypsin inhibitor (Kunitz) 
family proteins. 


BL00280 24.61 1.5 14e-30 276-320 
BL00280 24.61 6.553e-24 238-282 


1189 


PR00759 


BASIC PROTEASE (KUNITZ- TYPE) 
INHIBITOR FAMILY SIGNATURE 


PR00759C 14.15 2.385e-14 304- 
320 PR00759B 11.26 8.105e-12 
256-267 PR00759B 11.26 2.452e- 
10 294-305 


1189 


BL00213 


Lipocalin proteins. 


BL00213A 12.95 2.636e-09 38-52 


1189 


PR00179 


LIPOCALIN SIGNATURE 


PR00179B 9.56 8.125e-12 122-135 
PR00179A 13.78 6.760e-10 38-51 
PR00179C 19.02 6.727e-09 150- 
166 


1190 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 1.000e-10 83- 
117 


1191 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 1.000e-10 83- 
117 


1198 


BL00154 


E1-E2 ATPases phosphorylation site 
proteins. 


BL00154G 21.18 9.783e-09 203- 
237 


1200 


DM01970 


0 kw ZK632.12 YDR313C 
ENDOSOMAL III. 


DM01970B 8.60 3.494e-09 120- 
133 


1201 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.500e-28 35-74 


1202 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.500e-28 35-74 


1204 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134A 11.96 7.158e-19 81-98 


1204 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 7.353e-18 82-98 


1204 


BL00495 


Apple domain proteins. 


BL00495K 12.58 4.543e-ll 83-116 


1204 


BL01253 


Type I fibronectin domain proteins. 


BL01253E 16.01 3.415e-13 154- 
191 BL01253D4.84 9.585e-ll 81- 
95 


1204 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 2.370e-17 81-99 
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BL00021C 22.21 6.604e-09 165- 
187 


1208 


PR00179 


LIPOCALIN SIGNATURE 


PR00179B 9.56 7.214e-09 121-134 


1212 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364D 10.54 6.700e-09 172- 
189 


1212 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 8.615e-13 177- 
191 PR00019B 11.36 2.286e-12 
174-188 PR00019All.19 4.913e- 
10 319-333 PR00019B 11.36 
4.960e-09 387-401 PR00019A 
11.19 8.667e-09 151-165 


1213 


BL00132 


Zinc carboxypeptidases, zinc-binding 
region 1 proteins. 


BL00132C 21.35 3.308e-28 129- 
170 BL00132B 15.93 1.871e-16 
99-113 BL00132A 26.07 1.682e-14 
50-91 BL00132F 13.26 7.254e-14 
228-250 BL00132D 12.70 2.875e- 
12 173-188 BL00132E 17.72 
3.552e-12 199-226 


1213 


PR00765 


CARBOXYPEPTIDASE A 
METALLOPROTEASE (Ml 4) 
FAMILY SIGNATURE 


PR00765B 15.57 7.857e-16 99-114 
PR00765D 14.16 5.500e-ll 233- 
247 PR00765C 12.55 1.290e-10 
179-188 


1214 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270C 19.54 5.329e-09 32-61 


1221 


PR00007 


COMPLEMENT C1Q DOMAIN 
SIGNATURE 


PR00007A 19.33 L000e-16 143- 
170 PR00007C 15.60 8.200e-15 
214-236 PR00007B 14.16 5.846e- 
14 170-190 PR00007D9.64 
5.250e- 10 249-260 


1221 


BL01113 


Clq domain proteins. 


BL01113B 18.26 1.581e-29 149- 
185 BL01113C 13.18 3.077e-15 
214-234 BL01113A 17.99 1.243e- 
13 50-77 BL01113A 17.99 6. 108e- 
13 35-62 BL01113A 17.99 3.077e- 
12 41-68 BL01113A 17.99 1.574e- 
10 38-65 BL01113A 17.99 9.617e- 
10 44-71 BL01 113A 17.99 7.577e- 
09 59-86 


1221 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 2.038e-12 86-115 
BL00420A 20.42 5.154e-12 44-73 
BL00420A 20.42 4.185e-09 47-76 
BL00420A 20.42 9.031e-09 50-79 


1223 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240E 11.56 2.043e-21 260- 
298 BL00240F 17.74 4.035e-18 
313-361 BL00240G 28.45 4.923e- 
17 361-414 


1223 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001B 17.40 3.077e-16 271- 
292 


1223 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 4.500e-19 274- 
305 BL00107B 13.31 7.231e-12 
342-358 


1223 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109D 17.04 4.462e-20 343- 
366 PR00109E 14.41 5.950e-20 
388-411 PR00109B 12.27 9.571e- 
19 274-293 PR00109A 15.00 
5.065e-l 0 237-251 PR00109C 
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12.85 5.333e-10 324-335 


1223 


BL00239 


Receptor tyrosine kinase class II 
proteins. 


BL00239E 17.14 1.281e-25 314- 
364 BL00239F 28.15 9.684e-19 
369-414 BL00239C 18.75 8.765e- 
15 261-284 BL00239D 16.81 
5.571e-10 286-312 


1223 


PR00046 


MAJOR SIGMA-70 FACTOR 
SIGNATURE 


PR00046A 13.26 8.788e-09 296- 
310 


1223 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790O 7.68 9.357e-16 321-354 
BL00790Q 15.61 6.057e-ll 380- 
429 BL00790N 13.25 3.937e-10 
287-314 BL00790M8.74 5.138e- 
10 265-287 BL00790K 9.30 
1.000e-08 182-236 


1224 


PR00453 


VON WILLEBRAND FACTOR TYPE 
A DOMAIN SIGNATURE 


PR00453A 12.79 9.827e-09 50-68 


1227 


BL00615 


C-type lectin domain proteins. 


BL00615B 12.25 9.571e-10 151- 
165 BL00615A 16.68 3.100e-09 
48-66 


1237 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514E 14.28 7.750e-12 313- 
330 


1257 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 1.000e-09 247- 
261 PR00019B 11.36 2.080e-09 
316-330 PR00019B 11.36 3.160e- 
09 244-258 PR00019B 11.36 
4.240e-09 100-114 PR00019B 
11.36 5.680e-09 148-162 


1263 


BL00427 


Disintegrins proteins. 


BL00427 13.93 4.035e-17 436-491 


1263 


PR00289 


DISINTEGRIN SIGNATURE 


PR00289A 13.62 5.667e-14 450- 
470 


1263 


PR00873 


ECHINOIDEA (SEA URCHIN) 
METALLOTHIONEIN SIGNATURE 


PR00873D 8.43 5.989e-09 608-627 


1264 


BL01173 


Lipolytic enzymes G-D-X-G family, 
histidine. 


BL01173C 8.98 1.000e-16 216-230 
BL01173B 13.27 1.000e-12 174- 
201 BL01173A9.41 7.200e-ll 
141-154 


1266 


BL00492 


Clusterin proteins. 


BL00492F 16.61 1.000e-40 163- 
205 BL00492G 13.20 1.000e-40 
230-280 BL00492H 16.50 6.870e- 
40 358-395 BL00492C 8.35 
L000e-38 52-86 BL00492B 10.36 
9.053e-28 26-49 BL00492E 12.52 
6.423e-26 73-122 BL00492A 11.53 
4.724e-18 2-19 


1267 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.538e-29 207- 
249 BL00284A 15.64 3.739e-18 
107-131 BL00284D 16.34 3.793e- 
17 314-341 BL00284E 19.15 
2.909e~15 401-426 


1268 


BL00284 


Serpins proteins. 


BL00284C 28.56 6.538e-29 243- 
285 BL00284A 15.64 3.739e-18 
125-149 BL00284E 19.15 8.269e- 
18 437-462 BL00284D 16.34 
3.793e-17 350-377 BL00284B 
17.99 3.483e-13 214-235 


1273 


BL00380 


Rhodanese proteins. 


BL00380D 15.90 8.200e-28 110- 
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136 BL00380G 11.26 5.800e-16 
251-264 BL00380B 14.77 7.000e- 
14 49-62 BL00380C 15.67 7.387e- 
13 82-98 BL00380E 12.44 7.000e- 
11 181-193 BL00380A 10.48 
1.000e-09 10-20 


1274 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134C 13.45 9.000e-17 333- 
347 BL00134A 11.96 7.429e- 16 
147-164 BL00134B 15.99 3.793e- 
15 297-321 


1274 


BL01253 


Type I fibronectin domain proteins. 


BL01253H 13.15 5.909e-19 315- 
350 BL01253G 11.34 8.791e-13 
296-310 BL01253F 14.35 2.241e- 
12 252-291 BL01253E 16.01 
7.194e-10 215-252 BL01253D4.84 
7.686e-10 147-161 


1274 


BL00495 


Apple domain proteins. 


BL00495N 11.04 3.957e-24 289- 
324 BL00495O 13.75 6.610e-16 
324-353 BL00495K 12.58 1.957e- 
11 149-182 BL00495L 11.94 
2.154e-09 178-217 


1274 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 7.480e-15 148- 
164 PR00722C 10.87 2.286e-14 
296-309 PR00722B 12.51 3.250e- 
09 203-218 


1274 


BL00021 


Kringle domain proteins. 


BL00021D 24.56 8.773e-27 305- 
347 BL00021B 13.33 9.217e-17 
147-165 BL00021C 22.21 3.717e- 
09 226-248 


1279 


PR00704 


CALPAIN CYSTEINE PROTEASE 
(C2) FAMILY SIGNATURE 


PR00704I 9.52 4.682e-14 145-174 


1287 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.313e-28 16-55 


1287 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 3.769e-15 191-204 
PD00066 13.92 1.500e-13 247-260 
PD00066 13.92 1.500e-13 387-400 
PD00066 13.92 7.500e-13 275-288 
PD00066 13.92 8.000e-13 359-372 
PD00066 13.92 9.143e-12 219-232 
PD00066 13.92 4.913e-l 1 303-316 


1287 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 4.857e-16 371-388 
BL00028 16.07 8. 4 12e- 14 203-220 
BL00028 16.07 5.500e-13 287-304 
BL00028 16.07 1.346e-ll 259-276 
BL00028 16.07 5.154e-ll 399-416 
BL00028 16.07 7.577e-ll 343-360 
BL00028 16.07 1.000e-10 175-192 
BL00028 16.07 6.700e-10 231-248 
BL00028 16.07 3.829e-09 315-332 


1287 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 6.817e-09 280- 
315 


1287 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 7.750e-16 368- 
382 PR00048A 10.52 4.273e-15 
284-298 PR00048A 10.52 1.643e- 
13 200-214 PR00048B 6.02 
1.900e-13 188-198 PR00048B 6.02 
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6.400e-13 272-282 PR00048A 
10.52 3.118e-12 396-410 
PR00048B 6.02 5.000e-12 384-394 
PR00048B 6.02 4.462e-ll 244-254 
PR00048B 6.02 1.563e-10 356-366 
PR00048A 10.52 2.174e-10 256- 
270 PR00048A 10.52 4.913e-10 
312-326 PR00048A 10.52 9.217e- 
10 340-354 PR00048B 6.02 
3.368e-09 300-310 PR00048A 
10.52 7.480e-09 228-242 


1292 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243H 17.53 8.696e-ll 42-68 


1295 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 3.512e-31 10-49 


1295 


FD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 9.100e-16 337-350 
PD00066 13.92 4.462e-15 477-490 
PD00066 13.92 5.800e-14 421-434 
PD00066 13.92 9.400e-14 449-462 
PD00066 13.92 9.500e-13 365-378 
PD00066 13.92 9.500e-13 393-406 


1295 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 7.353e-14 321-338 
BL00028 16.07 7.750e-13 405-422 
BL00028 16.07 9.550e-13 349-366 
BL00028 16.07 9.550e-13 377-394 
BL00028 16.07 2.957e-12 433-450 
BL00028 16.07 5.800e-10 461-478 
BL00028 16.07 7.600e-10 489-506 


1295 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 1.000e-13 346- 
360 PR00048A 10.52 1.000e-13 
486-500 PR00048A 10.52 5.500e- 
13 430-444 PR00048A 10.52 
6.786e-13 402-416 PR00048A 
10.52 6. 786e- 13 458-472 
PR00048A 10.52 4.789e-ll 318- 
332 PR00048A 10.52 9.526e-ll 
374-388 PR00048B 6.02 1.000e-10 
334-344 PR00048B 6.02 9.438e-10 
418-428 PR00048B 6.02 9.43Se-10 
474.484 PR00048B 6.02 8.579e-09 
446-456 


1296 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 8.615e-33 16-55 


1296 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 6.400e-16 348-361 
PD00066 13.92 3.077e-15 236-249 
PD00066 13.92 4.462e-15 264-277 
PD00066 13.92 4.462e-15 320-333 
PD00066 13.92 2.800e-14 432-445 
PD00066 13.92 4.600e-14 292-305 
PD00066 13.92 8.200e-14 488-501 
PD00066 13.92 8.200e-14 516-529 
PD00066 13.92 6.500e-13 376-389 
PD00066 13.92 7.000e-13 208-221 
PD00066 13.92 4.429e-12 460-473 


1296 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 9.182e-15 385- 
399 PR00048A 10.52 9.182e-15 
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469-483 PR00048A 10.52 4.750e- 
14 329-343 PR00048A 10.52 
3.571e-13 413-427 PR00048A 
10.52 9.357e-13 245-259 
PR00048A 10.52 1.529e-12 525- 
539 PR00048A 10.52 2.588e-12 
189-203 PR00048A 10.52 3.1 18e- 
12 301-315 PR00048A 10.52 
3.647e-12 357-371 PR00048B 6.02 
l.OOOe-11 205-215 PR00048B 6.02 
l.OOOe-ll 261-271 PR00048B 6.02 
l.OOOe-11 429-439 PR00048B 6.02 
2.385e-ll 345-355 PR00048B 6.02 
4.462e-ll 513-523 PR00048B 6.02 
7.231e-ll 317-327 PR00048A 
10.52 9.526e-ll 217-231 
PR00048A 10.52 2.174e-10 273- 
287 PR00048B 6.02 2.688e-10 
485-495 PR00048B6.02 3.813e-10 
289-299 PR00048B 6.02 4.938e-10 
233-243 PR00048A 10.52 6.478e- 
10 497-511 PR00048B6.02 
3.842e-09 373-383 


1296 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.588e-14 192-209 
BL00028 16.07 1.450e-13 388-405 
BL00028 16.07 1.450e-13 528-545 
BL00028 16.07 5. 304e- 12 472-489 
BL00028 16.07 6.478e- 12 416-433 
BL00028 16.07 8.826e-12 332-349 
BL00028 16.07 9.609e-12 248-265 
BL00028 16.07 2.385e-ll 276-293 
BL00028 16.07 3.769e-ll 360-377 
BL00028 16.07 7.577e-ll 304-321 
BL00028 16.07 6.100e-10 220-237 
BL00028 16.07 5.8S6e-09 500-517 


1299 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 4.375e-10 188-199 


1299 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 1.321e-19 188- 
214 PR00138B 15.82 1.655e-10 
106-122 PR00138C 16.41 2.393e- 
10 132-161 PR00138E6.01 
6.586e-10 222-236 


1299 


BL00024 


Hemopexin domain proteins. 


BL00024D 17.28 3.411e-22 182- 
214 BL00024B 21.53 8.788e-19 
80-114 BL00024H 11.35 3.077e-10 
337-349 BL00024E 7.58 9.21 le-10 
222-236 BL00024C 22.98 6.000e- 
09 131-180 


1299 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 6.529e-09 183- 
202 


1299 


BL00546 


Matrixins cysteine switch. 


BL00546C 16.41 1.643e-22 182- 
214 BL00546D 10.34 8.017e-10 
222-236 BL00546B 20.11 2.443e- 
09 132-176 BL00546H 10.76 
4.5 12e-09 337-348 BL00546G 
16.84 6.684e-09 338-358 
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1301 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 2.875e-09 170- 
188 


1305 


PD02811 


PROTEIN PEPTIDE REDUCTASE 
MG448 PILB FIMBRIA TRAN. 


PD02811A 20.67 3.017e-22 60-98 
PD02811B 17.07 2.263e-21 111- 
144 PD02811C 13.25 5.696e-13 
147-160 


1307 


BL00282 


Kazal serine protease inhibitors family 
proteins. 


BL00282 16.88 8.043e-18 28-51 


1307 


PR00290 


KAZAL-TYPE SERINE PROTEASE 
INHIBITOR SIGNATURE 


PR00290B 9.78 6.559e-ll 39-51 
PR00290A 10.88 8.851e-09 28-39 


1308 


BL00112 


ATP:guanido phosphotransferases 
proteins. 


BL00112F 21.37 7.728e-09 51-102 


1309 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 139- 
168 BL00269A 8.53 2.607e-20 74- 
94 BL00269B 19.17 5.500e-17 
101-130 


1312 


BL01115 


GTP-binding nuclear protein ran 
proteins. 


BL01115A 10.22 3.559e-12 63-107 


1312 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 7.545e-17 63-85 
PR00449E 13.50 9.280e-14 201- 
224 PR00449D 10.79 7.158e-13 
167-181 PR00449C 17.27 9.229e- 
12 103-126 PR00449B 14.34 
9.280e-10 86-103 


1314 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.059e-14 361-378 
BL00028 16.07 7.353e-14 193-210 
BL00028 16.07 1.900e-13 165-182 
BL00028 16.07 3.250e-13 333-350 
BL00028 16.07 4.150e-13 221-238 
BL00028 16.07 9.550e-13 277-294 
BL00028 16.07 1.000e-l 1 305-322 
BL00028 16.07 1.600e- 10 249-266 


1314 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 2.800e-14 321-334 
PD00066 13.92 1.000e-13 265-278 
PD00066 13.92 1.429e- 12 237-250 
PD00066 13.92 7.857e-12 181-194 
PD00066 13.92 2. 174e-l 1 349-362 
PD00066 13.92 2.385e-10 209-222 
PD00066 13.92 7.23 le-10 293-306 


1314 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 5.500e-14 218- 
232 PR00048A 10.52 5.500e-14 
358-372 PR00048A 10.52 8.500e- 
14 330-344 PR00048A 10.52 
2.286e-13 190-204 PR00048A 
10.52 8,071e-13 162-176 
PR00048A 10.52 3.118e-12 274- 
288 PR00048A 10.52 6.684e-ll 
246-260 PR00048B 6.02 7.23 le-11 
318-328 PR00048B 6.02 1.563e-10 
178-188 PR00048A 10.52 7.261e- 
10 302-316 PR00048B6.02 
8.579e-09 262-272 


1315 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 1.947e-10 95-123 


1315 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 3.750e-09 48-63 



WO 03/054152 



PCT/US02/39555 



Table 3A 
333 



SEQ ID 


Database entry 
ID 


Description 


Results* 


1315 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 4.434e-09 50-79 


1315 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 2.770e-09 24-34 
PD00866L 3.73 2.770e-09 82-92 
PD00866L 3.73 2.918e-09 4-14 
PD00866L 3.73 4.984e-09 14-24 


1315 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 5.014e-09 89-103 


1315 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 5.655e-13 3-46 
BL00243I31.77 9.286e-ll 81-124 
BL00243I 31.77 6.691e-10 13-56 
BL00243I 31.77 2.775e-09 10-53 
BL00243I 31.77 5.310e-09 52-95 
BL002431 31.77 5.437e-09 18-61 


1315 


PR00858 


CRUSTACEAN 

METALLOTHIONEIN SIGNATURE 


PR00858B 5.93 3.394e-09 49-68 
PR00858B 5.93 7.223e-09 15-34 


1315 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 9.893e-13 72-118 
BL00203 13.94 7.414e-12 34-80 
BL00203 13.94 2.879e-ll 46-92 
BL00203 13.94 4.956e-ll 87-133 
BL00203 13.94 5.945e-ll 101-147 
BL00203 13.94 8.319e-ll 78-124 
BL00203 13.94 1.383e-10 63-109 
BL00203 13.94 3.106e-10 77-123 
BL00203 13.94 6.266e-10 15-61 
BL00203 13.94 8.085e-10 33-79 
BL00203 13.94 8.372e-10 102-148 
BL00203 13.94 9.138e-10 56-102 
BL00203 13.94 9.521e-10 88-134 
BL00203 13.94 1.000e-09 5-51 
BL00203 13.94 1.827e-09 82-128 
BL00203 13.94 3.847e-09 83-129 
BL00203 13.94 5. 133e-09 48-94 
BL00203 13.94 5.500e-09 25-71 
BL00203 13.94 6.143e-09 28-74 
BL00203 13.94 6.327e-09 93-139 
BL00203 13.94 7.337e-09 73-119 
BL00203 13.94 8.071e-09 61-107 
BL00203 13.94 8.439e-09 91-137 
BL00203 13.94 9.082e-09 13-59 


1315 


DM00864 


EGF-LIKE DOMAIN. 


DM00864A 15.21 9.229e-09 6-27 


1315 


BL00198 


4Fe-4S ferredoxins, iron-sulfur binding 
region proteins. 


BL00198 10.43 1.600e-09 23-35 
BL00198 10.43 9.700e-09 3-15 


1316 


DM00406 


GLIADIN. 


DM00406 7.73 9.514e-10 162-175 


1316 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 8.859e-09 66-109 


1316 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 4.553e-09 24-53 
BL00269C 16.52 9.289e-09 94-123 


1316 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 3.700e-09 45-73 
PD02283C 17.54 8.763e-09 65-93 
PD02283C 17.54 9.325e-09 80-108 
PD02283C 17.54 9.550e-09 90-118 


1316 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 4.363e-ll 47-93 
BL00203 13.94 5.846e-ll 88-134 
BL00203 13.94 7.527e-ll 57-103 
BL00203 13.94 8.714e-ll 22-68 
BL00203 13.94 4.447e-10 87-133 
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BL00203 13.94 7.798e-10 92-138 
BL00203 13.94 2.010e-09 37-83 
BL00203 13.94 3.939e-09 38-84 
BL00203 13.94 5.133e~09 18-64 
BL00203 13.94 5.224e-09 17-63 
BL00203 13.94 5.592e-09 52-98 
BL00203 13.94 5.684e-09 68-114 
BL00203 13.94 7.153e-09 33-79 
BL00203 13.94 7.245e-09 78-124 
BL00203 13.94 8. 163e-09 43-89 
BL00203 13.94 8.439e-09 23-69 
BL00203 13.94 9.265e-09 42-88 
BL00203 13.94 9.633e-09 93-139 


1317 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 8.279e-10 216-259 
BL00243I 31.77 5.310e-09 219-262 


1317 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 5.865e-09 244- 
259 


1317 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 7.750e-10 163- 
191 PD02283C 17.54 4.938e-09 
198-226 PD02283C 17.54 6.175e- 
09 243-271 PD02283C 17.54 
6.400e-09 62-90 PD02283C 17.54 
6.625e-09 153-181 PD02283C 
17.54 6.738e-09 173-201 
PD02283C 17.54 6.738e-09 208- 
236 PD02283C 17.54 7.975e-09 
188-216 PD02283C 17.54 9.325e- 
09 178-206 


1317 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 9.703e-ll 130-176 
BL00203 13.94 1.191e-10 104-150 
BL00203 13.94 2.149e-10 64-110 
BL00203 13.94 4.447e-10 200-246 
BL00203 13.94 5.883e-10 215-261 
BL00203 13.94 5.979e-10 210-256 
BL00203 13.94 5.979e-10 225-271 
BL00203 13.94 6.362e-10 120-166 
BL00203 13.94 6.649e-10 24-70 
BL00203 13.94 7.032e-10 34-80 
BL00203 13.94 8.181e-10 165-211 
BL00203 13.94 8.947e-10 85-131 
BL00203 13.94 9.809e-10 60-106 
BL00203 13.94 1.000e-09 170-216 
BL00203 13.94 1.367e-09 176-222 
BL00203 13.94 1.643e-09 70-116 
BL00203 13.94 2.378e-09 239-285 
BL00203 13.94 2.469e-09 125-171 
BL00203 13.94 3.755e-09 175-221 
BL00203 13.94 3.847e-09 19-65 
BL00203 13.94 3.847e-09 190-236 
BL00203 13.94 4.122e-09 201-247 
BL00203 13.94 4.490e-09 160-206 
BL00203 13.94 5.224e-09 186-232 
BL00203 13.94 5.867e-09 115-161 
BL00203 13.94 6.235e-09 80-126 
BL00203 13.94 6.418e-09 55-101 
BL00203 13.94 7.337e-09 15-61 
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BL00203 13.94 7.337e-09 69-115 
BL00203 13.94 7.520e-09 45-91 
BL00203 13.94 7.888e-09 95-141 
BL00203 13.94 8.255e-09 29-75 
BL00203 13.94 8.439e-09 206-252 
BL00203 13.94 8.531e-09 126-172 
BL00203 13.94 8.806e-09 38-84 
BL00203 13.94 8.898e-09 151-197 
BL00203 13.94 9.265e-09 216-262 
BL00203 13.94 9.449e-09 180-226 


1317 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 9.757e-09 113-127 


1317 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.425e-10 132- 
161 BL00269C 16.52 9.882e-09 
21-50 


1318 


PR00876 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 2.268e-10 107-121 


1318 


PR00874 


FUNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 9.739e-10 95-110 


1318 


BL00427 


Disintegrins proteins. 


BL00427 13.93 2.274e-09 114-169 


1318 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L3.73 6.564e-10 1-11 
PD00866L 3.73 1.443e-09 33-43 
PD00866L3.73 2.770e-09 119-129 
PD00866L3.73 2.918e-09 12-22 
PD00866L3.73 2.918e-09 19-29 


1318 


BL01208 


VWFC domain proteins. 


BL01208B 15.83 2.946e-09 143- 
158 


1318 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 
LI. 


DM01724 8.14 7.465e-10 17-37 
DM01724 8.14 4.434e-09 19-39 
DM01724 8.14 6.684e-09 10-30 


1318 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 8.763e-09 132- 
160 


1318 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 1.724e-12 124-170 
BL00203 13.94 2.034e-12 119-165 
BL00203 13.94 4.207e-12 115-161 
BL00203 13.94 4.931e-12 93-139 
BL00203 13.94 3.275e-ll 109-155 
BL00203 13.94 3.967e-ll 120-166 
BL00203 13.94 3.298e-10 114-160 
BL00203 13.94 4.064e-10 35-81 
BL00203 13.94 4.447e-10 100-146 
BL00203 13.94 5.1 17e-l 0 98-144 
BL00203 13.94 7.989e-10 83-129 
BL00203 13.94 9.426e-10 125-171 
BL00203 13.94 2.929e-09 34-80 
BL00203 13.94 2.929e-09 94-140 
BL00203 13.94 3.663e-09 32-78 
BL00203 13.94 4.490e-09 88-134 
BL00203 13.94 4.582e-09 110-156 
BL00203 13.94 4.673e-09 101-147 
BL00203 13.94 4.857e-09 104-150 
BL00203 13.94 5.592e-09 43-89 
BL00203 13.94 6.694e-09 111-157 
BL00203 13.94 8.898e-09 55-101 
BL00203 13.94 9.082e-09 99-145 


1318 


BL00243 


Integrins beta chain cysteine-rich 


BL00243I 31.77 1.662e-10 68-111 
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domain proteins. 


BL00243I 31.77 3.912e-10 38-81 
BL00243I 31.77 5.500e-10 58-101 
BL00243I 31.77 8.941e-10 48-91 
BL002431 31.77 6.324e-09 25-68 
BL00243I 31.77 7.465e-09 118-161 
BL00243I 31.77 9.239e-09 61-104 
BL00243I 31.77 9.620e-09 98-141 


1319 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDL 


PD00066 13.92 2.800e-14 107-120 
PD00066 13.92 2.800e-14 219-232 
PD00066 13.92 4.000e-13 79-92 
PD00066 13.92 4.000e-12 51-64 
PD00066 13.92 9.143e-12 191-204 
PD00066 13.92 2.038e-10 163-176 


1319 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 3.250e-13 203-220 
BL00028 16.07 8.650e-13 35-52 
BL00028 16.07 9.550e-13 91-108 
BL00028 16.07 3.739e-12 147-164 
BL00028 16.07 4.522e-12 119-136 
BL00028 16.07 4.462e-ll 175-192 
BL00028 16.07 8.500e-10 63-80 


1319 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 4.706e-12 60-74 
PR00048B 6.02 6.000e-12 104-114 
PR00048B 6.02 1.000e-ll 216-226 
PR00048A 10.52 3.368e-ll 88-102 
PR00048B 6.02 7.231e-ll 160-170 
PR00048A 10.52 7.632e-ll 32-46 
PR00048B 6.02 1.000e-10 188-198 
PR00048A 10.52 2.565e-10 172- 
186 PR00048A 10.52 2.957e-10 
200-214 PR00048B 6.02 1.000e-09 
48-58 PR00048A 10.52 4.240e-09 
144-158 PR00048B 6.02 6.684e-09 
76-86 PR00048B 6.02 7.632e-09 
132-142 PR00048A 10.52 8.560e- 
09 116-130 


1319 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 9.122e-09 168- 
203 


1320 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449A 13.20 8.800e-15 4-26 


1320 


BL01115 


GTP-binding nuclear protein ran 
proteins. 


BL01115A 10.22 3.250e-09 4-48 


1321 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 5.055e-ll 71-117 
BL00203 13.94 8.439e-09 146-192 
BL00203 13.94 9.082e-09 53-99 


1327 


BL00799 


Granulins proteins. 


BL00799H 14.15 9.667e-09 18-59 


1329 


PF00711 


Beta defensins. 


PF00711 15.76 7.915e-ll 60-93 


1329 


PR00858 


CRUSTACEAN 

METALLOTHIONEIN SIGNATURE 


PR00858B 5.93 1.479e-09 55-74 


1329 


BL00317 


WAP-type Tour-disulfide core' domain 
proteins. 


BL00317B 14.58 2.216e-09 63-85 


1329 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 
LL 


DM01724 8.14 4.484e-12 19-39 
DM01724 8.14 4.484e-12 23-43 
DM01724 8.14 4.484e-12 27-47 
DM01724 8.14 4.484e-12 31-51 
DM01724 8.14 4.296e-10 35-55 
DM01724 8.14 4.197e-09 11-31 
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1329 



PR00872 



DM01724 8.14 5.382e-09 7-27 
DM01724 8.14 6.803e-09 15-35 



DIPTERA (DROSOPHILA) 
METALLOTHIONEIN SIGNATURE 



PR00872A 5.17 7.409e-09 99-112 



1329 



BL00203 



Vertebrate metallothioneins proteins. 



BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 
BL00203 



13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 
13.94 



7.429e- 
1.989& 
3.967e- 
8.372e- 
8.947e- 
1.184e- 
3.204e- 
3.663e- 
4.673e- 
5.592e- 
6.786e- 
7.429e- 



•13 47-93 
■11 54-100 
■11 50-96 
10 51-97 
•10 58-104 
■09 75-121 
■09 74-120 
•09 57-103 
•09 65-111 
09 70-116 
•09 55-101 
•09 40-86 



1329 



BL00264 



Neurohypophysial hormones proteins. 



BL00264 8.98 7.632e-09 94-121 



1329 



PD00866 



1329 



DM00864 



1329 



BL00243 



GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 



PD00866L 3.73 7.709e-10 74-84 
PD00866L 3.73 6.902e-09 4-14 
PD00866L 3.73 9.262e-09 1-11 



EGF-LIKE DOMAIN. 



DM00864A 15.21 9.486e-09 9-30 



Integrins beta chain cysteine-rich 
domain proteins. 



BL00243I 31.77 2.000e-ll 57-100 
BL00243I 31.77 5.897e-10 69-112 
BL002431 31.77 1.254e-09 60-103 
BL00243I 31.77 7.465e-09 8-51 
BL00243I 31.77 1.000e-08 16-59 



1330 



BL00412 



Neuromodulin (GAP-43) proteins. 



BL00412D 16.54 2.957e-10 175- 
226 



1330 



DM01206 



CORONA VIRUS NUCLEOCAPSID 
PROTEIN. 



DM01206B 10.69 3.278e-09 70-90 
DM01206B 10.69 4.418e-09 105- 
125 



1330 



PF01140 



Matrix protein (MA), pl5. 



PF01140D 15.54 5.291e-09 125- 
160 



1330 



PF00992 



Troponin. 



PF00992A 16.67 5.026e-09 183- 
218 PF00992A 16.67 8.934e-09 
191-226 



1330 



BL00048 



Protamine PI proteins. 



BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 
BL00048 



6.39 7. 
6.39 9. 
6.39 1. 
6.39 2. 
6.39 3. 
6.39 7. 
6.39 2. 
6.39 6. 
6.39 6. 
6.39 6. 
6.39 7. 
6.39 8. 
6.39 2. 
6.39 4. 
6.39 5. 
6.39 6. 
6.39 7. 
6.39 7. 
6.39 8. 
6.39 8. 



107e-16 
196e-16 
132e-12 
059e-12 
250e-12 
618e-12 
625e-ll 
500e-ll 
750e-ll 
875e-ll 
125e-ll 
625e-ll 
539e-10 
434e-10 
855e-10 
921e-10 
158e-10 
750e-10 
105e-10 
579e-10 



64- 91 
63-90 
62-89 
66-93 

65- 92 
92-119 

60- 87 
113-140 

78- 105 
104-131 
112-139 
74-101 

108- 135 

61- 88 
110-137 
98-125 

109- 136 
97-124 

79- 106 
19-46 
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1331 



1332 



1332 



1333 



1333 



1333 



1333 



Database entry 
ID 



BL00290 



BL00280 



PR00759 



PD01066 



PD00066 



BL00028 



PR00048 



Description 



Immunoglobulins and major 
histocompatibility complex proteins. 



Pancreatic trypsin inhibitor (Kunitz) 
family proteins. 



BASIC PROTEASE (KUNITZ-TYPE) 
INHIBITOR FAMILY SIGNATURE 



PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 



PROTEIN ZINC-FINGER METAL- 
BINDI. 



Zinc finger, C2H2 type, domain 
proteins. 



C2H2-TYPE ZINC FINGER 
SIGNATURE 



Results* 



BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6 
BL00048 6 
BL00048 6 



39 8.934e-10 
39 9.526e-10 
39 1.675e-09 
39 1.900e-09 
39 3.250e-09 
39 3.475e-09 
39 3.700e-09 
39 3.700e-09 
39 4.263e-09 
39 5.163e-09 
39 5.275e-09 
39 5.275e-09 
39 5.388e-09 
39 6.738e-09 
39 8.650e-09 
39 8.763e-09 
39 9.100e-09 
39 9.550e-09 
39 9.550e-09 
39 9.663e-09 
39 1.000e-08 



94-121 
103-130 

101- 128 
73-100 

81- 108 
111-138 

82- 109 
96-123 

99- 126 
107-134 
67-94 
80-107 
49-76 
116-143 
52-79 
18-45 
21-48 

76- 103 

100- 127 

102- 129 

77- 104 



BL00290B 13.17 7.750e-09 74 r 92 



BL00280 24.61 2.154e-28 83-127 



PR00759B 11.26 3.323e-10 101- 
112 PR00759C 14.15 2.125e-09 
111-127 



PD01066 19.43 4.162e-34 8-47 



PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 
PD00066 



13.92 
13.92 
13.92 
13.92 
13.92 
13.92 
13.92 



2.800e-14 
2.800e-14 
2.800e-14 
2.800e-14 
5.500e-13 
1.783e-ll 
8.043e-ll 



273-286 
301-314 
329-342 
357-370 
245-258 
217-230 
189-202 



BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 
BL00028 



16.07 
16.07 
16.07 
16.07 
16.07 
16.07 
16.07 



6.850e-13 
2.957e-12 
4.522e-12 
5.304e-12 
1.346e-ll 
2.038e-ll 
4.900e-10 



257-274 
201-218 
341-358 
285-302 
369-386 
229-246 
313-330 



PR00048A 10.52 2.286e-13 198- 
212 PR00048A 10.52 2.929e-13 
254-268 PR00048A10.52 2.929e- 
13 282-296 PR00048A 10.52 
5.500e-13 366-380 PR00048B 6.02 
6.400e-13 354-364 PR00048A 
10.52 8.941e-12 226-240 
PR00048B 6.02 1.000e-ll 326-336 
PR00048B 6.02 5.846e-ll 242-252 
PR00048A 10.52 6.211e-ll 310- 
324 PR00048A 10.52 8.579e-ll 
338-352 PR00048B6.02 9.308e-ll 
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270-280 PR00048B 6.02 7.750e-10 
298-308 PR00048B 6.02 8.3 13e-10 
214-224 


1336 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 8.920e-09 333-347 
PR00806B 4.28 9.640e-09 332-346 


1337 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 8.920e-09 333-347 
PR00806B 4.28 9.640e-0? 332-346 


1340 


BL01221 


PMP-22 / EMP / MP20 family proteins. 


BL01221B 13.29 6.745e-09 54-68 


1343 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 8.200e-12 208- 
231 


1348 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 1.947e-10 316-326 


1352 


DM00973 


3 kw RESISTANCE BENOMYL 
YLL028W CYCLOHEXIMIDE. 


DM00973 A 21.17 2.306e-13 66- 
103 


1353 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 8.200e-10 190- 
204 PR00019A 11.19 7.667e-09 
193-207 


1356 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327B 19.84 2.091e-09 144- 
166 


1356 


DM00179 


w KINASE ALPHA ADHESION T~ 
CELL. 


DM00179 13.97 7.652e-09 291-301 


1357 


PF01130 


CD36 family. 


PF01130D 26.65 1.955e-23 337- 
391 PF01130C 22.91 1.851e-18 
255-305 PF01130B 21.40 1.537e- 
15 200-246 


1359 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 7.480e-10 160- 
183 BL00290B 13.17 2.875e-09 
226-244 


1366 


PR00007 


COMPLEMENT C1Q DOMAIN 
SIGNATURE 


PR00007A 19.33 6.936e-13 274- 
301 PR00007C 15.60 9.250e-13 
349-371 PR00007B 14.16 9.372e- 
13 301-321 PR00007D9.64 
5.500e-ll 381-392 


1366 


BL01113 


Clq domain proteins. 


BL01113B 18.26 1.675e-24 280- 
316 BL01113A 17.99 4.194e-15 
200-227 BL01113D7.47 3.250e-13 
383-393 BL01113A 17.99 3.919e- 
13 191-218 BL01113C 13.18 
9.294e-12 349-369 BL01113A 
17.99 5.500e-ll 185-212 
BL01113A 17.99 6.727e- 11 182- 
209 BL01113A 17.99 8.773e-ll 
203-230 BL01113A 17.99 3.681e- 
10 188-215 BL01113A 17.99 
6.936e-10 176-203 BL01113A 
17.99 7.319e-10 194-221 
BL01113A 17.99 4.635e-09 209- 
236 BL01113A 17.99 5.500e-09 
179-206 


1366 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 3.250e-12 185- 
214 BL00420A20.42 2.623e-10 
203-232 BL00420A 20.42 2.385e- 
09 191-220 BL00420A 20.42 
6.400e-09 182-211 BL00420A 
20.42 7.646e-09 188-217 
BL00420A 20.42 8.200e-09 173- 
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202 


1366 


PR00524 


CHOLECYSTOKININ TYPE A 
RECEPTOR SIGNATURE 


PR00524F 5.36 L766e-09 200-214 
PR00524F5.36 8.564e-09 188-202 


1367 


PR00794 


PANCREATIC RIBONUCLEASE 
FAMILY SIGNATURE 


PR00794D 17.96 8.800e-17 94-117 
PR00794A 14.47 6.897e-16 46-66 
PR00794B 16.95 5.655e-14 66-86 


1367 


BL00127 


Pancreatic ribonuclease family proteins. 


BL00127B 26.57 9.357e-29 46-91 
BL00127C 31.49 9.763e-22 82-126 
BL00127A 12.84 8.200e-10 31-41 


1368 


BL00067 


3-hydroxyacyl-CoA dehydrogenase 
proteins. 


BL00067A 21.37 9.053e-10 53-84 


1368 


BL00677 


D-amino acid oxidases proteins. 


BL00677A 6.96 3.821e-09 55-68 


1368 


BL00982 


Bacterial-type phytoene dehydrogenase 
proteins. 


BL00982A 18.41 4.000e-09 57-89 


1369 


BL00067 


3-hydroxyacyl-CoA dehydrogenase 
proteins. 


BL00067A 21.37 9.053e-10 53-84 


1369 


BL00677 


D-amino acid oxidases proteins. 


BL00677A 6.96 3.821e-09 55-68 


1369 


BL00982 


Bacterial-type phytoene dehydrogenase 
proteins. 


BL00982A 18.41 4.000e-09 57-89 


1371 


PR00762 


CHLORIDE CHANNEL SIGNATURE 


PR00762C 9.29 5.200e-20 203-223 
PR00762A 14.22 8.364e-19 108- 
126 PR00762D 11.29 9.379e-19 
440-461 PR00762B 12.12 7.750e- 
16 139-159 PR00762E 12.07 
2.286e-15 475-492 PR00762F 
15.12 6.559e-15 493-513 
PR00762G 14.13 1.692e-ll 531- 
546 


1371 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 3.951e-09 9-19 


1371 


PD00120 


REPEAT PRECURSOR 
GLYCOPROTEIN EG. 


PD00120A 12.94 4.750e-09 6-19 


1372 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 4.349e-18 100- 
122 


1372 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 2.800e-13 131- 
171 


1372 


PR00642 


EDG1 ORPHAN RECEPTOR 
SIGNATURE 


PR00642D 9.70 5.065e-12 89-104 


1372 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237C 15.69 6.400e-09 145- 
168 | 


1374 


BL01022 


PTR2 family proton/oligopeptide 
symporters proteins. 


BL01022B 22.19 2.241 e- 15 74-120 
BL01022A 11.58 2.212e-12 44-63 
BL01022C 16.62 4.919e-10 160- 
184 


1374 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 2.169e-09 20-30 


1375 


PR00705 


PAPAIN CYSTEINE PROTEASE (CI) 
FAMILY SIGNATURE 


PR00705A 10.55 4.000e-21 132- 
148 PR00705B 10.22 2.385e-10 
276-287 


1375 


BL00139 


Eukaryotic thiol (cysteine) proteases 
cysteine proteins. 


BL00139D 9.24 1.818e-18 295-312 
BL00139A 10.29 1.000e-14 132- 
142 BL00139C9.23 2.800e-10 
275-285 


1375 


PR00704 


CALPAIN CYSTEINE PROTEASE 
(C2) FAMILY SIGNATURE 


PR00704C 11.88 6.162e-09 132- 
149 


1376 


PR00237 


RHODOPSIN-LIKE GPCR 


PR00237C 15.69 3.333e-12 111- 



WO 03/054152 



PCT/US02/39555 



Table 3A 
341 



SEQ ID 


Database entry 
ID 


Description 


Results* 






SUPERFAMILY SIGNATURE 


134 PR00237E 13.03 5.667e-ll 
193-217 PR00237F 13.57 1.474e- 
09 397-422 PR00237B 13.50 
4.750e-09 66-88 


1376 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 9.625e-19 97-137 
BL00237C 13.19 8.846e-09 392- 
419 BL00237B5.28 9.182e-09 
201-213 


1378 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649C 17.82 5.773e-10 325- 
351 


i no 

1378 


TIT* AAO ,in 

PR00249 


SECRETIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00249C 17.08 4.130e-09 327- 
351 


lion 

1380 


PD01876 


ANTIGEN MELANOMA- 
ASSOCIATED MULTIGENE 
FAMILY TUM. 


PD01876A 12.01 2.964e-26 1-42 
PD01876C 21.73 2.343e-25 99-152 
PD01876B 14.18 2.500e-12 62-96 


1382 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL- 
LI. 


DM01724 8.14 3.789e-10 291-31 1 


1382 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E 5.51 8.969e-09 263-288 


1382 


PD01849 


CHORION CLASS PRECURSOR 
EGGSHELL REPEAT MULTIGEN. 


PD01849C 11.22 9.625e-09 265- 
293 


1388 


BL00170 


Cyclophilin-type peptidyl-prolyl cis- 
trans isomerase signatur. 


BL00170B 20.97 1.000e-40 88-128 
BL00170C 18.49 1.409e-37 135- 
180 BL00170A 17.08 6.850e-22 
58-85 


1388 


PR00153 


CYCLOPHILIN PEPTIDYL-PROLYL 
CIS-TRANS ISOMERASE 
SIGNATURE 


PR00153C 11.01 2.800e-18 136- 
152 PR00153B 11.57 8.500e-17 
93-106 PR00153D 11.99 4.000e-16 
151-164 PR00153A 12.98 4.789e- 
16 64-80 PR00153E9.10 2.980e- 
10 164-180 


1390 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 6.571e-16 147- 
174 BL00237A 27.68 2.976e- 11 
13-53 BL00237B5.28 2.421e-10 
102-114 


1390 


PR00529 


GONADOTROPHIN RELEASING 
HORMONE RECEPTOR 
SIGNATURE 


PR00529D 23.59 7.380e-13 2-18 
PR00529H 10.07 1.000e-10 176- 
197 PR00529F5.98 5.138e-10 75- 
90 


1390 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237F 13.57 7.429e-15 152- 
177 PR00237E 13.03 3.250e-14 
94-118 PR00237D 8.94 7.000e-09 
58-80 PR00237G 19.63 8.1 16e-09 
192-219 


1390 


PD02448 


TRANSCRIPTION PROTEIN DNA- 
BINDIN. 


PD02448A 9.37 9.195e-09 34-73 


1392 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 3.732e-ll 196- 
246 


1393 


PR00237 


RHODOPSIN-LIKE GPCR 

CT T"DT7T> t7 A A ATT ~\f CT/^TVT A TT Tl~> T~! 


PR00237C 15.69 9.333e-12 143- 
166 


1393 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 3.200e-10 129- 
169 


1393 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245A 18.03 2.429e-12 116- 
138 PR00245B 10.38 7.525e-ll 
216-231 PR00245E 12.40 2.019e- 
10 330-345 PR00245C7.84 
7.286e-10 277-293 
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1397 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 1.783e-09 585- 
604 PR00014A 8.22 3.045e-09 
747-757 PR00014C 15.44 6.087e- 
09 683-702 


1397 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL007901 20.01 1.750e-12 1016- 
1047 BL00790I 20.01 6.125e-12 
694-725 BL00790I 20.01 6.679e- 
09 596-627 


1397 


PR00096 


GLUTAMINE 
AMIDOTRANSFERASE 
SUPERFAMILY SIGNATURE 


PR00096B 9.72 9.827e-09 1063- 
1075 


1401 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE 
E2 PRECURSOR PEPLOMER. 


PD00866L 3.73 8.820e-09 51-61 


1402 


PR00250 


FUNGAL PHEROMONE MATING 
FACTOR STE2 GPCR SIGNATURE 


PR00250G 9.58 9.746e-09 21-43 


1404 


PD01652 


RECEPTOR CELL NK 
GLYCOPROTEIN IMMUNOGLOB. 


PD01652A 15.35 6.625e-10 24-60 
PD01652B 8.50 1.836e-09 14-66 
PD01652B 8.50 4.021e-09 111-163 


1409 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240B 24.70 8.250e-12 268- 
292 


1409 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 3.842e-10 312-322 


1409 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870B 18.83 5.500e-10 421- 
454 PD02870D 15.74 6.266e-09 
182-217 


1409 


PR00021 


SMALL PROLINE-RICH PROTEIN 
SIGNATURE 


PR00021A 4.31 8.405e-09 496-509 


1414 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 3.455e-18 46-71 
BL00215B 10.44 8.714e-12 183- 
196 BL00215A 15.82 7.319e-ll 
143-168 


1 At A 

1414 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR l SIGNATURE 


PR00927E 14.93 6.040e-09 74-96 


1414 


BL00439 


Acyltransferases ChoActase / COT / 
CPT family proteins. 


BL00439A 9.40 8.279e-09 318-335 


1416 


PD01270 


RECEPTOR FC 
IMMUNOGLOBULIN AFFIN. 


PD01270D 24.66 9.286e-12 12-48 


1417 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 2.714e-10 42-51 


1417 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 5.114e-09 23-62 


1418 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604A 9.13 7.035e-29 41-96 
BL00604E 8.32 4.759e-24 201-243 
BL00604C 14.66 8.807e-17 134- 
166 BL00604B 9.95 5.154e-16 
104-134 BL00604D 12.28 5.100e- 
14 166-201 


1418 


PR00220 


SYNAPTOPHYSIN/SYNAPTOPORIN 
FAMILY SIGNATURE 


PR00220A 10.93 1.353e-21 38-61 
PR00220E 3.46 4.150e-18 216-235 
PR00220D 8.32 4.575e-15 149-173 
PR00220C 11.05 5.557e-14 117- 
142 PR00220B 15.48 2.703e-13 
62-88 


1421 


BL01219 


Ammonium transporters proteins. 


BL01219D 11.63 2.957e-10 217- 
241 BL01219F 15.24 8.809e-09 
289-314 


1421 


PR00342 


RHESUS BLOOD GROUP PROTEIN 


PR00342G8.18 1. 45 8e- 19 220-239 
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SIGNATURE 


PR00342B 1 1.09 8.657e-13 61-79 
PR00342D 8.46 2.857e-12 128-145 
PR00342H 7.61 6.927e-l 1 250-273 
PR00342C 10.10 4.770e- 10 90-108 
PR00342E 14.49 5.950e-10 151- 
175 PR00342F 7.02 1 .556e-09 
185-201 PR00342L7.61 9.600e-09 
352-378 


1423 


PR00794 


PANCREATIC RIBONUCLEASE 
FAMILY SIGNATURE 


PR00794C 17.32 8.071e-14 96-1 15 
PR00794B 16.95 6.455e-13 71-91 
PR00794A 14.47 9.538e-13 51-71 
PR00794D 17.96 9.368e-10 117- 
140 


1423 


BL00127 


Pancreatic ribonuclease family proteins. 


BL00127B 26.57 2.895e-23 51-96 
BL00127C 31.49 3.045e-ll 105- 
149 BL00127A 12.84 2.161e-09 
36-46 


1426 


PR00830 


ENDOPEPTIDASE LA (LON) 
SERINE PROTEASE (SI 6) 
SIGNATURE 


PR00830A 8.41 1.143e-ll 165-185 


1426 


BL00113 


Adenylate kinase proteins. 


BL00113A 12.74 4.462e-09 47 1 - 
488 


1426 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 8.905e-O9 467-489 


1427 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 9.500e-ll 74-88 
PR00019A 11.19 5.696e-10 77-91 
PR00019B 1 1.36 6.400e-10 25-39 
PR00019B 1 1.36 5.320e-09 49-63 


1427 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 8.043e-09 382- 
401 


1437 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 6.850e-15 889- 
917 PD01719A 12.89 6.727e-13 
643-671 PD01719A 12.89 1.913e- 
12 832-860 PD01719A 12.89 
4.261e-12 701-729 


1442 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 1.900e-ll 106- 
120 


1443 


BL00353 


HMG1/2 proteins. 


BL00353B 11.47 L643e-32 72-122 


1443 


PR00886 


HIGH MOBILITY GROUP 
(HMG1/HMG2) PROTEIN 
SIGNATURE 


PR00886A 10.08 7.218e-18 37-60 
PR00886B 9.88 1.750e-12 60-81 
PR00886C 1 1.84 9.0l6e-l2 104- 
123 


1443 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649C 17.82 6.226e-l 1 286- 
312 


1443 


PR00249 


SECRETIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00249C 17.08 1.783e-09 288- 
312 PR00249D 13.09 5.1 43e-09 
327-353 


1446 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2.373e-09 781-814 


1448 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 8.780e-09 402-435 


1448 


PF00761 


Polyomavirus coat protein. 


PF00761A 12.61 8.925e-09 1-26 


1449 


PR00180 


CELLULAR RETIN ALDEHYDE- 
BINDING PROTEIN SIGNATURE 


PR00180B 16.42 7.146e-0° 236- 
261 


1452 


PR00926 


MITOCHONDRIAL CARRIER 
PROTEIN SIGNATURE 


PR00926F 17.75 4.706e-24 132- 
155 PR00926E 1 1 .70 7.000e-19 
89-108 PR00926D 10.53 9.308e-17 
41-60 PR00926F 17.75 9.859e-10 
35-58 
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1452 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 S.235e-10 33-58 
BL00215B 10.44 6.400e-09 79-92 
BL00215B 10.44 8.200e-09 174- 
187 


1452 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 1 SIGNATURE 


PR00927E 14.93 7.652e-22 67-89 
PR00927G 11.07 3.500e-15 178- 
194 PR00927F 11.79 6.211e-14 
128-145 PR00927D 11.02 4.000e- 
13 26-40 PR00927B 14.66 2.597e- 
12 164-186 PR00927E 14.93 
7.818e-ll 162-184 PR00927A7.98 
9.667e-09 28-41 


1456 


BL00239 


Receptor tyrosine kinase class II 
proteins. 


BL00239B 25.15 1.000e-40 1037- 
1085 BL00239E 17.14 1.000e-40 
1154-1204 BL00239F 28.15 
1.750e-31 1208-1253 BL00239C 
18.75 1.000e-24 1099-1122 
BL00239D 16.81 2.884e-15 1124- 
1150 


1456 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107B 13.31 1.900e-14 1182- 
1198 BL00107A 18.39 9.217e-14 
1112-1143 


1456 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109D 17.04 4.3 16e-23 1183- 
1206 PR00109B 12.27 4.857e-19 
1112-1131 PR00109E 14.41 
5.500e-16 1227-1250 PR00109C 
12.85 4.000e-13 1164-1175 
PR00109A 15.00 8.962e-ll 1066- 
1080 


1456 


BL00790 


Receptor tyrosine kinase class V 
proteins. 


BL00790O 7.68 4.532e-18 1161- 
1194 BL00790K9.30 5.091e-16 
1010-1064 BL00790H 13.42 
1.466e-15 830-856 BL00790M 
8.74 9.203e-12 1103-1125 
BL00790Q 15.61 5.200e-ll 1219- 
1268 BL00790P 12.33 8.057e-10 
1194-1219 


1456 


BL00240 


Receptor tyrosine kinase class III 
proteins. 


BL00240F 17.74 3.875e-26 1153- 
1201 BL00240C 22.58 4.414e-16 
978-1027 BL00240E 11.56 5.061e- 
15 1098-1136 BL00240G 28.45 
8.909e-10 1200-1253 


1456 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014D 12.04 2.350e-09 893- 
908 PR00014B 14.77 7.000e-09 
842-853 


1456 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001B 17.40 9.640e-09 1109- 
1130 


1457 


PF01307 


Plant virus coat protein. 


PF01307A 21.19 3.025e-09 321- 
351 


1462 


BL00546 


Matrixins cysteine switch. 


BL00546C 16.41 1.643e-33 232- 
264 BL00546B20.il 8.895e-29 
18-62 BL00546A 19.62 7.667e-26 
91-121 BL00546B20.il 7.720e-19 
180-224 BL00546D 10.34 l.OOOe- 
15 273-287 BL00546E 10.23 
4.529e-15 309-330 



WO 03/054152 PCT/US02/39555 



Table 3A 
345 



SEQ ID 


Database entry 
ID 


Description 


Results* 


1462 


BL00024 


Hemopexin domain proteins. 


BL00024D 17.28 7.375e-34 232- 
264 BL00024C 22.98 5.091e-32 
17-66 BL00024B 21.53 3.143e-23 
130-164 BL00024C 22.98 8.732e- 
22 179-228 BL00024E7.58 
5.500e-15 273-287 BL00024F 
11.30 1.900e-14 309-330 
BL00024A 11.49 9.100e-13 111- 
122 


1462 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 1.000e-12 238-249 


1462 


PR00138 


MATRIXIN SIGNATURE 


PR00138D 16.56 6.500e-30 238- 
264 PR00138C 16.41 5.846e-16 
18-47 PR00138A 15.14 7.136e-16 
111-125 PR00138E6.01 7.000e-15 
273-287 PR00138B 15.82 3. 824e- 
11 156-172 


1462 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 6.045e-10 233- 
252 


1475 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 9.566e-17 216-265 
DM01354Z 9.06 2.534e-15 161-210 


1478 


PR00185 


EUKARYOTIC PORIN SIGNATURE 


PR00185D 16.54 1.136e-ll 111- 
129 


1482 


PR00449 


TRANSFORMING PROTEIN P21 
RAS SIGNATURE 


PR00449E 13.50 4.971e-ll 464- 
487 


1484 


PR00879 


FISH ACETYLCHOLINESTERASE 
SIGNATURE 


PR00879A 6.28 1.000e-08 39-45 


1487 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Q 9.23 1.000e-40 283- 
338 DM0 1354T 9.48 4.882e-39 
389-432 DM01354R8.50 8.380e- 
28 338-368 DM01354P9.18 
2.957e-26 249-283 DM01354S 
11.61 1.000e-18 368-389 


1489 


PD02327 


GLYCOPROTEIN ANTIGEN 
PRECURSOR IMMUNOGLO. 


PD02327C 15.47 3.538e-09 60-75 


1491 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354B 14.71 6.426e-14 271- 
299 


1493 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 4.643e-09 348- 
364 


1494 


PF00997 


Kappa casein. 


PF00997D 9.95 8.306e-09 117-152 


1495 


DM01162 


BARLEY YELLOW DWARF VIRUS 
RNA-DIRECTED RNA 
POLYMERASE. 


DM01 162B 11.50 9.743e-09 104- 
113 


1497 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 9.757e-10 79- 
113 


1498 


PR00069 


ALDO-KETO REDUCTASE 
SIGNATURE 


PR00069D 19.36 1.659e-19 260- 
290 


1498 


BL00798 


Aldo/keto reductase family proteins. 


BL00798F 23.30 5.125e-25 314- 
363 BL00798E 20.32 7.462e-24 
256-294 BL00798A 14.97 3.000e- 
10 207-222 


1499 


BL00826 


MARCKS family proteins. 


BL00826B 12.51 2.116e-09 545- 
597 


1499 


BL00970 


Nuclear transition protein 2 proteins. 


BL00970C 14.80 5.538e-09 567- 
605 


1499 


DM00303 


6 LEA 1 1-MER REPEAT REPEAT. 


DM00303A 13.20 8.327e-09 574- 
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624 


1499 


PD01457 


RIBOSOMAL PROTEIN 40S ZINC- 
FINGER METAL. 


PD01457A 16.51 7.708e-ll 561- 
606 PD0 1457 A 16.51 1.409e-10 
565-610 PD01457A 16.51 5.745e- 
10 559-604 PD01457A 16.51 
2.397e-09 558-603 PD01457A 
16.51 3.483e-09 567-612 
PD01457A 16.51 4.103e-09 560- 
605 PD01457A 16.51 8.060e-09 
563-608 PD01457A 16.51 8.293e- 
09 557-602 PD01457A 16.51 
8.914e-09 562-607 


1499 


BL00348 


p53 tumor antigen proteins. 


BL00348F 23.19 5.571e-09 543- 
586 BL00348F 23.19 6.429e-09 
583-626 BL00348F 23.19 9.286e- 
09 560-603 


1501 


DM01354 


lew TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354X 13.86 6.230e-15 3-42 


1502 


BL01210 


Caveolins proteins. 


BL01210A 17.61 9.438e-14 132- 
171 


1503 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 9.571e-10 65-80 


1503 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 1.161e-10 53-86 
DM00215 19.43 8.393e-10 52-85 
DM00215 19.43 4.661e-09 47-80 
DM00215 19.43 5.729e-09 46-79 


1506 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 8.548e-13 66-115 


1508 


PD01941 


TRANSMEMBRANE 
COTRANSPORTER SYMP. 


PD01941E 15.92 8.714e-23 198- 
245 PD01941F 28.52 5.154e-ll 
496-551 


1510 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354N 13.17 1.000e-40 148- 
193 DM01354V 12.97 1.000e-40 
274-321 DM01354V 12.97 6.932e- 
40 218-265 DM01354O8.73 
1.882e-19 199-245 DM01354W 
12.64 2.427e- 13 265-285 
DM01354W 12.64 1.300e-ll 321- 
341 


1511 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 4.913e-19 230-269 


1511 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 4.600e-13 343-360 
BL00028 16.07 6.087e-12 371-388 


1511 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 3.250e-16 340- 
354 PR00048A 10.52 3.348e-10 
368-382 PR00048B6.02 4.938e-10 
384-394 


1511 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 3.077e-10 331-344 
PD00066 13.92 6.700e-09 359-372 


1517 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 8.875e-15 449-466 
BL00028 16.07 6.824e-14 309-326 
BL00028 16.07 7.353e-14 505-522 
BL00028 16.07 2.800e-13 1038- 
1055 BL00028 16.07 8.650e-13 
337-354 BL00028 16.07 1.783e-12 
926-943 BL00028 16.07 4.913e-12 
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982-999 BL00028 16.07 5.304e-12 
898-915 BL00028 16.07 5.304e-12 
1094-1111 BL00028 16.07 8.435e- 
12 421-438 BL00028 16.07 
3.077e-ll 393-410 BL00028 16.07 
3.423e-ll 954-971 BL00028 16.07 
3.769e-ll 365-382 BL00028 16.07 
4.462e-ll 870-887 BL00028 16.07 
5.154e-ll 281-298 BL00028 16.07 
4.000e-10 477-494 BL00028 16.07 
7.000e-10 1010-1027 BL00028 
16.07 7.000e-10 1066-1083 


1517 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


PD01066 19.43 9.400e-30 76-115 
PD01066 19.43 7.525e-15 675-714 
PD01066 19.43 1.943e-09 716-755 


1517 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048A 10.52 5.500e-16 446- 
460 PR00048A 10.52 9.182e-15 
306-320 PR00048A10.52 9.182e- 
15 1035-1049 PR00048A 10.52 
5.500e-14 502-516 PR00048A 
10.52 7.750e-14 923-937 
PR00048A 10.52 1.643e-13 895- 
909 PR00048A 10.52 4.214e-13 
1091-1105 PR00048A 10.52 
4.857e-13 334-348 PR00048A 
10.52 4.706e-12 362-376 
PR00048B 6.02 5.000e-12 434-444 
PR00048A 10.52 6.294e-12 474- 
488 PR00048A 10.52 9.471e-12 
951-965 PR00048A 10.52 3.842e- 
11 979-993 PR00048A 10.52 
4.789e-ll 418-432 PR00048B 6.02 
9.308e-ll 1023-1033 PR00048B 
6.02 1.563e-10 294-304 PR00048B 
6.02 1.563e-10 883-893 PR00048B 
6.02 2.125e-10 911-921 PR00048B 
6.02 2.688e-10 350-360 PR00048B 
6.02 8.3 13e-10 939-949 PR00048B 
6.02 9.438e-10 490-500 PR00048B 
6.02 1.000e-09 995-1005 
PR00048B 6.02 3.368e-09 462-472 
PR00048A 10.52 3.880e-09 390- 
404 PR00048B 6.02 4.316e-09 
1079-1089 PR00048A 10.52 
4.600e-09 867-881 


1517 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 2.800e-14 325-338 
PD00066 13.92 4.600e-14 437-450 
PD00066 13.92 5.200e-14 998- 
1011 PD00066 13.92 6.400e-14 
493-506 PD00066 13.92 8.800e-14 
942-955 PD00066 13.92 5.500e-13 
1026-1039 PD00066 13.92 6.000e- 
13 970-983 PD00066 13.92 
6.500e-13 297-310 PD00066 13.92 
6.500e-13 886-899 PD00066 13.92 
7.000e-13 465-478 PD00066 13.92 



WO 03/054152 



PCT/US02/39555 



Table 3A 
348 



SEQ ID 


Database entry 
ID 


Description 


Results* 








7.000e-13 914-927 PD00066 13.92 
9.000e-13 353-366 PD00066 13.92 
9.500e-13 1082-1095 PD00066 
13.92 4. 808e-10 409-422 PD00066 
13.92 4.000e-09 381-394 PD00066 
13.92 5.800e-09 1054-106/ 


1517 


BL00479 


Phorbol esters / diacyl glycerol binding 
domain proteins. 


BL00479A 19.86 7.319e-09 1025- 
1048 


1518 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 7.158e-13 42-65 
BL00290B 13.17 4.500e-12 98-116 


1519 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354K 9.35 7.706e-28 71-103 
DM01354L 11.73 8.250e-14 103- 
130 


1521 


BL00250 


TGF-beta family proteins. 


BL00250A 21.24 2.500e-31 640- 
676 BL00250B 27.37 2.000e-27 
705-741 


1521 


BL01273 


CoA transferases proteins. 


BL01273C 12.54 1.000e-40 130- 
170 BL01273D 19.11 9.471e-28 
206-250 BL01273B 14.85 9.830e- 
20 81-115 BL01273A 12.56 
2.286e-16 62-75 


1521 


PR00669 


INHIBIN ALPHA CHAIN 
SIGNATURE 


PR00669E 16.28 7.078e-12 640- 
658 


1523 


BL00427 


Disintegrins proteins. 


BL00427 13.93 4.100e-15 455-510 


1523 


PR00289 


DISINTEGRIN SIGNATURE 


PR00289A 13.62 3.700e-18 469- 
489 PR00289B 11.79 3.143e-12 
498-511 


1523 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR0001 ID 14.03 9.870e-09 675- 
694 


1524 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 8.683e-12 186- 
197 


1524 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 3.100e-09 183- 
202 


1525 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 7.000e-17 35-62 


1525 


BL00237 


G-protein coupled receptors proteins. 


BL00237D 11.23 3.500e-14 45-62 


1525 


PR00424 


ADENOSINE RECEPTOR 
SIGNATURE 


PR00424F 8.50 2.688e-12 35-46 


1525 


PR00554 


ADENOSINE A2B RECEPTOR 
SIGNATURE 


PR00554F 8.86 1.750e-16 24-37 
PR00554G 14.18 7.288e-12 69-89 


1528 


BL00018 


EF-hand calcium-binding domain 
proteins. 


BL00018 7.41 9.217e-09 338-351 


1530 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354N 13.17 1.000e-40 36-81 
DM01354O 8.73 1.000e-40 87-133 
DM01354Q9.23 1.000e-40 167- 
222 DM0 1354T 9.48 1.000e-40 
273-316 DM01354V 12.97 l.OOOe- 
40 382-429 DM01354Y 10.69 

•t Ann a r\ ac\0 COO T"\TV/fr\ 1 1< A "V 

1.000e-40 49o-j3o JJMU 1 j j4A. 
13.86 5.219e-39 449-488 
DM01354P 9.18 2.723e-37 133-167 
DM01354R 8.50 7.395e-32 222- 
252 DM01354W 12.64 3.000e-25 
429-449 DM01354S 11.61 9.550e- 
23 252-273 DM01354U 12.24 
| 2.350e-20 362-382 
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1530 


DM00895 


7 kw REVERSE TRANSCRIPTASE 
RNA POLYMERASE. 


DM00895E 15.72 3.889e~ll 149- 
174 


1532 


PR00036 


LACI BACTERIAL REGULATORY 
PROTEIN HTH SIGNATURE 


PR00036B 10.57 1.000e-08 282- 
293 


1533 


BL01160 


Kinesin light chain repeat proteins. 


BL01160D 10.17 7.077e-09 218- 
247 j 


1533 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA. 


PD00126A 22.53 L000e-08 485- 
506 


1535 


PR00671 


INHIBIN BETA B CHAIN 
SIGNATURE 


PR00671C 4.18 9.690e-09 44-64 


1538 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 9.217e-14 89-120 
BL00107B 13.31 6.786e-ll 159- 
175 


1538 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 6.766e-09 89-108 


1539 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 7.000e-09 131- 
147 


1540 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 1.786e-10 813-828 


1541 


DM01418 


352 FIBRILLAR COLLAGEN 
CARBOXYL-TERMINAL. 


DM01418A 20.83 1.000e-40 631- 
679 DM01418C 20.48 1.000e-40 
800-842 DM0 141 8B 22.51 4.103e- 
38 729-771 


1541 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420A 20.42 2.180e-10 549- 
578 


1541 


BL01113 


Clq domain proteins. 


BL01113A 17.99 1.957e-10 555- 
582 BL01113A17.99 2.149e-10 
549-576 BL01113A 17.99 7.3 19e- 
10 552-579 


1542 


BL00279 


Membrane attack complex components 
/ perforin proteins. 


BL00279E 37.11 9.053e-12 901- 
949 


1542 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 8.274e-ll 1016- 
1033 


1542 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 3.836e-10 918- 
939 


1542 


DM00864 


EGF-LIKE DOMAIN. 


DM00864B 11.34 6.910e-10 1025- 
1044 


1542 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 4.150e-14 1020- 
1036 BL01187B 12.04 2.575e-09 
927-943 


1542 


BL01177 


Anaphylatoxin domain proteins. 


BL01177E 20.64 7.632e-09 1018- 
1045 


1542 


BL00799 


Granulins proteins. 


BL00799B 11.02 9.679e-09 1011- 
1047 


1542 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010A 11.79 5.154e-ll 912- 
924 PR00010C 11.16 7.545e-10 
1025-1036 PR00010C 11.16 
9.786e-09 932-943 


1543 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM0003 IB 15.41 7.038e-09 79- 
113 


1544 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 6.318e-19 478- 
496 BL00972D 22.55 7.968e-16 
762-787 BL00972B9.45 1.600e-12 
559-569 


1544 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 8.008e-13 235-250 
PR00049D 0.00 7.375e-12 239-254 
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PR00049D 0.00 5.916e-ll 242-257 
PR00049D 0.00 6.748e-ll 236-251 
PR00049D 0.00 9.395e-l 1 240-255 
PR00049D 0.00 1.286e-10 233-248 
PR00049D 0.00 8.929e-10 241-256 
PR00049D 0.00 2.678e-09 243-258 
PR00049D 0.00 4.051e-09 237-252 
PR00049D 0.00 4.051e-09 238-253 
PR00049D 0.00 4.051e-09 244-259 


1544 


PR00211 


GLUTELIN SIGNATURE 


PR00211B 0.86 7.500e-09 238-259 


1544 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 5.339e-10 222-255 
DM00215 19.43 7.268e-10 226-259 
DM00215 19.43 2.525e-09 220-253 
DM00215 19.43 9.695e-09 221-254 


1544 


BL00048 


Protamine PI proteins. 


BL00048 6.39 9.888e-09 259-286 


1546 


PR00416 


EUKARYOTIC DNA 
TOPOISOMERASE I SIGNATURE 


PR00416B 15.58 5.000e-09 389- 
409 


1546 


BL00048 


Protamine PI proteins. 


BL00048 6.39 8.200e-09 510-537 


1546 


PR00138 


MATRIXIN SIGNATURE 


PR00138E 6.01 8.429e-09 146-160 


1546 


BL00024 


Hemopexin domain proteins. 


BL00024E 7.58 8.694e-09 146-160 


1546 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 4.857e-12 317- 
331 PR00019A 11.19 8.333e-09 
133-147 PR00019B 11.36 8.920e- 
09 269-283 


1547 


DM01595 


kw ALLANTOICASE SPAC1F7.09C. 


DM01595B 10.54 2.625e-15 143- 
171 DM01595I 8.91 2.714e-15 
457-475 DM01595D 10.94 3.274e- 
11 456-481 


1549 


PR00192 


F-ACTIN CAPPING PROTEIN BETA 
SUBUNIT SIGNATURE 


PR00192A8.23 1 .474e-27 47-68 


1549 


BL00231 


F-actin capping protein beta subunit 
proteins. 


BL00231A 8.59 1.829e-24 47-93 


1551 


BL00708 


Prolyl endopeptidase family serine 
proteins. 


BL00708B 24.91 7.197e-12 439- 
470 


1551 


PF00930 


Dipeptidyl peptidase IV (DPP IV) N- 
terminal region. 


PF00930I 15.96 6.373e-17 481-509 
PF00930H 20.16 2.482e-13 402- 
445 PF00930J8.78 1.000e-ll 533- 
554 PF00930G 21.30 9.613e-09 
362-400 


1552 


BL00878 


Orn/DAP/Arg decarboxylases family 2 
pyridoxal-P attachment si. 


BL00878F 19.67 7.23 le-20 171- 
194 BL00878C 17.74 3.676e-ll 
38-58 


1552 


PR00163 


RUBREDOXIN SIGNATURE 


PR00163B 14.45 8.200e-09 143- 
160 


1555 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354V 12.97 1.000e-40 705- 
752 DM01354U 12.24 6.700e-22 
685-705 DM01354T9.48 8.355e- 
15 596-639 DM01354W 12.64 
9.122e-13 752-772 


1557 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354N 13.17 1.000e-40 158- 
203 DM01354O8.73 1.000e-40 
209-255 DM01354Q9.23 l.OOOe- 
40 289-344 DM01354T 9.48 
1.000e-40 398-441 DM01354V 
12.97 1.000e-40 507-554 
DM01354P 9.18 5.213e-37 255-289 
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DM01354R 8.50 6.760e-28 344- 
374 DM01354L 11.73 1.200e-26 
101-128 DM01354S 11.61 7.300e- 
26 377-398 DM01354M 12.50 
6.094e-24 128-158 DM01354U 
12.24 6.700e-22 487-507 
DM01354K9.35 1.228e-17 69-101 
DM01354W 12.64 1.000e-13 554- 
574 


1557 


DM00895 


7 kw REVERSE TRANSCRIPTASE 
RNA POLYMERASE. 


DM00895E 15.72 7.828e-10 271- 
296 


1558 


PR00834 


HTRA/DEGQ PROTEASE FAMILY 
SIGNATURE I 


PR00834F 10.91 2.946e-09 194- 
207 


1558 


BL00741 


Guanine-nucleotide dissociation 
stimulators CDC24 family sign. 


BL00741B 14.27 2.962e-09 1104- 
1127 


1558 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 4.814e-09 1318- 
1333 PR00049D0.00 5.729e-09 
259-274 


1558 


PR00554 


ADENOSINE A2B RECEPTOR 
SIGNATURE 


PR00554B 12.52 8.855e-09 917- 
926 


1560 


PF01140 


Matrix protein (MA), pl5. 


PF01140D 15.54 3.700e-10 977- 
1012 


1565 


BL01226 


Hydroxymethylglutaryl-coenzyme A 
synthase proteins. 


BL01226A 13.79 1.000e-40 50-89 
BL01226C 13.51 1.000e-40 127- 
167 BL01226H 17.74 1.000e-40 
478-526 BL01226G 15.76 3.483e- 
32 384-413 BL01226B 13.35 
1.818e-31 95-127 BL01226F9.78 
8.714e-23 345-363 BL01226E 
13.74 7.716e-22 304-345 
BL01226D 11.60 5.680e-18 226- 
262 BL01226D 11.60 6.940e-10 
174-210 


1568 


BL00189 


2-oxo acid dehydrogenases 
acyltransferase component lipoyl bi. 


BL00189A 19.58 4.414e-13 103- 
138 


1574 


PR00049 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 4.661e-09 254-269 


1574 


PR00456 


RIBOSOMAL PROTEIN P2 
SIGNATURE 


PR00456E 3.06 6.241e-09 184-199 


1577 


PR00756 


MEMBRANE ALANYL 
DIPEPTIDASE (Ml) FAMILY 
SIGNATURE 


PR00756B 14.06 2.364e-13 232- 
248 PR00756C 11.60 7.300e-12 
310-321 


1579 


BL00048 


Protamine PI proteins. 


BL00048 6.39 3.605e-10 636-663 
BL00048 6.39 5.500e-10 630-657 
BL00048 6.39 1.225e-09 628-655 
BL00048 6.39 4.263e-09 648-675 
BL00048 6.39 5.388e-09 627-654 
BL00048 6.39 8.988e-09 626-653 


1580 


BL00284 


Serpins proteins. 


234 BL00284A 15.64 1.000e-18 
63-87 BL00284D 16.34 1.500e-14 
407-434 BL00284B 17.99 1.273e- 
12 162-183 


1581 


BL01052 


Calponin family repeat proteins. 


BL01052B 15.31 3.308e-ll 110- 
136 


1581 


PR00888 


SMOOTH MUSCLE 


PR00888C 12.27 2.141e-09 110- 
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PROTEIN/CALPONIN FAMILY 
SIGNATURE 


126 


1582 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 6.500e-ll 140- 
154 PR00019B 11.36 7.750e-10 
92-106 PR00019A 11.19 5.000e-09 
95-109 


1582 


PR00423 


CELL DIVISION PROTEIN FTSZ 
SIGNATURE 


PR00423E 7.36 9.550e-09 505-527 


1587 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 2.500e-10 277-294 


1587 


BL00518 


Zinc finger, C3HC4 type (RING 
finger), proteins. 


BL00518 12.23 8.286e-10 186-195 


1588 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354R 8.50 7.750e-29 192- 
222 DM01354S 11.61 7.300e-26 
222-243 


1589 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354U 12.24 5.891e-19 62-82 
DM01354V 12.97 1.175e-09 82- 
129 


1590 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 5.065e-13 167-216 


1591 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2, 


DM01354R8.50 1.840e-18 141- 
171 


1591 


BL00300 


SRP54-type proteins GTP-binding 
domain proteins. 


BL00300C 25.57 7.100e-09 648- 
702 


1593 


BL00573 


Pyridine nucleotide- disulphide 
oxidoreductases class-II activ. 


BL00573A 9.65 7.273e-10 750-768 


1593 


PR00411 


PYRIDINE NUCLEOTIDE 
DISULPHIDE REDUCTASE CLASS-I 
SIGNATURE 


PR00411A 15.95 4.569e-09 747- 
770 


1593 


PR00350 


VITAMIN D RECEPTOR 
SIGNATURE 


PR00350E 11.55 5.546e-09 13-33 


1593 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE 
SIGNATURE 


PR00368A 17.76 7.589e-09 747- 
770 


1593 


BL01113 


Clq domain proteins. 


BL01113A 17.99 9.827e-09 484- 
511 


1593 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 8.892e-10 373-383 
PR00308C3.83 1.701e-09 375-385 
PR00308A 5.90 4.282e-09 752-767 
PR00308A 5.90 6.824e-09 372-387 
PR00308C 3.83 1.000e-08 324-334 


1594 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354X 13.86 6.595e-36 13-52 
DM01354Y 10.69 3.455e-09 62- 
102 


1595 


PR00756 


MEMBRANE ALANYL 
DIPEPTIDASE (Ml) FAMILY 
SIGNATURE 


PR00756D 10.58 7.000e-19 187- 
203 PR00756E 11.91 3.700e-15 
206-219 PR00756C 11.60 4.000e- 
12 151-162 


1595 


BL00142 


Neutral zinc metallopeptidases, zinc- 
binding region proteins. 


BL00142 8.38 2.286e-09 187-198 


1596 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354I 15.55 1.000e-40 119- 
160 DM01354V 12.97 1.209e-14 
401-448 DM01354H 18.00 5.280e- 
10 79-119 


1600 


BL00484 


Thyroglobulin type-1 repeat proteins 
proteins. 


BL00484C 17.01 3.854e-ll 96-111 
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1601 


PR00342 


RHESUS BLOOD GROUP PROTEIN 
SIGNATURE 


PR00342A9.71 1.000e-23 12-31 


1608 


PD01941 


TRANSMEMBRANE 
COTRANSPORTER SYMP. 


PD01941E 15.92 1.000e-40 135- 
182 PD01941F 28.52 9.229e-17 
254-309 


1609 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354W 12.64 9.375e-16 123- 
143 


1614 


BL00420 


Speract receptor repeat proteins domain 
proteins. 


BL00420B 22.67 5.808e-26 49-104 
BL00420C 11.90 2.500e-10 134- 
145 


1614 


PR00258 


SPERACT RECEPTOR SIGNATURE 


PR00258B 9.63 7.947e-l 1 64-76 
PR00258C 9.05 3.204e-10 79-90 
PR00258E 13.33 3.333e-10 133- 
146 PR00258D 14.41 3.286e-09 
110-125 


1616 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 1.505e-09 377-418 


1616 


PR00261 


LOW DENSITY LIPOPROTEIN 
(LDL) RECEPTOR SIGNATURE 


PR00261D 12.47 3.338e-10 43-65 
PR00261E 11.08 4.575e-10 84-106 
PR00261C 11.37 5.050e-10 43-65 
PR00261F 11.57 5.920e-10 43-65 
PR00261A 11.02 6.769e-10 43-65 
PR00261A 11.02 9.192e-10 84-106 
PR00261D 12.47 2.667e-09 84-106 
PR00261F 11.57 4.304e-09 84-106 
PR00261C 11.37 5.500e-09 84-106 
PR00261B 14.12 5.714e-09 84-106 
PR00261B 14.12 6.464e-09 43-65 


1616 


BL01209 


LDL-receptor class A (LDLRA) 
domain proteins. 


BL01209 9.31 6,464e-09 93-106 


1616 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 5.613e-13 306- 
357 BL00412D 16.54 8.821e-12 

301- 352 BL00412D 16.54 1.098e- 
10 307-358 BL00412D 16.54 
9.609e-10 304-355 BL00412D 
16.54 1.184e-09 308-359 
BL00412D 16.54 1.918e-09 309- 
360 BL00412D 16.54 2.745e-09 

302- 353 BL00412D 16.54 5.684e- 
09 299-350 BL00412D 16.54 
8.347e-09 303-354 


1616 


PF01140 


Matrix protein (MA), pl5. 


PF01140D 15.54 8.640e-09 319- 
354 


1616 


BL00422 


Granins proteins. 


BL00422C 16.18 8.085e-10 326- 
354 BL00422C 16.18 8.468e-10 
330-358 BL00422C 16.18 8.851e- 
10 323-351 BL00422C 16.18 
4.529e-09 325-353 BL00422C 
16.18 9.471e-09 318-346 


1618 


PR00681 


RIBOSOMAL PROTEIN SI 
SIGNATURE 


PR00681I 8.81 9.897e-09 412-431 


1619 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134A 11.96 3.813e-15 55-72 


1619 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 8.448e-14 56-72 


1619 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 1.763e-13 55-73 
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1619 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 1 1.20 4.945e-09 55-73 


1620 


BL00125 


Serine/threonine specific protein 
phosphatases proteins. 


BL00125C 19.97 1.000e-40 86-133 
BL00125D 33.11 9.679e-38 154- 
209 


1620 


PR00114 


SERINE/THREONINE 
PHOSPHATASE FAMILY 
SIGNATURE 


PR00114E 17.75 2.500e-27 106- 
134 PR00114D 12.34 9.591e-26 
77-104 PR00114F 17.51 3.400e-17 
170-191 PR00114G 17.20 3.676e- 
17 198-215 


1625 


BL01103 


Aspartate-semialdehyde dehydrogenase 
proteins. 


BL01103F 16.78 3.625e-24 256- 
282 BL01103C 15.07 6.727e-20 
91-113 BL01103B 20.29 5.846e-16 
51-73 BL01103E 11.30 4.115e-14 
238-251 BL01103D7.29 1.000e-09 
155-166 


1626 


PD02410 


SYNTHETASE BETA CHAIN 
LIGASE PHENYLALANYL-TRNA. 


PD02410E 10.29 2.241e-13 83-97 


1628 


BL00044 


Bacterial regulatory proteins, lysR 
family proteins. 


BL00044 19.03 7.188e-ll 21-55 


1631 


BL00370 


PEP-utilizing enzymes phosphorylation 
site proteins proteins. 


BL00370C 11.73 9.182e-17 96-110 


1632 


PR00101 


ASPARTATE 

CARBAMOYLTRANSFERASE 
SIGNATURE 


PR00101C 12.35 9.328e-10 410- 
428 


1632 


BL00097 


Aspartate and ornithine 
carbamoyltransferases proteins. 


BL00097C 16.87 1.621e-09 411- 
425 


1634 


BL00530 


Ribonuclease T2 family histidine 
proteins 1. 


BL00530C 11.43 1.000e-17 101- 
113 


1643 


BL00050 


Ribosomal protein L23 proteins. 


BL00050A 23.71 3.659e-18 217- 
250 


1645 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 5.571e-14 479- 
507 PD01719A 12.89 2.000e-09 
666-694 PD01719A 12.89 4.889e- 
09 606-634 PD01719A 12.89 
8. llle-09 419-447 


1647 


BL00564 


Argininosuccinate synthase proteins. 


BL00564D 22.43 2.723e-22 157- 
194 BL00564A 19.93 6.582e-ll 
37-74 


1651 


PR00453 


VON WILLEBRAND FACTOR TYPE 
A DOMAIN SIGNATURE 


PR00453A 12.79 9.719e-13 88-106 
PR00453B 14.65 1.818e-12 125- 
140 PR00453C 12,26 3.769e-10 
246-255 


1652 


DM00031 


IMMUNOGLOBULIN V REGION. 


DM00031B 15.41 9.471e-27 84- 
118 DM00031A 16.80 7.055e-25 
206-254 DM00031A 16.80 5.368e- 
18 20-68 DM00031B 15.41 
8.519e-09 267-301 


1653 


PR00173 


GLUTAMATE-ASPARTATE 
SYMPORTER SIGNATURE 


PR00173F 10.44 8.967e-09 8-28 


1654 


BL01032 


Protein phosphatase 2C proteins. 


BL01032C 6.14 1.000e-10 118-128 


1657 


BL00523 


Sulfatases proteins. 


BL00523A 13.36 2.200e-13 47-64 


1658 


BL00194 


Thioredoxin family proteins. 


BL00194 12.16 4.857e-16 482-495 
BL00194 12.16 3.813e-15 347-360 


1658 


PR00421 


THIOREDOXIN FAMILY 
SIGNATURE 


PR00421B 11.40 3.348e-12 489- 
499 PR00421B 11.40 9.379e-ll 
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354-364 PRUU421A 4.uuue- 
10 346-355 PR00421A 10.15 

a on„ in /loi AC\r\ DDAn/IOir 1 

4.857e-l(J 4ol-4yU rKUlKfZli^ 
13.60 5.714e-10 533-545 


1662 


PF00850 


Histone deacetylase family. 


ripnn ry C f\T~! O OO 1 '"IfZfZn 1 1 1 O/CQ 

PF00850E o.oo l./ooe-zi izoy- 
1295 PF00850F 15.70 1.889e-10 
1307-1340 PF00850G 22.75 
3.724e-10 1346-1388 


1666 


BL00317 


WAP-type 'four-disulfide core* domain 
proteins. 


BL00317B 14.58 7.231e-22 92-114 


1666 


PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C 7.69 6.885e-ll 98-108 
PR00003D 8.10 4.971e-09 108-117 


1666 


BL00313 


Seminal vesicle protein I repeat 
proteins proteins. 


BL00313B 8.87 1.000e-40 31-68 
BL00313C 11.19 1.000e-18 103- 
118 BL00313B 8.87 5.337e-17 25- 
62 BL00313B 8.87 7.593e-lo 19- 

r- s~ T~» T /\/\0 1 in O C7 1 C\ 1 A/=* 1 C A'X 

56 BL00313B 8.8/ l.yiUe-l~> 4J- 
80 BL00313B 8.87 9.413e-14 37- 
74 BL00313B 8.87 4.522e-09 49- 

Oic dt aaoi i a i a &(\ ^nnp no 1 _o 
oo j3L0U3ijA IU.ou D.DUue-uy i-y 

BL00313C 11.19 9.060e-09 30-45 


1667 


PD01066 


PROTEIN ZINC FINGER ZINC- 
FINGER METAL-BINDING NU. 


nr^Ai A£/£ 1 O A'X ^ 1Q*\<=» ^1 1 AT] 


1667 


BL01277 


Ribonuclease PH proteins. 


DLu IZ 1 / J\ I / .J7 O.O 1 OC-U7 oh— i \J-o 


1672 


DM00372 


CARCINOEMBRYONIC ANTIGEN 
PRECURSOR AMINO-TERMINAL 
DOMAIN. 


DM00372B 20.31 8.920e-15 363- 
68-113 


1673 


PF00992 


Troponin. 


PrOU992A lo.o/ o.ooye-iu i /o- 

O 1 1 

211 


1673 


BL00093 


N-4 cytosine-specific DNA methylases 
proteins. 


BL00093A 9.77 4.329e-09 718-733 


1673 


PR00948 


ELICITIN SIGNATURE 


PR00948B 15.44 5.361e-08 707- 

732 


1673 


PR00558 


ALPHA-2A ADRENERGIC 
RECEPTOR SIGNATURE 


DDnn^^sr £ ^4 6 71Qf»-0R 4.9Q-44Q 


1676 


BL00962 


Ribosomal protein S2 proteins. 




1678 


PD00015 


GLYCOPROTEIN PRECURSOR 
CELL SI. 


PD00015A 8.90 6.400e-09 35-43 


1678 


PR00795 


RYANODINE RECEPTOR 
SIGNATURE 


PR00795C6.30 8.252e-09 107-132 


1683 


BL00262 


Insulin family proteins. 


dt AAo/rn a 10/101 QOO<a AO AS 

BL00262A lz.4o i.ozye-uy ju-^o 


1684 


BL00061 


Short-chain dehydrogenases/reductases 
family proteins. 


BL00061B 25.79 8.105e-19 133- 
171 


1684 


PR00080 


ALCOHOL DEHYDROGENASE 
SUPERFAMILY SIGNATURE 


PR00080C 17.16 2.125e-10 153- 
173 


1684 


PR00081 


GLUCOSE/RIBITOL 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR00081A 10.53 7.805e-ll 5-23 
PR00081C 15.13 1.771e-09 127- 
144 


1685 


BL0U983 


t _ . /t / "DAT? A nmnin rvrntpin<5 
J_,y-0 / U-x/t-XS- UUIIid.ni piuuciiio. 


BL00983C 12.69 6.885e-09 197- 
213 


1686 


BL00107 


Protein kinases ATP-binding region 
proteins. 


BL00107A 18.39 7.158e-18 678- 
709 


1686 


PR00653 


ACTIVIN TYPE II RECEPTOR 
SIGNATURE 


PR00653E 15.68 5.114e-16 871- 
891 PR00653D 13.25 7.195e-12 
697-719 


1686 


BL00790 


Receptor tyrosine kinase class V 


BL00790N 13.25 7.505e-09 691- 
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proteins. 


718 


1686 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 8.031e-09 678- 
697 


1687 


BL00649 


G-protein coupled receptors family 2 
proteins. 


BL00649A 24.05 2.333e-22 85-113 


1687 


PR00491 


VASOACTIVE INTESTINAL 
PEPTIDE RECEPTOR SIGNATURE 


PR00491A 11.52 9.509e-10 109- 
121 


1688 


BL50002 


Src homology 3 (SH3) domain proteins 
profile. 


BL50002A 14.19 2.500e-18 154- 
173 


1688 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401A 14.00 5.765e-15 214- 
229 


1688 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001A 10.81 6.250e-ll 154- 
167 


1688 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 11.65 5.765e-14 164- 
180 PR00452A 10.48 8.500e-12 
150-161 PR00452C 10.78 l.OOOe- 
10 181-191 


1688 


PR00499 


NEUTROPHIL CYTOSOL FACTOR 2 
SIGNATURE 


PR00499D 10.18 7.279e-09 152- 
173 


1693 


DM01551 


kw OSTEOINDUCTIVE YOPM 
MEMBRANE OUTER. 


DM01551A 15.63 7.265e-09 559- 
589 


1693 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 2.957e-10 614- 
628 PR00019B 11.36 4.600e-10 
527-541 PR00019B 11.36 L720e- 
09 611-625 PR00019A 11.19 
3.000e-09 660-674 PR00019A 
11.19 7.667e-09 530-544 


1698 


PR00528 


GLUCOCORTICOID RECEPTOR 
SIGNATURE 


PR00528C 14.75 9.438e-09 210- 
231 


1705 


BL00733 


Ribosomal protein S26e proteins. 


BL00733B 12.04 4.66 le-25 165- 
198 


1706 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 6.500e-ll 412- 
425 


1706 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019B 11.36 7.000e-ll 773- 
787 PR00019A 11.19 7.000e-09 
776-790 


1706 


PF00791 


Domain present in ZO-1 and Unc5-like 
netrin receptors. 


PF00791B 28.49 6.242e-13 386- 
441 PF00791C 20.98 7.088e-09 
436-475 PF0079 IB 28.49 7.936e- 
09 455-510 


1706 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 1.000e-ll 419- 
435 PF00023B 14.20 9.591e-09 
415-425 


1708 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 3.000e-13 251-264 
PD00066 13.92 3.571e-12 223-236 


1708 


PR00498 


NEUTROPHIL CYTOSOL FACTOR 1 
SIGNATURE 


PR00498A 12.01 5.765e-09 353- 
362 


1708 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048B 6.02 1.563e-10 220-230 
PR00048A 10.52 2.5o5e-lU 151- 
246 PR00048A 10.52 5.320e-09 
50-64 PR00048A 10.52 8.200e-09 
204-218 


1708 


BL00028 


Zinc finger, C2H2 type, domain 
proteins. 


BL00028 16.07 5.154e-ll 235-252 
BL00028 16.07 3.400e-10 207-224 
BL00028 16.07 9.743e-09 53-70 


1711 


PF00550 


Phosphopantetheine attachment site 


PF00550C 13.05 7.188e-10 235- 



WO 03/054152 



PCT/US02/39555 



Table 3A 
357 



SEQ ID 


Database entry 
ID 


Description 


Results* 






proteins. 


252 


1712 


BL00484 


Thyroglobulin type-1 repeat proteins 
proteins. 


BL00484B 9.04 4.400e-12 289-303 


1712 


BL00222 


Insulin-like growth factor binding 
proteins. 


BL00222B 11.09 7.300e-17 65-81 
BL00222A 11.34 9.438e-ll 45-56 


1715 


PF00075 


RNase H. 


PF00075C 11.58 6.786e-13 60-72 


1717 


BL00477 


Alpha-2-macroglobulin family 
thiolester region proteins. 


BL00477G 19.43 3.739e-34 984- 
1016 BL00477A 13.50 6.000e-32 
174-203 BL00477J 19.04 9.625e- 
31 1301-1332 BL00477F 17.34 
1.931e-30 840-870 BL00477L 
23.51 1.964e-30 1498-1531 
BL00477K 17.42 6.400e-24 1443- 
1467 BL00477E 17.53 7.000e-22 
810-831 BL004771 18.76 6.750e- 
20 1088-1115 BL00477C 15.70 
6.667e-17 291-308 BL00477B 9.05 
7.207e-15 264-277 BL00477H 9.07 
1.333e-14 1057-1069 BL00477D 
12.73 2.059e-14 784-794 


1717 


BL01177 


Anaphylatoxin domain proteins. 


BL01177E 20.64 9.438e-12 845- 
872 


1718 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 8.594e-35 151- 
199 BL00232B 32.79 5.579e-22 
417-465 BL00232A 27.72 l.OOOe- 
20 57-90 BL00232C 10.65 3.613e- 
14 415-433 BL00232B 32.79 
4.872e-ll 534-582 


1718 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 4.545e-15 415- 
433 PR00205A 14.73 5.600e-09 
183-199 


1718 


BL00422 


Granins proteins. 


BL00422C 16.18 3.647e-09 226- 
254 BL00422C 16.18 9.294e-09 
220-248 


1718 


PF01140 


Matrix protein (MA), pl5. 


PF01140D 15.54 2.256e-09 221- 
256 PF01140D 15.54 3.198e-09 
217-252 PF01140D 15.54 9.79 le- 
09 220-255 


1722 


BL00086 


Cytochrome P450 cysteine heme-iron 
ligand proteins. 


BL00086 20.87 8.714e-20 469-501 


1722 


PR00463 


E-CLASS P450 GROUP I 
SIGNATURE 


PR00463H 12.41 5.821e-10 469- 
480 


1722 


PR00408 


MITOCHONDRIAL P450 
SIGNATURE 


PR00408F 11.33 6.000e-10 388- 
407 


1722 


PR00464 


E-CLASS P450 GROUP II 
SIGNATURE 


PR00464I 14.64 1.771e-13 479-503 
PR00464E 18.28 9.617e-ll 389- 
410 PR00464C 18.84 1.000e-10 
330-359 PR00464H 13.32 3.596e- 
09 466-480 


1722 


PR00465 


E-CLASS P450 GROUP IV 
SIGNATURE 


PR00465C 21.23 5.125e-25 332- 
359 PR00465H 17.76 5.500e-22 
479-498 PR00465G 13.06 1.281e- 
17 463-480 PR00465D 14.64 
2.607e-17 390-407 PR00465F 
13.37 6.000e-15 439-458 
PR00465E 15.03 9.217e-15 423- 
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438 PR00465B 16.84 4.908e-09 
113-137 


1722 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385D 13.11 1.667e-09 470- 
480 PR00385C 16.94 4.682e-09 
395-407 PR00385A 14.97 5.865e- 
09 341-359 PR00385E 12.66 
7.000e-09 479-491 


1725 


DM00179 


w KINASE ALPHA ADHESION T- 
CELL. 


DM00179 13.97 7.158e-10 107-117 


1727 


PR00220 


SYNAPTOPHYSIN/SYNAPTOPORIN 
FAMILY SIGNATURE 


PR00220B 15.48 2.800e-31 47-73 
PR00220C 11.05 9.000e-31 100- 
125 PR00220D 8.32 3.842e-30 
132-156 PR00220A 10.93 1.857e- 
28 23-46 PR00220E 3.46 4.273e- 
24 199-218 


1727 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604A 9.13 1.000e-40 26-81 
BL00604C 14.66 1.000e-40 117- 
149 BL00604D 12.28 1.000e-40 
149-184 BL00604E8.32 1.000e-40 
184-226 BL00604F5.96 1.000e-40 
231-276 BL00604B 9.95 2.895e-37 
87-117 BL00604F5.96 1.122e-09 
263-308 


1727 


PR00761 


BINDIN PRECURSOR SIGNATURE 


PR00761A 5.81 3.662e-09 287-304 


1727 


BL00291 


Prion protein. 


BL00291A 4.49 8.138e-09 236-271 


1728 


BL00795 


Involucrin proteins. 


BL00795C 17.06 6.700e-09 159- 
204 


1729 


PR00747 


GLYCOSYL HYDROLASE FAMILY 
47 SIGNATURE 


PR00747E 15.13 8.269e-18 225- 
243 PR00747H 12.76 5.610e-17 
326-347 PR00747C 12.06 4.797e- 
11 145-164 


1730 


PR00209 


ALPHA/BETA GLIADIN FAMILY 
SIGNATURE 


PR00209C 4.56 3.893e-09 77-91 


1731 


PR00244 


NEUROKININ RECEPTOR 
SIGNATURE 


PR00244H 13.12 9.357e-26 294- 
313 PR00244G 11.10 2.286e-15 
268-280 PR00244F 10.47 3.132e- 
15 204-218 PR00244A 10.34 
3.368e-15 29-41 PR00244B 12.45 
1.000e-13 55-65 PR00244D 6.84 
1.964e-13 125-136 PR00244C 
13.44 3.118e-12 108-119 
PR00244E 5.93 8.875e-12 183-193 


1731 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 9.053e-19 275- 
302 PR00237F 13.57 5.800e-18 
234-259 PR00237B 13.50 9.400e- 
17 68-90 PR00237A 11.48 5.765e- 
14 35-60 PR00237D 8.94 2.350e- 
11 133-155 PR00237E 13.03 
1.600e-10 184-208 


1731 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 2.688e-19 229- 
256 BL00237D 11.23 4.429e-13 
285-302 BL00237B 5.28 5.909e-09 
192-204 


1732 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870D 15.74 8.755e-09 223- 
258 


1733 


BL00523 


Sulfatases proteins. 


BL00523A 13.36 8.800e-13 44-61 



WO 03/054152 PCT/US02/39555 



Table 3A 
359 



SEQ ID 


Database entry 
ID 


Description 


Results* 


1734 


PR00464 


E-CLASS P450 GROUP II 
SIGNATURE 


PR00464A 20.47 9.719e-13 167- 
188 


1735 


BL00414 


Profilin proteins. 


BL00414D 15.59 5.909e-10 1037- 
1064 


1735 


PR00392 


PROFILIN SIGNATURE 


PR00392C 18.98 5.592e-10 986- 
1007 PR00392D 12.00 7.652e-09 
1008-1023 


1738 


PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C 7.69 9.357e-09 73-83 


1741 


PR00915 


LUTEOVIRUS GROUP 1 COAT 
PROTEIN SIGNATURE 


PR00915G 15.24 6.875e-10 303- 
325 


1745 


PR00837 


ALLERGEN V5/TPX-1 FAMILY 
SIGNATURE 


PR00837C 17.21 6.143e-20 126- 
143 PR00837A 14.77 1.973e-13 
57-76 


1745 


BL01009 


Extracellular proteins SCP/Tpx- 
1/Ag5/PR-1/Sc7 proteins. 


BL01009D 14.19 4.300e-20 127- 
148 BL01009A 13.75 6.586e-13 
57-75 


1745 


PR00838 


VENOM ALLERGEN 5 SIGNATURE 


PR00838G 16.07 2.033e-17 125- 
145 PR00838D 8.73 4.214e-09 57- 
76 


1748 


PR00920 


SPUMAVIRUS ASPARTIC 
PROTEASE (A9) SIGNATURE 


PR00920C 13.24 7.310e-09 212- 
234 


1749 


PR00672 


INHIBIN BETA C CHAIN 
SIGNATURE 


PR00672D 10.52 6.400e-09 96-113 


1749 


PD01572 


PHOTOSYSTEM II REACTION 
CENTRE T PROTEIN PHOTOS. 


PD01572 8.77 6.917e-09 544-574 


1751 


BL00713 


Sodium: dicarboxylate symporter family 
proteins. 


BL00713A 20.35 2.227e-35 314- 
356 BL00713B 13.04 8.000e-22 
375-395 


1751 


PR00173 


GLUTAMATE-ASPARTATE 
SYMPORTER SIGNATURE 


PR00173C 13.89 7.353e-25 327- 
353 PR00173B 9.62 9.053e-13 19- 
40 


1752 


PR00219 


SYNAPTOBREVIN SIGNATURE 


PR00219C 9.04 7.750e-09 1-21 


1753 


PR00379 


INTEIN SIGNATURE 


PR00379A 16.17 9.780e-09 393- 
407 


1757 


PF00783 


Inositol polyphosphate phosphatase, 
catalytic domain proteins homologu. 


PF00783A 11.91 3.400e-14 539- 
549 


1758 


PR00169 


POTASSIUM CHANNEL 
SIGNATURE 


PR00169B 14.43 2.800e-30 239- 
268 PR00169C 16.31 3.109e-ll 
307-331 


1762 


BL00456 


Sodium: solute symporter family 
proteins. 


BL00456A 22.59 1.000e-40 16-71 
BL00456C 24.55 1.000e-40 154- 
209 BL00456B 18.94 7.158e-28 
92-122 BL00456D6.92 2.862e-09 
450-460 


1763 


DM01253 


BNR MOTIF REPEAT. 


DM01253A 11.72 7.785e-09 454- 
470 


1763 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 2.200e-12 86-100 
PR00019B 11.36 6.400e-10 83-97 
PR00019A 11.19 2.333e-09 278- 
292 PR00019B 11.36 4.960e-09 
275-289 PR00019A 11.19 9.333e- 
09 62-76 


1764 


PD01719 


PRECURSOR GLYCOPROTEIN 
SIGNAL RE. 


PD01719A 12.89 5.714e-14 421- 
449 


1765 


PR00824 


HEPATIC LIPASE SIGNATURE 


PR00824A 7.81 7.214e-22 6-25 


1768 


BL00979 


G-protein coupled receptors family 3 


BL00979A 19.66 1.000e-40 77-125 
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proteins. 




1769 


PF00168 


C2 domain proteins. 


PF00168C 27.49 4.000e-12 410- 
436 


1769 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 5.345e-12 292- 1 
306 PR00360B 13.61 1.000e-ll 
427-441 PR00360A 14.59 2.895e- 
09 398-411 


1769 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399B 14.27 8.425e-12 385- 
399 PR00399D 14.48 7.796e-09 
328-339 


1771 


BL00312 


Glycophorin A proteins. 


BL00312B 9.22 5.781e-10 717-746 


1771 


BL00446 


RNA polymerases D / 30 to 40 Kd 
subunits proteins. 


BL00446C 18.90 9.087e-09 1326- 
1353 


1771 


PR00122 


VACUOLAR ATP SYNTHASE 16 KD 
SUBUNIT SIGNATURE 


PR00122D 9.97 9.330e-09 716-740 


1771 


PR00205 


CADHERIN SIGNATURE 


PR00205B 11.39 5.745e-10 225- 
243 PR00205B 11.39 4.966e-09 
333-351 PR00205B 11.39 9.390e- 
09 441-459 1 


1771 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 9.308e-18 227- 
275 BL00232B 32.79 9.206e-17 
443-491 BL00232B 32.79 3.407e- 
11 118-166 BL00232C 10.65 
4.1 15e-10 225-243 BL00232C 
10.65 4.462e-10 546-564 
BL00232C 10.65 7.404e-10 333- 
351 BL00232C 10.65 9.842e-09 
441-459 


1773 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354Z 9.06 1.000e-16 102-151 


1774 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259C 16.40 5.313e-17 513- 
542 PR00259B 14.81 4.000e-14 
486-513 PR00259D 13.50 2.340e- 
13 676-703 


1774 


BL00421 


Transmembrane 4 family proteins. 


BL00421B 17.62 1.000e-27 492- 
531 BL00421E 20.97 6.21 le-13 
673-703 


1775 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 8.258e-17 319-335 
PR00245A 18.03 9.265e-16 52-74 
PR00245B 10.38 6.143e-14 258- 
273 


1775 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 5.974e-12 83-123 
BL00237A 27,68 5.974e-12 171- 
211 


1775 


PR00534 


MELANOCORTIN RECEPTOR 
FAMILY SIGNATURE 


PR00534A 11.49 6.123e-10 44-57 


1775 


PR00237 


RHODOPS IN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237E 13.03 9.333e-ll 280- 
304 PR00237A 11.48 5.935e-09 
19-44 


1778 


BL00417 


Synaptobrevin proteins. 


BL00417B 18.48 2.414e-19 141- 
195 BL00417A7.74 6.704e-13 
113-141 


1778 


PR00219 


SYNAPTOBREVIN SIGNATURE 


PR00219A 8.98 9.156e-13 118-138 


1780 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290B 13.17 2.385e-15 255- 
273 BL00290A 20.89 1.529e-14 
34-57 BL00290A 20.89 6.684e-13 
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198-221 


1782 


BL00211 


ABC transporters family proteins. 


BL00211B 13.37 2.385e-17 558- 
590 


1782 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 5.622e-09 448-470 


1782 


PR00671 


INHIBIN BETA B CHAIN 
SIGNATURE 


PR00671B 4.29 9.408e-09 396-416 


1782 


PR00300 


ATP-DEPENDENT CLP PROTEASE 
ATP-BINDING SUBUNIT 
SIGNATURE 


PR00300A 9.56 9.649e-09 451-470 


1783 


PR00176 


SODIUM/NEUROTRANSMITTER 
SYMPORTER SIGNATURE 


PR00176C 10.84 8.615e-25 283- 
310 PR00176A 16.82 5.909e-24 
211-233 PR00176B7.31 4.600e-23 
240-260 PR00176D9.02 6.143e-21 
412-430 PR00176F 10.73 1.333e- 
20 548-568 PR00176E 11.41 
2.227e-20 494-5 15 PR00176G 
12.48 5.034e-17 630-651 
PR00176H 15.27 7.429e-17 670- 
691 


1783 


BL00610 


Sodiumineurotransmitter symporter 
family proteins. 


BL00610A 17.73 1.000e-40 211- 
261 BL00610B 23.65 1.000e-40 
274-324 BL00610D 20.97 l.OOOe- 
40 451-504 BL00610F 29.02 
1.000e-40 641-696 BL00610E 
20.34 9.100e-37 544-587 
BL00610G 12.89 6.087e-22 700- 
723 BL00610C 12.94 4.282e-15 j 
385-437 


1784 


BL00243 


Integrins beta chain cysteine-rich 
domain proteins. 


BL00243I 31.77 7.592e-09 799-842 


1785 


BL00349 


CTF/NF-I proteins. 


BL00349A 10.07 1.000e-40 649- 
695 BL00349C9.33 1.000e-40 
723-766 BL00349E 10.79 9.833e- 
36 793-836 BL00349B 10.51 

I. 205e-34 695-723 BL00349D 

II. 70 1.000e-33 766-793 


1786 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 9.100e-ll 243- 
257 PR00019B 11.36 9.100e-10 
431-445 PR00019A 11.19 2.000e- 
09 434-448 PR00019A 1 1.19 
4.667e-09 482-496 PR00019B 
11.36 6.040e-09 643-657 
PR00019B 11.36 6.400e-09 597- 
611 PR00019B 11.36 9.640e-09 
479-493 


1789 


PR00702 


ACRIFLAVIN RESISTANCE 
PROTEIN FAMILY SIGNATURE 


PR00702G 9.63 4.214e-22 74-98 
PR00702F 12.14 1.474e-19 49-73 
PR00702H 12.94 7.955e-17 153- 
171 PR00702I 17.85 7.857e-16 
216-231 


1790 


DM00914 


BINDING-PROTEIN-DEPENDENT 
TRANSPORT SYSTEMS INNER 
MEMBRANE. 


DM00914C 21.08 8.457e-35 121- 
168 DM00914B 22.92 5.750e-30 
45-86 


1790 


PR00797 


STREPTOPAIN (C10) CYSTEINE 
PROTEASE FAMILY SIGNATURE 


PR00797F 12.40 9.804e-09 41-63 


1791 


BL00942 


glpT family of transporters proteins. 


BL00942E 21.14 9.700e-40 104- 
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145 BL00942D 18.76 1.273e-31 
55-92 BL00942C 14.04 2.688e-21 
13-33 BL00942F 15.07 7.632e-21 
162-180 


1792 


BL00523 


Sulfatases proteins. 


BL00523C 12.64 5.800e-12 593- 
604 


1796 


PR00019 


LEUCINE-RICH REPEAT 
SIGNATURE 


PR00019A 11.19 7.261e-10 315- 
329 PR00019B 11.36 4.600e-09 
312-326 


1798 


PR00303 


PREPROTEIN TRANSLOCASE 
SECY SUB UNIT SIGNATURE 


PR00303E 14.69 1.429e-22 98-122 
PR00303D 16.15 8.650e-22 72-98 
PR00303C 15.93 9.036e-19 34-58 


1798 


BL00755 


Protein secY proteins. 


BL00755C 14.21 2.500e-18 89-105 


1798 


PR00825 


VESPID VENOM ALLERGEN 
PHOSPHOLIPASE Al SIGNATURE 


PR00825B 14.81 9.738e-09 234- 
255 


1800 


PR00081 


GLUCOSE/RIBITOL 
DEHYDROGENASE FAMILY 
SIGNATURE 


PR00081A 10.53 1.439e-ll 179- 
197 


1800 


PR00669 


INHIBIN ALPHA CHAIN 
SIGNATURE 


PR00669B 8.27 2.976e-09 80-97 


1801 


BL01022 


PTR2 family proton/oligopeptide 
symporters proteins. 


BL01022E 23.51 1.173e-12 474- 
510 BL01022D9.42 3.455e-ll 12- 
25 


1802 


BL01187 


Calcium-binding EGF-like domain 
proteins pattern proteins. 


BL01187B 12.04 2.350e-14 489- 
505 


1802 


PR00792 


PEPSIN (Al) ASPARTIC PROTEASE 
FAMILY SIGNATURE 


PR00792B 12.78 7.750e-14 331- 
345 PR00792C9.10 1.000e-12 
380-392 


1802 


BL00141 


Eukaryotic and viral aspartyl proteases 
proteins. 


BL00141C 9.74 1.000e-ll 336-346 
BL00141D 6.28 3.700e-ll 380-390 


1802 


BL01177 


Anaphylatoxin domain proteins. 


BL01177C 17.39 8.780e-10 483- 
502 


1804 


PR00165 


ANION EXCHANGER SIGNATURE 


PR00165F 10.39 9.667e-12 32-51 


1804 


BL00219 


Anion exchangers family proteins. 


BL00219L 18.71 1.000e-40 126- 
165 BL00219N 10.66 6.164e-31 
194-238 BL00219M9.98 3.100e- 
17 148-194 BL00219P 19.59 
6.123e-13 247-274 BL00219H 
10.06 4.555e-ll 11-59 BL00219K 
12.73 8.780e-10 84-126 


1805 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 3.118e-16 786- 
804 BL00972D 22.55 1.500e-14 
1281-1306 BL00972E 20.72 
6.850e-13 1308-1330 BL00972B 
9.45 7.923e-10 882-892 


1810 


BL00216 


Sugar transport proteins. 


BL00216B 27.64 8.531e-13 736- 
786 


1810 


DM00973 


3 kw RESISTANCE BENOMYL 
YLL028W CYCLOHEXIMIDE. 


DM00973A 21.17 1.391e-ll 708- 
745 


1813 


PD00066 


PROTEIN ZINC-FINGER METAL- 
BINDI. 


PD00066 13.92 9.308e-15 288-301 


1813 


PR00048 


C2H2-TYPE ZINC FINGER 
SIGNATURE 


PR00048B 6.02 7.188e-10 285-295 
PR00048A 10.52 8.043e-10 269- 
283 PR00048A 10.52 3.880e-09 
297-311 


1813 


BL00028 


Zinc finger, C2H2 type, domain 


BL00028 16.07 1.692e-ll 515-532 
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proteins. 


BL0002S 16.07 4.000e-10 487-504 
BL00028 16.07 1.257e-09 773-790 
BL00028 16.07 5.371e-09 272-289 


1813 


PF01059 


NADH-ubiquinone oxidoreductase 
chain 4, amino terminus. 


PF01059A 13.84 6.192e-09 19-53 


1815 


BL00134 


Serine proteases, trypsin family, 
histidine proteins. 


BL00134B 15.99 8.560e-17 316- 
340 BL00134A 11.96 4.214e-16 
53-70 


1815 


BL00495 


Apple domain proteins. 


BL00495M 8.50 5.071e-16 226-261 
BL00495N 11.04 4.438e-15 308- 
343 


1815 


PR00722 


CHYMOTRYPSIN SERINE 
PROTEASE FAMILY (SI) 
SIGNATURE 


PR00722A 12.27 4.414e-14 54-70 
PR00722C 10.87 9.471 e- 13 315- 
328 


1815 


BL00021 


Kringle domain proteins. 


BL00021B 13.33 6.580e-16 53-71 
BL00021C 22.21 2.125e-10 235- 
257 


1815 


BL01253 


Type I fibronectin domain proteins. 


BL01253G 11.34 4.316e-12 315- 
329 BL01253E 16.01 9.899e-ll 
224-261 BL01253D 4.84 5.920e-09 
53-67 


1815 


PR00839 


V8 SERINE PROTEASE FAMILY 
SIGNATURE 


PR00839B 11.20 6.425e-09 53-71 


1816 


DM01354 


kw TRANSCRIPTASE REVERSE II 
ORF2. 


DM01354H 18.00 5.618e-12 134- 
174 


1817 


BL00859 


GTP cyclohydrolase I proteins. 


BL00859B 13.15 9.057e-09 72-113 


1819 


PR00700 


PROTEIN TYROSINE 
PHOSPHATASE SIGNATURE 


PR00700D 12.47 2.421e-20 1431- 
1450 PR00700C 13.17 5.765e-17 
1390-1408 PR00700F 11.18 
7.429e-12 1478-1489 


1819 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E5.51 1.656e-09 1186- 
1211 


1819 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 1.000e-14 1434- 
1445 BL00383D 11.92 1.692e-14 
1394-1407 BL00383A 13.34 
6.850e-14 1280-1295 BL00383F 
15.51 4.240e-13 1472-1488 
BL00383C 10.10 6.625e-10 1318- 
1329 BL00383B 7.61 4.000e-09 
1305-1314 


1819 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 1.450e-10 688- 
707 PR00014C 15.44 6.400e-10 
866-885 PR00014C 15.44 2.565e- 
09 510-529 PR00014C 15.44 
5.696e-09 1056-1075 


1821 


PR00245 


OLFACTORY RECEPTOR 
SIGNATURE 


PR00245C 7.84 4.073e-15 361-377 
PR00245E 12.40 8.286e-12 414- 
429 


1821 


PR00237 


RHODOPSIN-LIKE GPCR 
SUPERFAMILY SIGNATURE 


PR00237G 19.63 5.814e-09 395- 
422 


1822 


PF00242 


DNA polymerase (viral) N-terminal 
domain proteins. 


PF00242F 12.18 8.522e-09 322-344 



* Results include in order: accession number subtype; raw score; p-value; position of signature in 
amino acid sequence 
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912 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 6.73e-29 124-163 


912 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19.16 2.06e-20 164-190 


912 


IPB001398C 


Macrophage migration inhibitory factor 
family 


26.46 6.96e-ll 192-238 


913 


IPB000033A 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


21.82 4.35e-09 514-536 


913 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 4.48e-10 817-827 


913 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 7.10e-10 574-584 


913 


IPB000118B 


Granulin 


7.94 9.20e-09 472-510 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 1 .96e-11484-499 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.84e-14 696-711 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.86e-13 569-584 


913 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 6.54e-17 812-827 


913 


IPB000561 


EGF-like domain 


4.89 3.57e-09 821-829 


913 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.02e-10 175-190 


913 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 4.79e-ll 696-707 


913 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 5.20e-12 812-823 


913 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 8.58e-ll 527-538 


913 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 8.71e-10 569-580 


913 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 9.53e-ll 484-495 


913 


IPB002861B 


Reeler domain 


10.50 4.48e-09 124-152 


913 


IPB002861B 


Reeler domain 


10.50 8.98e-09 181-209 


913 


IPB002861C 


Reeler domain 


23.17 6.48e-09 111-165 


913 


IPB003367A 


Thrombospondin type 3 repeat 


11.78 5.13e-09 782-802 


913 


IPB003367A 


Thrombospondin type 3 repeat 


11.78 5.83e-10 577-597 


913 


IPB003886D 


Extracellular domain in nidogen 


13.91 3.49e-09 569-588 


913 


IPB003886D 


Extracellular domain in nidogen 


13.91 4.7Se-13 696-715 


913 


IPB003886D 


Extracellular domain in nidogen 


13.91 9.41e-10 812-831 


913 


PR00010C 


Type II EGF-like signature III 


6.98 3.63e-09 574-584 


913 


PR00010C 


Type II EGF-like signature III 


6.98 8.02e-09 701-711 


913 


PR00010C 


Type II EGF-like signature III 


6.98 9.47e-13 817-827 


913 


PR00907B 


Thrombomodulin signature II 


11.50 9.43e-09 808-824 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 8.09e-10 812-838 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 8.85e-10 696-722 


913 


PR00907G 


Thrombomodulin signature VII 


10.43 9.27e-09 569-595 


913 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


10.57 2.00e-09 187-204 


914 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 9.76e-26 2-41 


914 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19.16 6.31e-18 42-68 


914 


IPB001398C 


Macrophage migration inhibitory factor 
family 


26.46 5. 14e-12 70-116 


915 


IPB001398A 


Macrophage migration inhibitory factor 
family 


23.60 3.25e-20 2-41 
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915 


IPB001398B 


Macrophage migration inhibitory factor 
family 


19. lo z.uoe-zu jj-oI 


915 


IPB001398C 


Macrophage migration inhibitory factor 
family 


zo. 4t> /.jye-jo oj-i/y 


916 


IPB000001B 


Kringle 


1 2 DA R OQp 1 Q 9R6 110 


916 | 


IPB000001C 


Kringle 


on /11 o oi*» oo in 1^1 


916 


IPB000001D 


Kringle 


11 11 1 ^1/=. 1 1 AfkQ AR^ 


916 


IPB000001E 


Kringle 


1£ AO S ^S<=> 11 ^AA-^l 
10. UZ o.DoQ-\.D D L tH"DJ I 


916 


IPB000001F 


Kringle 


10.14 7.65e-ll 567-577 


916 


IPB000001G 


Kringle 


10 00 i oria i r coo £ 1 £ 

z9.zy z.oUe-io joo-o i o 


916 


IPB000001H 


Kringle 


lO O/l 1 A ^ 1 1 /COS £1Q 

lz.z4 j.4->e-lj> ozo-000 


916 


IPB000177M 


Apple domain 


C\ 1 O A OTo in </1^ KHQ 

y. 1 0 4.oze-lU /y 


916 


IPB000177N 


Apple domain 


in m o^/a on £1 c ^^0 
1U. 1 / d.ZdQ-Z\J OlO-ODZ 


916 


IPB000177O 


Apple domain 


14. 0. LdQ-1 / OJJ-Oo 1 l 


916 


IPB000561 


EGF-like domain 


a on a 01 a no o^o 00^ 
4.©y 4.zie-uy zo /-z / j 


916 


IPB000562 


Type II fibronectin collagen-binding 
domain 


m orv ft ftC Q oft 100 1/tQ 

lu.jy y.uDe-zu ijz-145 


916 


IPB001254A 


Serine proteases, trypsin family 


n no & fiQo 1/1 /i^Q AP.^ 
y.y 0 o.uye-14 4oy- £ foo 


916 


IPB001254B 


Serine proteases, trypsin family 


1 ^ ni 1 sAo 1 1 fSifx AAQ 

1D.U1 j.DUe-i / ozo-o^fy 


916 


IPB001254C 


Serine proteases, trypsin family 


lo. J>4 l.UUe-lo ODO-O/D 


916 


IPB001774D 


Delta serrate ligand 


1 ft c oOc no 1 <o 1 OS 

19. z3 D.oze-uy iDz-iyo 


916 


IPB001881A 


Calcium-binding EGF-like domain 


0 00 1 nrio no oa^ o*CA 
0. /Z l.UUe-Uo Z40-ZD4 


916 


IPB001881A 


Calcium-binding EGF-like domain 


O T1 Q OAo nO 1/^/1 101 

0. /z o./ue-uy 104-1/0 


916 


PR00010A 


Type II EGF-like signature I 


10 01 ^ ^/io in 1 ^n T71 
iz.yi o.D4e-iu iou-1/1 


916 


PR00011B 


Type III EGF-like signature II 


11 no £ /to** no 1 oq 107 
13. Uo o.4ze-uy 1 /y-iy/ 


916 


PR00011B 


Type III EGF-like signature II 


11 no. o iQa no o^n 07R 
1 j,Uo y. Joe-uy zou-z / 0 


916 


PR00013A 


Fibronectin type II repeat signature I 


12.70 4.60e-10 105-114 


916 


PR00013B 


Fibronectin type II repeat signature II 


11 ao r iCn in 1 i£ 109 

11. to D.zje-iu 110-izo 


916 


PR00013C 


Fibronectin type II repeat signature III 


12.27 9.47e-19 132-147 


916 


PR00018A 


Kringle domain signature I 


12.23 8. 84e-12 286-301 


916 


PR00018B 


Kringle domain signature II 


16.62 1.75e-09 302-314 


916 


PR00018C 


Kringle domain signature III 


16.77 7.00e-21 331-351 


916 


PR00018D 


Kringle domain signature IV 


-1 <-» OO ft ft /I ^ 1 ft 1 £" ZT O /T T 

12.83 9.04e-10 356-367 


916 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9. 36e-14 470-485 


916 


PR00722C 


Chymotrypsin serine protease family (SI) 
signature III 


1 ft n a 0 /Co „ 1 a /co^ £07 
10.74 b\oze-i4 ozD-03 / 


916 


PR01236A 


Tumour necrosis factor beta (lymphotoxin- 
alpha) signature I 


/i oo c ooo no 00 is 
4.yz o.zze-uy zz-00 


917 


IPB000034A 


Laminin B 


OO 01 1 £Q<=» 1 A A1Q A*\A 

zz.zi j.oye- iu ly-'+j^r 


917 


IPB000034A 


Laminin B 


00 01 n oo** 11 in/i^ insi 

ZZ.Zl /.ZZe-lO 1U40-1U51 


917 


IPB000034A 


Laminin B 


OO OI Q AO<* 1 n 0Q1 10*^ 

zz.zi y.H-ze-iu zy i-jzd 


917 


IPB000034B 


Laminin B 


OI OO Q ^Qp 1 1 snQ SlAf\ 
Zl.ZZ o.joc-li OUo-o'+O 


917 


IPB000034B 


Laminin B 


OI OO O m 0 1 Q /Ml /1S1 

z 1 .zz y .u / e- 1 y 443-40 1 


917 


IPB00D034C 


Laminin B 


10 c\n 1 ioq in</io ^^n 
lz.y/ 1. ize-iu j4Z-dou 


917 


IPB000034C 


Laminin B 


10 00 0 0O0 no sn^ 5201 
lz.y / z.zye-uy oud-ozo 


917 


lrBUU0U34C 


Laminin B 


12.97 3. 10e-10 492-510 


917 


IPB000034C 


Laminin B 


12.97 5.09e-09 897-915 


917 


IPB000034C 


Laminin B 


12.97 5.29e-12 1067-1085 


917 


IPB000034C 


Laminin B 


12.97 8.17e-14 955-973 


917 


IPB000034C 


Laminin B 


12.97 8.43e-ll 853-871 


917 


IPB000034C 


Laminin B 


12.97 8.62e-12 1114-1132 


917 


IPB001134C 


Netrin, C-terminus 


17.82 9.77e-09 1014-1028 


917 


IPB001774C 


Delta serrate ligand 


18.25 9.27e-09 1105-1147 
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917 


IPB001886A 


Lamimn N-terminal (Domain VI) 


O C 1 C\ A A ~ 1 A 1 1 n 

8.51 9.44e-10 119-126 


917 


IPB001886B 


Lammin N-terminal (Domain VI) 


15.20 4.81e-24 269-291 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 


f\ A CA O 1 A _ 11 O T-l A 1 £ 

24.54 2.14e-13 377-416 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 


24.54 6.21e-ll 1009-1048 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 


<"» A CA £ CC 1 <-> 1 f\/C~T 1 1 f\{Z 

24.54 6.55e-12 1067-1106 


917 


IPB001886C 


Lammin N-terminal (Domain VI) 


/-» a c a '"t r\ a ~ 11 ncc c\c\A 

24.54 7.04e-ll 955-994 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 


24.54 8.03 e- 12 440-479 


917 


IPB001886C 


Laminin N-termmal (Domain VI) 


24.54 8.41e-32 312-351 


917 


IPB001886C 


Laminin N-terminal (Domain VI) 


24.54 8.81e-09 853-892 


917 


IPB001886D 


Laminin N-terminal (Domain VI) 


12.51 3.39e-13 893-911 


917 


IPB001886D 


Laminin N-terminal (Domain VI) 


12.51 5.ooe-Uy ovo-dZo 


917 


IPB001886D 


IT ••"VT* * 1 /"T"V * X 7"T\ 

Laminin N-terminal (Domain VI) 


ia c 1 t OA- 1/1 i aac i Ann 

12.51 7.20e-14 1005-1023 


917 


IPB001886D 


Laminin N-termmal (Domain VI) 


1 a C1 A £. A ^ OA 

12.51 9.64e-20 3 /3-3yl 


917 


IPB001886E 


Lamimn N-terminal (Domain VI) 


-i r\ r\r\ -t C C „ 1 1 /i O O A C A 

10.90 1.55e-ll 438-454 


917 


IPB001886E 


Lammin N-termmal (Domain VI) 


1AAAL1A,-. 1 1 1 A/CC 1 AO 1 

10.90 2.20e-12 lUoD-lOol 


917 


IPB001886E 


Lammin N-terminal (Domain VI) 


1 A AA O AOn 1 1 AC\f\ CA/C 

10.90 5.02e-ll 4yo-->uo 


917 


IPB001886E 


Lamimn N-termmal (Domain VI) 


1A AA /C nf\„ n or 1 Q/CT 

10.90 6.00e-12 oDl-oo/ 


917 


IPB001886E 


Lamimn N-terminal (Domain VI) 


1AAA/T1/1~ K C/IA C C 

10.90 6. 14e-15 540-556 


917 


IPB001886E 


T • * "K T j • 1 S~t~\. ' t 7"T\ 

Lamimn N-terminal (Domain VI) 


i n A A r oi- 11 iin 11^0 

10.90 6.33e-ll 1112-1126 


917 


IPB001886E 


Lammin N-termmal (Domain VI) 


-i r\ r\r\ i /IT „ 11 Tin 0 0 iC 

10.90 7.43e-ll 310-326 


917 


IPB001886E 


Lamimn N-termmal (Domain VI) 


1 A AA A m« 1 *7 O AO OI A 

10.90 y.o /e-i / oOo-oiy 


917 


IPB003884C 


Factor I membrane attack complex 


ia nf\ o oc-» aa /i 1 1 /ton 

13.00 3.35e-09 41 l-42y 


917 


IPB003884C 


Factor I membrane attack complex 


13.00 3.87e-09 833-85 1 


917 


IPB003888L 


FY-rich domain N-terminus 


21.19 8.82e-10 339-385 


917 


PR00011A 


Type III EGF-like signature I 


14.05 2.02e-14 485-503 


917 


PR00011A 


Type III EGF-like signature I 


14.05 2.76e-17 1060-1078 


917 


PR00011A 


Type III EGF-like signature I 


14.05 3.35e-ll 433-451 


917 


PR00011A 


Type III EGF-like signature I 


1 j r\ r a At io 'v/^ o o 1 /" 

14.05 4.41e-13 798-816 


917 


PR00011A 


Type III EGF-like signature I ! 


14.05 4.72e-16 535-553 


917 


PR00011A 


Type III EGF-like signature I 


14.05 7.30e-15 846-864 


917 


PR00011A 


Type III EGF-like signature I 


14.05 7.50e-09 305-323 


917 


PR00011A 


Type III EGF-like signature I 


1 A f\ C O O /I 1/T 1 1 m 1 1 Af 

14.05 8.24e-16 1107-1125 


917 


PR00011B 


Type III EGF-like signature II 


10 rkO A 1 T _ 1/1 /IOC f AO 

13.08 4.17e-14 485-503 


917 


PR00011B 


Type III EGF-like signature II 


10 rvo a oc^ ic 1 1 ao 1 nc 

13.08 4.35e-15 1107-1125 


917 


PR00011B 


Type III EGF-like signature II 


-i o i-io /i o i ^ o/i/c o /C /l 

13.08 4.86e-17 846-864 


917 


PR00011B 


Type III EGF-like signature II 


fO rtO ZT /t /T ^. 1 O 7A O O 1 /C 

13.08 6.46e-12 798-816 


917 


PR00011B 


Type III EGF-like signature II 


i o r\ o ^ a^t« ia /iio /i c 1 

13.08 6.96e-10 433-45 1 


917 


PR00011B 


Type III EGF-like signature II 


11 AO T71^ 1CCOC CC3 

13.08 7.71e-15 535-553 


917 


PR00011B 


Type III EGF-like signature II 


1 O rvO O AA„ 1/11 A/TA 1 ATO 

13.08 8.00e-14 1060-1078 


917 


PR00011C 


Type III EGF-like signature III 


OC OA 1 1 C „ A A /ICC /lOO 

25.79 1.15e-09 455-483 


917 


PR00011C 


Type III EGF-like signature III 


OC ^7 A £ 11 „ 1A/1/1A 

25.79 6.21e-10 440-468 


917 


PR00011C 


Type III EGF-like signature III 


25.79 9.82e-12 817-845 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 1.00e-16 1107-1125 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 2.62e-14 846-864 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 3.00e-13 535-553 


917 


PR00011D 


Type III EGF-like signature IV 


t o io c aa~ 1 a /ioc cao 

12.12 5.00e-13 485-503 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 5.50e-14 1060-1078 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 6.23e-l 1433-451 


917 


PR00011D 


Type III EGF-like signature IV 


12.12 7.98e-12 798-816 


918 


IPB000033B 


Low-density lipoprotein (Ml) receptor, 
YWTD repeat 


7.05 1.00e-09 304-314 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 1.00e-10 132-142 
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918 


IPB000033B 


Low-density lipoprotein (Idl) receptor, 
YWTD repeat 


7.05 4.19e-10 344-354 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 5. 74e-09 384-394 


918 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 7.97e-10 91-101 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.12e-15 379-394 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.37e-14 299-314 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.45e-13 46-61 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 5.76e-15 339-354 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 7.55e-13 127-142 


918 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 8.77e-13 86-101 


918 


IPB000561 


EGF-like domain 


4.89 5.50e-09 55-63 


918 


IPB000859 


CUB domain 


19.99 7.67e-17 721-756 


918 


IPB001881B 


Calcium-binding EGF-like domain 


+ t*\ *-\ C\ 1 A i-t 11 1 r*\ >~ r lOO 

12.28 1.47e-ll 127-138 


918 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 1.47e-l 1 299-310 


918 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 5.00e-14 379-390 


918 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 5.26e-ll 86-97 


918 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 7.00e-13 339-350 


918 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 8.71e-10 46-57 


918 


IPB002557B 


Chitin binding domain 


12.64 3.08e~09 653-666 


918 


IPB003367A 


Thrombospondin type 3 repeat 


11.78 7.62e-14 54-74 


918 


IPB003886D 


Extracellular domain in nidogen 


13.91 1.54e-13 46-65 


918 


IPB003886D 


Extracellular domain in nidogen 


13.91 2.13e-ll 339-358 


918 


IPB003886D 


Extracellular domain in nidogen 


13.91 9.17e-09 379-398 


918 


PR00010C 


Type II EGF-like signature III ! 


6.98 4.48e-10 132-142 


918 


PR00010C 


Type II EGF-like signature III 


6.98 5.61e-09 304-314 


918 


PR00010C 


Type II EGF-like signature III 


6.98 5.83e-09 384-394 


918 


PR00010C 


Type II EGF-like signature III 


6.98 6.05e-09 51-61 


918 


PR00010C 


Type II EGF-like signature III 


6.98 8.02e-09 91-101 


918 


PR00907B 


Thrombomodulin signature II 


11.50 9.16e-10 215-231 


918 


PR00907G 


Thrombomodulin signature VII 


10.43 6.84e-10 46-72 


918 


PR00907G 


Thrombomodulin signature VII 


10.43 7.5 le-10 379-405 


919 


IPB001415A 


Parathyroid hormone family 


18.60 2.50e-37 80-114 


919 


IPB001415B 


Parathyroid hormone family 


26.47 1.00e-40 133-181 


920 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 5.32e-ll 144-154 


920 1 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 6.76e-ll 103-113 


920 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 8.92e-ll 17-27 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 3.05e-13 139-154 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 5.26e-14 12-27 


920 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 9.59e-13 98-113 


920 


IPB000561 


EGF-like domain 


4.89 1.00e-09 252-260 
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920 


IPB000561 


EGF-like domain 


4.89 1.00e-09 67-75 


920 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 L43e-10 12-23 


920 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 6.57e-10 139-150 ! 


920 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 7.86e-10 98-109 


920 


PR00010C 


Type II EGF-like signature III 


£ no o £L"i ~ no i t\i i 1 q 

6.98 3.63e-09 103-113 


920 


PR00010C 


Type II EGF-like signature III 


r\ o r-t i*a_ nn 1 A A 1 C A 

6.98 7.59e-09 144-154 


920 


PR00011A 


Type III EGF-like signature I 


1 a nc o r a„ nn on o/to 

14.05 3.50e-09 33I-34y 


920 


PR00011A 


Type III EGF-like signature I 


14.05 6.09e-14 2oo-3Uo 


920 


PR00011A 


Type III EGF-like signature I 


i a c\c *~t c\c\ n 1 o o n /I 

14.05 7.90e-12 3/6-394 


920 


PR00011B 


Type III EGF-like signature II 


i o no i nn ^ io ooo on/C 

13. Oo l.UUe-Jo Zoo-oUo 


920 


PR00011B 


Type III EGF-like signature II 


13.08 3.36e-12 376-394 


920 


PR00011B 


Type III EGF-like signature II 


13.08 3.59e-09 331-349 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 2.50e-13 376-394 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 2.86e-12 288-306 


920 


PR00011D 


Type III EGF-like signature IV 


12.12 4.05e-ll 331-349 


920 


PR00764F 


Complement C9 signature VI 


15.74 5. 10e-09 236-256 


920 


PR00907B 


Thrombomodulin signature II 


11.50 8.32e-10 54-70 


921 


IPB000827A 


Small cytokines (intercrine/chemokine), C- 
C subfamily 


8.72 5.24e-ll 35-45 


922 


IPB000834A 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


18.14 1.77e-ll 167-188 


922 


IPB000834B 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


13.51 5.50e-14 201-215 


922 


IPB000834C 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


17.20 6.66e-12 270-286 


922 


IPB000834D 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


18.95 3.25e-23 297-323 


922 


IPB000834E 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


9.80 3.33e-13 325-339 


922 


IPB000834F 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


12.40 9. 18e-09 382-394 


922 


IPB000834G 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


14.46 6.00e-15 415-430 


922 


PR00765B 


Carboxypeptidase A metalloprotease (M14) 
family signature II 


14.48 8.04e-15 197-211 


922 


PR00765C 


Carboxypeptidase A metalloprotease (M14) 
family signature III 


10.88 5.50e-09 277-285 


922 


PR00765D 


Carboxypeptidase A metalloprotease (Ml 4) 
family signature IV 


14.06 4.91e-14 330-343 


923 


IPB002181A 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


18.44 7.00e-ll 64-80 


923 


IPB002181B 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


20.16 4.86e-32 93-129 


923 


IPB002181C 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


15.87 6.88e-13 134-146 


923 


IPB002181D 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


29.18 1.23e-18 153-193 


923 


IPB002181E 


Fibrinogen beta and gamma chains C- 
terminai globular domain 


27.75 9.71e-23 205-237 


927 


IPB000774B 


Domain amino terminal to FKBP-type 
peptidyl -prolyl isomerase 


11.42 1.29e-12 54-81 


927 


IPB000774C 


Domain amino terminal to FKBP-type 
peptidyl-prolyl isomerase 


10.28 7.46e-19 84-113 


927 


IPB001179 


FKBP-type peptidyl-prolyl cis-trans 


22.74 4.60e-26 80-113 



WO 03/054152 



PCT/US02/39555 



Table 3B 
369 



SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 






isomerase (PPIase) 




932 


IPB001359H 


Synapsin 


22.58 6.66e-10 176-226 


936 


PR00436A 


Interleukin-8 signature I 


15.20 8.23e-09 3-26 


937 


IPB000308B 


14-3-3 proteins 


12.24 2.33e-40 77-116 


937 


IPB000308C 


14-3-3 proteins 


18.31 1.00e-36 143-192 


937 


IPB000308D 


14-3-3 proteins 


12.57 1.69e-35 215-269 


937 


PR00305A 


14-3-3 protein zeta signature I 


9.70 9.18e-33 77-106 


937 


PR00305B 


14-3-3 protein zeta signature II 


11.03 4.52e-22 126-150 


937 


PR00305C 


14-3-3 protein zeta signature III 


9.89 5.50e-28 159-181 


937 


PR00305D 


14-3-3 protein zeta signature IV 


17.40 4.81e-27 194-220 


937 


PR00305F 


14-3-3 protein zeta signature VI 


15.40 1.00e-36 248-277 


938 


IPB002048 


EF-hand family 


7.915,24e-10 123-135 


939 


PR01353A 


Glucagon family receptor signature I 


8.87 4.86e-10 5-18 


940 


IPB001875B 


Death effector domain 


13.21 1.00e-08 500-509 


940 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 9.65e-09 182-219 


940 


IPB003530A 


Long hematopoietin receptor, soluble alpha 
chains 


23.64 8.76e-09 339-393 


941 


IPB000483 


Leucine rich repeat C-terminal domain 


11.18 8.11e-14 242-256 


941 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.91e-09 135-148 


941 


PR00364D 


Disease resistance protein signature IV 


14.89 4.60e-09 84-100 


944 


IPB000215A 


Serpins 


13.01 9.3 le-15 200-223 


944 


IPB000215B 


Serpins 


9.87 4.79e-12 296-308 


944 


IPB000215C 


Serpins 


13.90 5.18e-ll 323-337 


944 


IPB000215D 


Serpins 


15.35 9.27e-l 1430-456 


944 


IPB000215E 


Serpins 


15.36 1.00e-15 508-532 


944 


IPB000895A 


Transthyretin precursor 


12.60 4.04e-09 39-58 


944 


PR00743B 


Glycosyl hydrolase family 36 signature II 


14.95 8.83e-09 240-260 


945 


IPB002905C 


N2,N2-dimethylguanosine tRNA 
methyltransferase 


19.39 5.00e-ll 115-136 


947 


IPB000306 


FYVE Zn-finger, rabphilin/VPS27/FAB 1 
type 


8.96 3.25e-18 552-564 


947 


IPB001841 


RING finger 


10.69 3.37e-09 559-568 


950 


IPB001304A 


C-type lectin domain 


17.98 5.50e-ll 117-141 


950 


PR00356D 


Type II antifreeze protein signature IV 


12.93 7.41e-09 174-190 


951 


IPB001304A 


C-type lectin domain 


17.98 2.29e-15 202-226 


951 


PR00356A 


Type II antifreeze protein signature I 


13.29 6.70e-10 197-209 


951 


PR00356B 


Type II antifreeze protein signature II 


12.71 4.63e-10 209-226 


953 


IPB001359H 


Synapsin 


22.58 4.43e-10 16-66 


953 


PR00910A 


Luteovirus ORF6 protein signature I 


2.74 3.46e-09 107-119 


954 


PR00901H 


Pheromone B alpha- 1 receptor signature 
VIII 


14.75 4.05e-09 56-66 


955 


PR00436A 


Interleukin-8 signature I 


15.20 4.39e-09 36-59 


956 


IPB003406B 


Core-2/I-Branching enzyme 


11.63 9.31e-37 94-128 


956 


IPB003406C 


Core-2/I-Branching enzyme 


11.36 7.26e-37 145-184 


956 


IPB003406D 


Core-2/I- Branching enzyme 


23.29 1.82e-33 191-230 


956 


IPB003406E 


Core-2/I-Branching enzyme 


22.75 3.87e-29 25 1-292 | 


958 


IPB000130 


Neutral zinc metallopeptidases, zinc- 
binding region 


5.86 6.54e-ll 236-246 


958 


IPB001S18A 


Matrixin 


14.60 7.35e-28 88-117 


958 


IPB001818B 


Matrixin 


26.48 7.75e-39 136-177 


958 


IPB001818C 


Matrixin 


24.38 1.00e-40 180-225 


958 


IPB001818D 


Matrixin 


14.91 5.34e-30 230-261 


958 


IPB001818E 


Matrixin 


8.86 4.67e-13 269-282 
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958 


IPB001818F 


Matrixin 


11.19 6.85e-22 305-325 


958 


IPB001818G 


Matrixin 


14. 71 2.61e-12 342-354 


958 


IPB001818H 


Matrixin 


15.46 2.89e-28 408-434 


958 


IPB002870E 


Reprolysin family propeptide 


1 1.90 5.22e-10 238-250 


958 


PR00138A 


Matrixin signature I 


12.54 6.79e-16 108-121 


958 


PR00138B 


Matrixin signature II 


14.84 1.56e-17 155-170 


958 


PR00138C 


Matrixin signature III 


20.07 5.74e-28 178-206 


958 


PR00138D 


Matrixin signature IV 


14.57 7.86e-27 236-261 


958 


PR00138E 


Matrixin signature V 


7.10 4.96e-13 269-282 


958 


PR00480B 


Astacin family signature II 


14.35 4.32e-ll 231-249 


958 


PR00997G 


Fragilysin metallopeptidase (M10C) 
enterotoxin signature VII 


11.64 8.53e-09 242-258 


962 


IPB000953 


Chromo domain 


14.73 2.29e-17 63-81 


962 


IPB003606E 


N-terminal to some SET domains 


9.57 2.89e-12 226-235 


962 


IPB003606G 


N-terminal to some SET domains 


13.63 4.00e-10 262-290 


962 


PR00504C 


Chromodomain signature III 


12.01 1.97e-10 72-84 


965 


IPB001729 


Surfactant associated polypeptide SP-C 
palmitoylation sites 


9.43 9.11e-09 19-52 


967 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11.56 9.18e-09 118-129 


967 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 9.22e-10 229-245 


967 


IPB001133A 


NADH-ubiquinone oxidoreductase chain 4L 


13.73 5.50e-10 89-119 


967 


PR00237A 


Rhodopsin-like GPCR superfamily 
signature I 


9.81 8.31e-09 26-50 


967 


PR00245B 


Olfactory receptor signature II 


13.73 3.57e-09 129-141 


967 


PR00245D 


Olfactory receptor signature IV 


9.34 2.59e-13 183-192 


967 


PR00245E 


Olfactory receptor signature V 


8.96 5.15e-16 230-241 


967 


PR00534A 


Melanocortin receptor family signature I 


12.77 2.83e-09 51-63 


968 


IPB002328A 


Zinc-containing alcohol dehydrogenase 


20.74 6.65e-09 179-210 


968 


IPB002328B 


Zinc-containing alcohol dehydrogenase 


14.71 9.03e-12 223-248 


968 


IPB002364 


Quinone oxidoreductase/zeta-crystallin 


20.78 6.67e-12 223-250 


970 


IPB001017A 


Dehydrogenase, El component 


11.25 7.35e-13 126-138 


970 


IPB001017B 


Dehydrogenase, El component 


25.91 2.64e-27 152-184 


970 


IPB001017C 


Dehydrogenase, El component 


15.29 7.00e-22 200-219 


970 


IPB001017D 


Dehydrogenase, El component 


18.59 1.45e-40 256-294 


970 


IPB001017E 


Dehydrogenase, El component 


10.22 3.65e-15 312-325 


970 


IPB001017F 


Dehydrogenase, El component 


17.40 9.31e-23 340-360 


975 


PR01281D 


Corticotropin releasing factor type 2 
receptor signature IV 


9.32 8.52e-09 388-403 


977 


PR00457A 


Animal haem peroxidase signature I 


15.27 6.79e-12 162-173 


977 


PR00457B 


Animal haem peroxidase signature II 


12.43 1.26e-15 216-231 


977 


PR00457C 


Animal haem peroxidase signature III 


18.81 1.51e-14 366-384 


977 


PR00457D 


Animal haem peroxidase signature IV 


18.35 5.50e-22 384-404 


977 


PR00457E 


Animal haem peroxidase signature V 


19.97 4.91e-26 409-435 


977 


PR00457F 


Animal haem peroxidase signature VI 


14.42 6.05e-10 461-471 


977 


PR00457G 


Animal haem peroxidase signature VII 


14.17 3.25e-18 590-610 


977 


PR00457H 


Animal haem peroxidase signature VIII 


14.82 6.19e-13 684-698 


978 


IPB001271 


Mammalian defensin 




985 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.63e-10 145-160 


985 


IPB002174A 


Furin-like cysteine rich region 


30.51 9.03e-10 26-57 


987 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 124-137 


987 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 316-329 


988 


IPB000392A 


NifH/frxC family 


9.08 2.98e-ll 107-122 


988 


IPB000808A 


Mrp family 


23.51 8.67e-37 101-145 
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988 


IPB000808B 


Mrp family 


15.70 7.48e-36 191-227 


988 


IPB000808C 


Mrp family 


23.57 8.77e-29 250-285 


988 


IPB000808D 


Mrp family 


28.05 6.40e-24 297-337 


988 


IPB001984C 


ATP-dependent serine proteases, Lon 
family 


12.76 9.82e-09 103-139 


988 


IPB003348A 


Anion-transporting ATPase 


20.06 7.75e-15 106-143 


988 


PR00091A 


Nitrogenase component II signature I 


7.814.38e-09 109-122 


988 


PR00094A 


Adenylate kinase signature I 


9.62 9.86e-09 107-120 


988 


PR00364A 


Disease resistance protein signature I 


8.29 7.59e-10 105-120 


989 


IPB001007B 


von Willebrand factor, type C repeat 


10.03 4.79e-09 259-268 


989 


IPB001846B 


von Willebrand factor type D domain 


10.86 1.82e-12 489-503 


989 


IPB002919B 


Trypsin Inhibitor-like cysteine rich domain 


21.14 6.45e-10 321-343 


989 


PR00163B 


Rubredoxin signature II 


15.17 9.25e~09 215-231 


992 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 8.48e-10 133-182 


992 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 1.38e-ll 154-176 


994 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.48e-ll 86-123 


994 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.04e-ll 267-304 


995 


IPB001283A 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


16.26 6.67e-10 29-48 


995 


IPB001283B 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


13.09 2.33e-12 57-72 


995 


IPB001283E 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


13.34 5.50e-20 127-147 


995 


IPB001283F 


Extracellular proteins SCP/Tpx-l/Ag5/PR- 
1/Sc7 


10.49 6.48e-ll 160-170 


995 


IPB003106E 


Homeobox associated leucine zipper 


13.73 6.90e-09 25-55 


995 


PR00837A 


Allergen Y5/Tpx-1 family signature I 


14.69 1.56e-14 57-75 


995 


PR00837C 


Allergen V5/Tpx-1 family signature III 


16.31 5.50e-20 126-142 


995 | 


PR00837D 


Allergen V5/Tpx-1 family signature IV 


11.13 3.70e-ll 160-173 


995 


PR00838D 


Venom allergen 5 signature IV 


8.59 3.57e-09 57-75 


995 


PR00838G 


Venom allergen 5 signature VII 


15.39 8.60e-18 125-144 


1000 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 3.25e-13 21-70 


1000 


IPB001003B 


MHC Class II, alpha chain, alpha- 1 domain 


14.72 8.38e-ll 29-72 


1000 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 3.63e-ll 42-64 


1000 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.31e-17 78-115 


1002 


IPB000306 


FYVE Zn-finger, rabphilin/VPS27/FABl 
type 


8.96 9.74e-09 181-193 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.60e-16 213-238 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.38e-18 297-322 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.50e-19 325-350 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.23e-18 185-210 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.23e-18 241-266 


1002 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.75e-20 269-294 


1002 


IPB001275 


DM DNA binding domain 


19.17 1.85e-10 201-240 


1002 


IPB001275 


DM DNA binding domain 


19.17 4.65e-10 313-352 


1002 


IPB001275 


DM DNA binding domain 


19.17 5.50e-10 257-296 


1002 


IPB001275 


DM DNA binding domain 


19.17 6.24e-12 285-324 
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1002 


IPB001275 


DM DNA binding domain 


19.17 8.66e-10 229-268 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.00e-09 210-223 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.38e-09 182-195 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.41e-10 350-363 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.53e-12 238-251 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.95e-ll 294-307 


1002 


PR00048A 


C2H2-type zinc finger signature I 


9.94 2.50e-09 322-335 


1002 


PR00048A 


C2H2-type zinc ringer signature I 


9.94 6.79e-13 266-279 


1002 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.00e-09 198-207 


1002 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.50e-l0 310-319 


1002 


PR00048B 


C2H2-type zinc finger signature II 


5.52 9.00e-09 338-347 


1002 


PR00048B 


C2H2-type zinc finger signature II 


5.52 9.36e-l0 282-291 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 l.OOe- 17 467-492 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.50e-20 523-548 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.80e-16 262-287 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.93e-17 439-464 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.27e-21 383-408 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.60e-16 663-688 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.94e-15 635-660 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.09e-21 411-436 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.91e-21 495-520 


1003 


IPB000822 


Zinc ringer, C2H2 type 


1 A fri iT r\ rT 1 A F\r-r n f- f\ 

14.67 6.25e-19 327-352 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.25e-20 579-604 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.54e-18 355-380 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.79e-17 299-324 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.20e-22 607-632 


1003 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.71e-17 551-576 


1003 


IPB001275 


DM DNA binding domain 


19.17 3.69e-12 371-410 


1003 


IPB001275 


DM DNA binding domain 


19.17 4.21e-ll 399-438 


1003 


IPB001275 


DM DNA binding domain 


19.17 4.21e-ll 483-522 


1003 


IPB001275 


DM DNA binding domain 


19.17 4.89e-10 539-578 


1003 


IPB001275 


DM DNA binding domain 


19.17 5.57e-13 595-634 


1003 


IPB001275 


DM DNA binding domain 


19.17 6.35e-10 455-494 


1003 


IPB001275 


DM DNA binding domain 


19.17 7.56e-ll 511-550 


1003 


IPB001275 


DM DNA binding domain 


19.17 8.42e-10 343-382 


1003 


IPB001275 


DM DNA binding domain 


19.17 9.03e-10 651-690 


1003 


IPB001275 


DM DNA binding domain 


19.17 9.09e-09 287-326 


1003 


IPB001275 


DM DNA binding domain 


19.17 9.14e-13 567-606 


1003 


IPB001275 


DM DNA binding domain 


19.17 9.64e-10 315-354 


1003 


PR00048A 


C2H2-type zinc ringer signature I 


9.94 1.00e-08 548-561 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.82e-10 259-272 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 2.06e-12 464-477 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 2.93e-13 604-617 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 3.25e-09 324-337 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.27e-10 520-533 I 


1003 


PR00048A 


C2H2-type zinc finger signature I 


y.y4 4.o0e-lJ joU-Dyj 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.15e-14 576-589 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 408-421 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.73e-10 436-449 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.23e-14 492-505 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.35e-12 352-365 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.88e-09 632-645 


1003 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.53e-ll 660-673 
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1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.50e-09 676-685 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.57e-10 508-517 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.57e-10 536-545 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.57e-10 620-629 


1003 


PR00048B 


C2H2~type zinc finger signature II 


5.52 4.00e-09 592-601 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 4.86e-l0 312-321 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 4.86e-l0 396-405 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.l4e-l0 340-349 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.75e-ll 424-433 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.00e-09 368-377 


1003 


PR00048B 


C2H2-type zinc finger signature II 


5.52 9.00e-09 564-573 


1007 


IPB001781 


LIM domain 


11.42 1.00e-09 536-546 


1007 


IPB001781 


LIM domain 


11.42 8. lie- 12 469-479 


1007 


IPB001781 


LIM domain 


11.42 9.59e-ll 411-421 


1010 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.88e-09 156-181 


1011 


PR00456F 


Ribosomal protein P2 signature VI 


5.70 6.76e-12 76-87 


1016 


IPB002087A 


Anti-proliferative protein 


15.38 5.68e-13 14-28 


1016 


IPB002087B 


Anti-proliferative protein 


19.44 6.88e-20 122-155 


1016 


IPB002087C 


Anti-proliferative protein 


13.79 1.00e-23 180-200 


1016 


PR00310A 


Anti-proliferative protein BTG1 family 
signature I 


10.74 2.45e-28 16-40 


1016 


PR00310B 


Anti-proliferative protein BTG1 family 
signature II 


10.25 6.14e-24 123-152 


1016 


PR00310C 


Anti-proliferative protein BTG1 family 
signature III 


12.94 8.41e-36 153-182 


1016 


PR00310D 


Anti-proliferative protein BTG1 family 
signature IV 


9.99 4.71e-38 183-212 


1016 


PR00310E 


Anti-proliferative protein BTG1 family 
signature V 


14.23 2.06e-24 229-248 


1017 


IPB000306 


FYVE Zn-fmger, rabphiliii/VPS27/FABl 
type 


8.96 9.74e-09 491-503 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.64e-17 41 1-436 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.64e-17 523-548 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.00e-24 383-408 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.25e-20 355-380 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.27e-21 467-492 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.86e-ll 215-240 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.94e-14 271-296 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-09 121-146 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.79e-17 439-464 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.50e-20 327-352 \ 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.62e-18 495-520 


1017 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.71e-10 149-174 


1017 


IPB001275 


DM DNA binding domain 


19.17 5.24e-ll 343-382 


1017 


IPB001275 


DM DNA binding domain 


19.17 5. 38e-10 483-522 


1017 


IPB001275 


DM DNA binding domain 


19.17 6.35e-09 315-354 


1017 


IPB001275 


DM DNA binding domain 


19.17 o.0oe-U9 427-466 


1017 


IPB001275 


DM DNA binding domain 


19.17 8.18e-09 455-494 


1017 


IPB001275 


DM DNA binding domain 


19.17 8.78e-10 511-550 


1017 


IPB001275 


DM DNA binding domain 


19.17 9.61e-ll 371-410 


1017 


IPB001909 


KRAB box 


17.37 2.50e-33 44-78 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.00e-13 408-421 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.09e-10 324-337 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.24e-12 436-449 
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1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 380-393 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.73e-10 492-505 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.79e-13 464-477 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.88e-12 352-365 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.13e-09 268-281 


1017 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.10e-15 520-533 


1017 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.50e-09 340-349 


1017 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.25e-ll 396-405 


1017 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.25e-ll 508-517 


1017 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.00e-ll 368-377 


1017 


PR00048B 


C2H2-type zinc finger signature II 


5.52 9.36e-10 480-489 


1020 


IPB001781 


LIM domain 


11.42 1.69e-10 160-170 


1020 


IPB001781 


LIM domain 


11.42 1.95e-12 219-229 


1020 


IPB001781 


LIM domain 


11.42 5.26e-12 99-109 


1022 


PR01217G 


Proline rich extensin signature VII 


4.02 6.61e~09 130-155 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.00e-16 410-435 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.25e-19 326-351 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.25e-20 214-239 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.77e-18 242-267 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.00e-19 382-407 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.75e-20 270-295 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.94e-14 298-323 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-09 158-183 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.00e-13 186-211 


1024 


IPB000822 


Zinc finger, C2H2 type 


14.67 8.07e-17 354-379 


1024 


IPB001275 


DM DNA binding domain 


19.17 1.68e-09 398-437 


1024 


IPB001275 


DM DNA binding domain 


19.17 3.07e-10 314-353 


1024 


IPB001275 


DM DNA binding domain 


19.17 5.37e-ll 370-409 


1024 


IPB001275 


DM DNA binding domain 


19.17 5.74e-10 258-297 


1024 


IPB001275 


DM DNA binding domain 


19.17 6.10e-12 202-241 


1024 


IPB001275 


DM DNA binding domain 


19.17 8.12e-12 342-381 


1024 


IPB001275 


DM DNA binding domain 


19.17 9.66e-09 230-269 


1024 


IPB001502E 


Endonuclease III 


23.53 7.04e-09 252-279 


1024 


IPB001909 


KRABbox 


17.37 9.00e-33 17-51 


1024 


IPB002867C 


Cysteine-rich domain (C6HC) 


19.46 3.37e-09 323-340 


1024 


IPB002867C 


Cysteine-rich domain (C6HC) 


19.46 9.05e-09 267-284 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.82e-10 211-224 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.90e-15 351-364 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 2.23e-10 379-392 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.29e-12 323-336 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.43e-13 267-280 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.55e-10 295-308 


1024 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.58e-l 1407-420 


1024 


PR00048A 


C2H2~type zinc finger signature I 


9.94 8.62e-14 239-252 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.50e-10 227-236 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.00e-09 423-432 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e-10 283-292 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e-10 339-348 


1024 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.07e-10 395-404 


1025 


PR00966H 


Potyvirus nuclear inclusion A cysteine 
protease (C4) signature VIII 


13.43 9.45e-09 215-234 


1026 


IPB000315A 


B-box zinc finger superfamily 


24.51 4.67e-12 19-42 


1026 


IPB001293A 


TRAF-type zinc finger 


26.83 8.34e-10 10-54 



WO 03/054152 PCT/US02/39555 



Table 3B 
375 



SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 


1026 


IPB003649A 


B-Box C-terminal domain 


13.56 7.92e-12 26-39 


1026 


PR00194D 


Tropomyosin signature IV 


9.54 7.2le-l0 150-173 


1026 


PR01511F 


Kvl.4 voltage-gated K+ channel signature 
VI 


5.l9 4.54e-09 141-155 


1029 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 6.l3e-l0 91-136 


1029 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 6.23e-09 71-116 


1029 


IPB000118A 


Granulin 


8.25 8.84e-09 141-163 


1029 


IPB000359B 


Cystine-knot domain 


19.26 9.44e-09 181-199 


1029 


IPB000726B 


Glycoside hydrolase family 19 


5.39 2.54e-09 66-76 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21.88 6.54e-09 105-145 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21.88 7.83e-10 183-223 


1029 


IPB000967E 


Zinc finger NF-X1 type 


21.88 9.04e-09 136-176 


1029 


IPB001169K 


Integrin beta, C-terminus 


27.45 6.96e-09 107-149 


1029 


IPB001169K 


Integrin beta, C-terminus 


27.45 7.93e-09 56-98 


1029 


IPB001762A 


Disintegrin 


23.93 6.47e-09 185-225 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 2.84e-10 102-135 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.08e-13 164-197 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.25e-10 66-99 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.25e-10 92-125 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


•1 /-» A A **> A S~ -1-1 11^ 1 A C 

12.44 3.46e-ll 112-145 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.68e-ll 169-202 


1029 j 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.89e-12 122-155 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 4.00e-14 159-192 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 4.39e-09 174-207 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


•f A A A A /'A 11 Ifl^ 

12.44 4.64e-ll 206-239 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


-t f\ A A IT "1 r\ 1/"V1 AH 1 A f\ 

12.44 5.19e-10 107-140 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.65e-09 91-124 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.04e-14 76-109 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.19e-13 81-114 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.71e-09 8-41 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.88e-15 117-150 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7.00e-09 221-254 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7.10e-09 67-100 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7.48e-09 222-255 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.16e-09 216-249 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.33e-12 196-229 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.38e-13 71-104 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.84e-09 197-230 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 9.03e-09 86-119 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


ir\ a A r\ *-% a 1 f\ 1 e A 1 O ^7 

12.44 9.28e-10 154-187 


1029 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 9.32e-09 57-90 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 1.13e-09 70-84 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 2.01e-09 136-150 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 3.15e-10 22-36 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 5.94e-09 215-229 


1029 


TT»nAA1 Af\ A T"> 

IPB002494B 


Keratin, high sulfur B2 protein 




1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 6.58e-09 163-177 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 7. 85e-l 0 210-224 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.23e-09 234-248 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.66e-10 95-109 


1029 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 9.62e-09 121-135 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.10e-14 31-74 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 l.lle-17 69-112 
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1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A f 1 f\f~1 1 1 11 T r\ 

14.46 1.27e-ll 27-70 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a AS" 1 1 'I 1 1 CC\ 1 AO 

14.46 1.27e-ll 59-102 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A /T 1 of\_ 11 1 C\(~\ 1 A 1 

14.46 1.39e-13 199-242 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a A f 1 S~ A 11 11/1 ~\ C 'H 

14.46 1.64e-ll 114-157 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


■t A A S" 1 H 1 C "\ f\ A 1 A T 

14.46 1.72e-15 104-147 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.79e-10 204-247 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i/T o 111 r\r\ 1 ai 

14.46 2.22e-12 100-143 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.35e-09 135-178 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.43e-09 177-220 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.80e~18 115-158 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 3.16e-19 162-205 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 i a ** 1 n -i a 1 i/\ 1 /~i 

14.46 3.20e-14 120-163 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 m A *< *W AC* 1 O tf\ A *1 ^7 

14.46 3.45e-13 94-137 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 3.79e-15 167-210 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 3.94e-09 85-128 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 4.03e-13 26-69 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 4.42e-13 64-107 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


t A AS" A l—t^ 1 O f\C 1 i o 

14.46 4.72e-13 95-138 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 4.83e-15 84-127 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A f C f\f\ 1 A 1 11 f 

14.46 5.00e-14 172-215 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.37e-09 183-226 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.38e-20 209-252 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


■t A A *~ f c r\ 1 r\ /-» 1 t~\ 1 S" 1 

14.46 5.50e-12 219-262 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.71e-09 147-190 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


h a a /*- c *nf\ r\r\ t f\ 111 

14.46 5.79e-09 70-113 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.09e-13 99-142 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


a a a s~ ?y *w "t f~\ r\f\ •too 

14.46 6.33e-10 90-133 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.42e-10 32-75 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


-« a a s~ >~i 1 a r\c\ 1/11 loc 

14.46 7.14e-09 142-185 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A /~ l-J -t 11 1 OA I'll 

14.46 7.16e-13 130-173 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


t a /</'"'-7io_ i/"\ic-7i/"\r\ 

14.46 7.38e-19 157-200 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.39e-20 79-122 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.15e-09 16-59 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.15e-09 80-123 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.29e-19 110-153 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


^ m a j~ n a*" -i*i -1 r\ i i~ s*\ 

14.46 8.36e-ll 109-152 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.41e-12 125-168 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.45e-15 89-132 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.57e-09 189-232 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A S~ O O ^ _ 1C11 C A 

14.46 8.86e-15 11-54 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 8.92e-13 152-195 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.13e-17 21-64 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.16e-21 74-117 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.27e- 11 6-49 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.71e-13 36-79 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A tC C\ IaT— AO A 1 O/l 

14.4o 9. /De-UV 41-o4 


1029 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.80e-14 105-148 


1029 


IPB002919A 


Trypsin Inhibitor-like cysteine rich domain 


15.56 5.71e-09 24-36 


1031 


IPB002130A 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


13.87 2.17e-15 18-39 


1031 


IPB002130B 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


21.20 5.85e-39 48-86 


1031 


IPB002130C 


Cyclophilin-type peptidyl-prolyl cis-trans 


16.92 1.00e-40 98-135 



WO 03/054152 



PCT/US02/39555 



Table 3B 
377 



SEQ 

ID 

NO: 


Database 
entry ID 


Description 


Results* 






isomerase 




1031 


PR00153A 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature I 


14.60 2.71e-10 24-39 


1031 


PR00153B 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature II 


12.94 6.29e-17 53-65 


1031 


PR00153C 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature III 


10.79 2.64e-20 96-111 


1031 


PR00153D 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature IV 


10.73 L47e-15 111-123 


1031 


PR00153E 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature V 


8.39 l.43e-l5 1 24-139 


1032 


PR00010C 


Type II EGF-like signature III 


i' AO £* "IT— f\{*\ AC C C 

6.98 5.17e-09 45-55 


1035 


IPB000822 


Zinc finger, C2H2 type 


1 A f~l 1 /Tn„ 1 O T7/1 1AA 

14.67 1.69e-lo 274-zyy 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.29e-l / lUZ-lz / 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.29e-l / 3U2-3Z / 


1035 


IPB000822 


Zinc finger, C2H2 type 


1 A £'7 1 l ~T~l ^ 1 O 1 a a 11C 

14.67 3.77e-18 19U-Z1D 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 3 . 8 1 e- 1 5 414-439 


1035 


IPB000822 


Zinc finger, C2H2 type 


1 A /* "7 A ^7 C ~. 1f\ 11 O 1/11 

14.67 4.75e-19 218-243 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 4.75e-19 442-467 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-15 162-187 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.25e-19 386-411 


1035 


IPB000822 


Zinc finger, C2H2 type 


i a s~i-7 s~ si e 1/1 1^0 1 Ol 

14.67 6.25e-20 358-383 


1035 


IPB000822 


Zinc finger, C2H2 type 


•t A S~T /" C f\ lO 1/1/' 1*71 

14.67 6.50e-13 246-271 


1035 


IPB000822 


Zinc finger, C2H2 type 


-i a s'i—s s~ >~ir\^ 11 i i r\ i c c 

14.67 6.79e-17 330-355 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.00e-19 470-495 


1035 


IPB000822 


Zinc finger, C2H2 type 


14.67 7.23e-18 498-523 


1035 


IPB000822 


Zinc finger, C2H2 type 


-1 A f~t O f\l—t 1 *"7 CI /"* C C 1 

14.67 8.07e-17 526-551 


1035 


IPB000822 


Zinc finger, C2H2 type 


i a s'ry o c f\ ^ 1 n i i /i 1 rn 

14.67 8.50e-19 134-159 


1035 


IPB001275 


DM DNA binding domain 


-i r\ -1 *f 1 ^ A Art f\ 1 A C 

19.17 1.34e-09 206-245 


1035 


IPB001275 


DM DNA binding domain 


19.17 2.70e-10 178-217 


1035 


IPB001275 


DM DNA binding domain 


19.17 3.55e-10 374-413 


1035 


IPB001275 


DM DNA binding domain 


19.17 4.08e-09 346-385 


1035 


IPB001275 


DM DNA binding domain 


t f\ n f O £T „ in /ICO /f AT 

19. 17 5.86e-10 4 j 8-49 / 


1035 


IPB001275 


DM DNA binding domain 


19.17 8.78e-lU 318-30 / 


1035 


IPB001275 


DM DNA binding domain 


10 1 t n 1 c„ i a i aa no 

19.17 9.15e-lU Z9U-5zy 


1035 


IPB001275 


DM DNA binding domain 


19.17 9.88e-lU 48o-dzd 


1035 


IPB001909 


KRAB box 


17.37 3.86e-28 o-4U 


1035 


PR00048A 


C2H2-type zinc finger signature I 


r\r\/i i /i 1 „ 1 A 1 0*7 or\n 

9.94 1.41e-10 18/-2UU 


1035 


PR00048A 


C2H2-type zinc finger signature I 


r\ >1 i ia„ 11 inr\ ill 

9.94 2.29e-13 299-312 


1035 


PR00048A 


C2H2-tvpe zinc finger signature I 


r% r\ A s> A rs 11 111 "i A A 

9.94 2.42e-ll 131-144 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 2.64e-l0 383-396 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 3.45e-10 411-424 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.l8e-l2 495-508 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.21e-13 523-536 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.09e-10 551-564 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.14e-13 327-340 


1035 


PR00048A 


C2H2-type zinc finger signature I 


O OA & 1 A*=» 1 "X QQ 119 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 215-228 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.54e-14 467-480 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.82e-12 243-256 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.92e-14 355-368 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.58e-ll 159-172 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.05e-l 1439-452 


1035 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.36e-13 271-284 
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1035 


PR00048B 


C2H2-type zinc finger signature II 


c ci i r\r\~ f\o aoi /tno 

5.52 1.00e-0o 4o3-49Z 


1035 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.00e-12 115-124 


1035 


PR00048B 


C2H2-type zinc fmger signature II 


5.52 5.50e-10 203-212 


1035 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5,50e-10 28 7-290 


1035 


PR00048B 


C2H2-type zinc finger signature II 


C CI C ^/*v„ 1 1 ACC A (Z A 

5.52 5.50e-ll 455-464 


1035 


PR00048B 


C2H2-type zinc finger signature II 


r* C 1 i-l f A t\C\ OT1 O OO 

5.52 7.50e-09 371-380 


1035 


PR00048B 


C2H2-type zinc finger signature II 


5.52 8.00e-09 147-156 


1035 


PR00048B 


C2H2-type zinc finger signature II 


c ci o c r\ ~ c\c\ onn ,i ao 

5.52 8.50e-09 399-408 


1038 


IPB001902A 


Sulfate transporter 


10.29 9.57e-10 310-319 


1038 


IPB001902B 


Sulfate transporter 


21.91 3.43e-31 340-390 


1038 


IPB001902C 


Sulfate transporter 


23.62 6.44e-25 429-480 


1038 


IPB002645B 


STAS domain 


13.24 1.43e-ll 179-193 


1038 


IPB002645B 


STAS domain 


13.24 1.43e-ll 622-636 


1039 


IPB000626 


Ubiquitin domain 


21.25 7.65e-09 56-110 


1041 


IPB000729A 


PMP-22/EMP/MP20 family 


19.70 9.05e-09 47-68 


1042 


IPB003530A 


Long hematopoietin receptor, soluble alpha 
chains 


23.64 3.01e-09 362-416 


1042 


PR01474F 


Vascular cell adhesion molecule- 1 (VCAM- 
1) signature VI 


14.81 9.48e-09 389-402 


1043 


IPB001526A 


Ly-6/u-PAR domain 


13.24 3.65e-13 73-88 


1043 


IPB001526C 


Ly-6/u-PAR domain 


13.04 6.82e-13 142-157 


1043 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 3.93e-09 149-160 


1047 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 5. 99e-10 203-252 


1047 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 1.00e-40 15-68 


1047 


IPB001039B 


Major histocompatibility complex protein, 
Class I 


27.55 1.00e-40 96-147 


1047 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 1.00e-40 177-230 


1047 


IPB001039D 


Major histocompatibility complex protein, 
Class I 


16.49 4.43e-38 255-309 


1047 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 3.84e-15 224-246 


1047 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.25e-25 261-298 


1047 


IPB003363E 


Glycoprotein GG/GX 


13.35 2.94e-ll 308-340 


1051 


IPB000729D 


PMP-22/EMP/MP20 family 


i o r\/Z '1 1 A„ 1 A 1 TO OAA 

18.96 7.30e-10 173-200 


1052 


IPB001526C 


Ly-6/u-PAR domain 


11 f\ A 1 T7„ f\Ci AH £Z1 

13.04 1.77e-09 46-oi ! 


1054 


PR00258B 


Speract receptor signature II 


7.94 9.50e-16 738-/49 


1054 


PR00258C 


Speract receptor signature III 


9.05 2.32e-10 753-763 


1054 


PR00258D 


Speract receptor signature IV 


14.29 6.63e-09 785-/99 


1054 


PR00258E 


Speract receptor signature V 


14.06 8.45e-14 oUo-ozu 


1055 


IPB001429A 


ATP P2X receptor 


\1M l.O0e-38 12-50 


1055 


IPB001429B 


ATPP2X receptor 


_ ja a ni - 11 ZT1 *"7 1 

5.49 2.7le-12 62-71 


1055 


IPB001429C 


Air Jr ZA receptor 


1Q Ofi S 88-118 


1055 


IPB001429D 


ATP P2X receptor 


20.63 2.38e-19 143-164 


1055 


IPB001429E 


ATP P2X receptor 


15.32 1.43e-30 178-205 


1055 


IPB001429F 


ATP P2X receptor 


20.01 5.88e-30 248-277 


1055 


IPB001429G 


ATP P2X receptor 


9.93 2.64e-14 291-300 


1055 


IPB001429H 


ATP P2X receptor 


12.67 7.00e-34 309-335 


1055 


IPB001675B 


Glycosyltransf erase family 29 


15.84 9.26e-09 339-361 


1055 


PR01307A 


P2X purinoceptor family signature I 


10.92 2.80e-12 82-90 
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1055 


PR01307B 


P2X purinoceptor family signature II 


1030 1.00e-15 157-168 


1055 


PR01307C 


P2X purinoceptor family signature III 


10.17 3.77e-15 243-255 


1055 


PR01307D 


P2X purinoceptor family signature IV 


10.26 1.56e-15 290-300 


1055 


PR01307E 


P2X purinoceptor family signature V 


15.20 2.29e-19 310-324 


1055 


PR01311A 


P2X4 purinoceptor signature I 


7.73 8.32e-10 1-8 


1055 


PR01311C 


P2X4 purinoceptor signature III 


9.00 1.90e-12 30-38 


1055 


PR01311D 


P2X4 purinoceptor signature IV 


12.87 1.12e-10 49-55 


1055 


PR01311E 


P2X4 purinoceptor signature V 


6.43 1.00e-09 139-146 


1055 


PR01311F 


P2X4 purinoceptor signature VI 


9.15 1.15e-10 208-216 


1055 


PR01311G 


P2X4 purinoceptor signature VII 


11.74 8.00e-14 280-290 


1055 


PR01311H 


P2X4 purinoceptor signature VIII 


12.12 1.00e-12 303-312 


1055 


PR01311I 


P2X4 purinoceptor signature IX 


5.25 1.00e-ll 359-366 


1059 


PR00854A 


Prostaglandin D receptor signature I 


15.24 9.08e-18 6-20 


1059 


PR00854B 


Prostaglandin D receptor signature II 


7.30 8.15e-21 41-58 


1059 


PR00854C 


Prostaglandin D receptor signature III 


12.92 1.64e-12 93-104 


1059 


PR00854D 


Prostaglandin D receptor signature IV 


9.41 2.50e-18 185-200 


1059 


PR00854E 


Prostaglandin D receptor signature V 


10.50 4.65e-26 236-259 


1059 


PR00854F 


Prostaglandin D receptor signature VI 


12.83 9.68e-ll 321-332 


1059 


PR00854G 


Prostaglandin D receptor signature VII 


10.66 L78e-18 341-357 


1059 


PR00854H 


Prostaglandin D receptor signature VIII 


14.71 6.20e-17 369-389 


1059 


PR00856E 


Prostacyclin (prostanoid IP) receptor 
signature V 


9.82 1.72e-09 178-194 


1060 


IPB001898A 


Sodium: sulfate symporter family 


13.06 5.30e-21 49-72 


1060 


IPB001898B 


Sodium:sulfate symporter family 


13.18 8.41e-22 88-114 


1060 


IPB001898C 


Sodium: sulfate symporter family 


9.97 1.00e-23 132-151 


1060 


IPB001898D 


Sodiurrusulfate symporter family 


12.13 1.00e-21 208-232 


1060 


IPB001898E 


Sodium: sulfate symporter family 


13.97 6.76e-21 248-272 


1060 


IPB001898F 


Sodium:sulfate symporter family 


18.66 6.04e-20 309-335 


1060 


IPB001898G 


Sodium:sulfate symporter family 


16.36 8.36e-24 405-434 


1060 


IPB001898H 


Sodium: sulfate symporter family 


21.26 1.00e-40 486-534 


1063 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 8.89e-ll 19-72 


1064 


IPB001171A 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


11.99 1.55e-10 11-24 


1064 


IPB001171B 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


14.81 6.88e-15 166-178 


1064 


IPB001171C 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


26.44 4.75e-33 181-217 


1064 


IPB001171D 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


20.69 1.00e-40 241-282 


1064 


IPB001171G 


Ergosterol biosynthesis ERG4/ERG24 
enzymes 


25.35 1.00e-40 298-350 


1065 


IPB003388E 


Reticulon 


14.24 6.29e-09 930-972 


1066 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 2.04e-09 264-301 


1067 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11.56 5.20e-12 105-116 


1067 


PR00237C 


Rhodops in-like GPCR superfamily 
signature III 


14.77 6.73e-ll 91-113 


1067 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.77e-10 175-198 


1067 


PR00245A 


Olfactory receptor signature I 


10.98 3.65e-09 79-90 


1067 


PR00245B 


Olfactory receptor signature II 


13.73 4.60e-09 116-128 


1067 


PR00245D 


Olfactory receptor signature IV 


9.34 1.53e-13 212-221 


1067 


PR00245E 


Olfactory receptor signature V 


8.96 3.30e-10 259-270 
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1067 


PR00534A 


Melanocortin receptor family signature I 


12.77 8.43e-09 38-50 


1069 


IPB000095E 


PAK-box /P21-Rho-binding 


17.62 3.52e-l 1410-455 


1069 


IPB000961C 


Protein kinase C-terminal domain 


15.48 9.39e-14 409-443 


1069 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 6.54e-15 402-442 


1069 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 467-505 


1069 


IPB003527C 


MAP kinase 


14.70 7.37e-10 394-442 


1069 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 2.00e-13 92-103 


1069 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 302-349 


1069 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 1.00e-40 398-439 


1069 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21.14 1.64e-40 441-480 


1069 


IPB003605F 


GS motif preceding kinase domain in TGF 
beta receptor 


23.21 1.00e-40 504-557 


1069 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 5.58e-09 549-571 


1070 


IPB000729C 


PMP-22/EMP/MP20 family 


37.83 1.71e-09 80-132 


1070 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 9.33e-10 156-183 


1071 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 7.52e-14 201-250 


1071 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 2.80e-26 14-67 


1071 


IPB001039B 


Major histocompatibility complex protein, 
Class I 


27.55 5.50e-23 90-141 


1071 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 7.75e-16 176-229 


1071 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


19.82 7.94e-23 175-228 


1071 


IPB001039D 


Major histocompatibility complex protein, 
Class I 


16.49 7.43e-29 254-308 


1071 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17.51 6. 82e- 17 222-244 


1071 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.40e-19 260-297 


1072 


IPB002146 


ATP synthase B/B' CF(0) 


21.39 6.94e-09 188-226 


1072 


PR00308B 


Type I antifreeze protein signature II 


3.38 8.99e-09 3-14 


1072 


PR00308C 


Type I antifreeze protein signature III 


2.79 6.92e-09 3-12 


1072 


PR00308C 


Type I antifreeze protein signature III 


2.79 9.59e-10 4-13 


1072 


PR00308C 


Type I antifreeze protein signature III 


2.79 9.59e-10 5-14 


1072 


PR00698E 


C.elegans Srg family integral membrane 
protein signature V 


14.65 2.76e-09 111-136 


1072 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3.91 2.29e-09 4-14 


1072 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3.91 2.88e-09 5-15 


1073 


IPB002617C 


Vesicular monoamine transporter 


16.23 5.03e-10 198-250 


1073 


PR01035E 


Tetracycline resistance protein signature V 


10.87 1.00e-08 208-230 


1073 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 l.OOe-OS 499-519 


1074 


IPB002617C 


Vesicular monoamine transporter 


16.23 5. 03e- 10 286-338 


1074 


PR01035E 


Tetracycline resistance protein signature V 


10.87 1.00e-08 296-318 


1074 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 1.00e-08 587-607 


1075 


IPB000906A 


ZU5 domain 


22.49 3.72e-09 58-100 
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1075 


IPB000906G 


ZU5 domain 


25.85 7.95e-10 164-212 


1075 


PR00806C 


Vinculin signature III 


10.57 8.56e-09 350-367 


1076 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 3.25e-09 188-210 


1076 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.86e-09 276-299 


1076 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 1.25e-10 322-346 


1077 


IPB003662A 


General substrate transporters 


18.97 2.89e-09 139-171 


1079 


IPB000101A 


Gamma-glutamyltranspeptidase 


29.14 3.45e-14 352-391 


1079 


PR01210A 


Gamma- glutamyl trans peptidase signature I 


17.15 1.50e-13 130-155 


1080 


IPB000684L 


Eukaryotic RNA polymerase II 
heptapeptide repeat 


3.49 8.35e-09 1357-1399 


1080 


IPB001237C 


43 Kd postsynaptic protein 


5.44 4.63e-ll 28-80 


1080 


IPB001359H 


Synapsin 


22.58 8.69e-09 1371-1421 


1081 


IPB002688A 


Sterol/diacylglycerol O-acyltransferase 


19.24 8.65e-20 259-281 


1081 


IPB002688B 


Sterol/diacylglycerol O-acyltransferase 


22.16 2.50e-25 308-347 


1081 


IPB002688C 


Sterol/diacylglycerol O-acyltransferase 


22.98 3.13e-20 321-368 


1081 


IPB002688D 


Sterol/diacylglycerol O-acyltransferase 


24.36 3.57e-33 380-415 


1081 


IPB002688E 


Sterol/diacylglycerol O-acyltransferase 


13.67 2.50e-18 433-453 


1082 


PR01547A 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature I 


10.65 6.40e-24 53-74 


1082 


PR01547B 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature II 


17.77 2.38e-19 97-116 


1082 


PR01547C 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature III 


9.50 5.94e-23 118-139 


1082 


PR01547D 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature IV 


12.09 5.50e-29 144-169 


1082 


PR01547E 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature V 


10.42 1.95e-26 171-195 


1082 


PR01547F 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VI 


11.64 5.95e-25 258-278 


1082 


PR01547G 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VII 


7.91 1.00e-36 298-325 


1082 


PR01547H 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature VIII 


9.70 1.00e-30 333-357 


1082 ! 


PR01547I 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature IX 


15.37 7.94e-28 408-438 


1082 


PRO 1547 J 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature X 


14.06 3.15e-23 439-460 


1082 


PR01547K 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XI 


13.61 7.87e-23 462-481 


1082 


PR01547L 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XII 


10.35 6.29e-19 486-502 


1082 


PR01547M 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XIII 


10.06 3.44e-12 1178-1196 


1083 


IPB000095E 


PAK-box /P21-Rho-binding 


i /.oz j.ue-iz oo /-H iz 


1083 


IPB000961C 


Protein kinase C-terminal domain 


15.48 4.66e-14 366-400 


1083 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 5.50e-14 359-399 


1083 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 424-462 


1083 


IPB001526C 


Ly-6/u-PAR domain 


13.04 1.00e-08 105-120 


1083 


IPB003527C 


MAP kinase 


14.70 8.81e-10 351-399 


1083 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 1.32e-ll 112-123 
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1083 


IPB003605B 


GS motif preceding kinase domain in TGF 
beta receptor 


10.56 5.00e-28 229-255 


1083 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 259-306 


1083 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 1.00e-40 355-396 


1083 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21.14 8.88e-39 398-437 


1083 


IPB003605F 


GS motif preceding kinase domain in TGF 
beta receptor 


23.21 6.81e-40 461-514 


1083 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 2.15e-09 535-557 


1083 


PR00653D 


Activin type II receptor signature IV 


12.01 6.70e-ll 385-406 


1084 


IPB000095E 


PAK-box /P21-Rho-binding 


17.62 3.15e-12 429-474 


1084 


IPB000961C 


Protein kinase C-terminal domain 


15.48 4.66e-14 428-462 


1084 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 5.50e-14 421-461 


1084 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.18e-09 486-524 


1084 


IPB001526C 


Ly-6/u-PAR domain 


13.04 1.00e-08 105-120 


1084 


IPB003527C 


MAP kinase 


14.70 8.81e-10 413-461 


1084 


IPB003605A 


GS motif preceding kinase domain in TGF 
beta receptor 


13.79 1.32e-ll 112-123 


1084 


IPB003605B 


GS motif preceding kinase domain in TGF 
beta receptor 


10.56 5.00e-28 291-317 


1084 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 1.00e-40 321-368 


1084 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 1.00e-40 417-458 


1084 


IPB003605E 


GS motif preceding kinase domain in TGF 
beta receptor 


21,14 8. 88e-39 460-499 


1084 


IPB003605F 


GS motif preceding kinase domain in TGF 
beta receptor 


23.21 6.81e-40 523-576 


1084 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 2.15e-09 597-619 


1084 


PR00653D 


Activin type II receptor signature IV 


12.01 6.70e-ll 447-468 


1085 


PR00541C 


Muscarinic M4 receptor signature III 


8.06 7.73e-09 486-506 


1086 


PR00541C 


Muscarinic M4 receptor signature III 


8.06 7.73e-09 755-775 


1087 


PR01430A 


Protease activated receptor 2 precursor 
signature I 


8.72 8.20e-09 123-134 


1088 


IPB000322A 


Glycoside hydrolase family 3 1 


19.86 8.13e-23 151-176 


1088 


IPB000322B 


Glycoside hydrolase family 3 1 


14.24 9.64e-10 186-201 


1088 


IPB000322C 


Glycoside hydrolase family 3 1 


22.80 3.57e-20 225-250 


1088 


IPB000322D 


Glycoside hydrolase family 31 


17.98 6.57e-27 326-352 


1088 


IPB000322E 


Glycoside hydrolase family 3 1 


25.60 9.10e-33 366-407 


1090 


IPB000135A 


High mobility group proteins HMG1 and 
HMG2 


11.69 7.66e-14 707-761 


1090 


IPB000135B 


High mobility group proteins HMG1 and 
HMG2 


13.24 5.68e-21 680-724 


1090 


PR00886A 


High mobility group (HMG1/HMG2) 
protein signature I 


10.57 4.46e-10 710-732 


1090 


PR00886C 


High mobility group (HMG1/HMG2) 
protein signature III 


11.20 6.28e-13 696-714 


1090 


PR00887A 


Structure-specific recognition protein 
signature I 


12.71 1.82e-21 343-359 


1090 


PR00887B 


Structure-specific recognition protein 


10.24 1.35e-20 365-381 
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signature II 




1090 


PR00887C 


Structure-specific recognition protein 
signature III 


13.32 1.47e-21 388-404 


1090 


PR00887D 


Structure-specific recognition protein 
signature IV 


16.42 2.73e-15 453-466 


1090 


PR00887E 


Structure-specific recognition protein 
signature V 


10.75 3.40e-20 480-498 


1090 


PR00887F 


Structure-specific recognition protein 
signature VI 


14.12 6.46e-21 498-515 


1090 


TVT> A A O O 'H 

PR00887G 


Structure-specific recognition protein 
signature VII 


13.65 7.00e-18 521-537 


luyu 


T)D A AO C7TU 

JrKUUoo /xi 


Structure-specific recognition protein 
signature VIII 


12.27 3.53e-20 537-555 




lrr>UUUZlDA 


Serpins 


13.01 4.77e-13 341-364 




TBDnAAO 1 CD 


Serpins 


r\ ot c\ f O f\r\ A a r\ A s~ 1 

9.87 9.68e-09 449-461 


1091 


IPB000215C 


Serpins 


13.90 7. lie- 11 476-490 


1 Am 

1091 


IPB000215D 


Serpins 


15.35 3.88e-16 578-604 


1091 


1PB000215E 


Serpins 


15.36 6.21e-16 659-683 


i a ao 

1092 


TT»T> AAA n^T A 

1PB000126A 


Serine proteases, V8 family 


1 1.75 2.88e-10 357-372 


1 aai 

1092 


TnT>AAA1 

1PB000126B 


Serine proteases, V8 family 


12.50 1.51e-ll 471-487 


1 aao 
1092 


PR00834D 


HtrA/DegQ protease family signature IV 


11.75 6.23e-09 466-483 


1 nni 
1092 


t»t> aa o o nr» 

PR00839B 


V8 serine protease family signature II 


11.20 8.12e-10 357-374 


1097 


TT>T> A A A A C A 

1PB000454 


T"J 1 _ j "1 11 1 L r in-. 

Eubactenal and plasma membrane ATP 
synthase subunit C 


27.64 2.50e-28 79-132 


IKJy i 


TiT> A A 11/1 A 


ATP synthase C subunit signature I 


C\ f /"V A AS" -t f% T C r\ A 

8.69 4.46e-13 75-94 


1 A AT 


OT> A A 1 O .-1 "P> 

PR00124B 


ATP synthase C subunit signature II 


14.47 8.14e-ll 96-111 


my / 


DD A A 1 O /IP 


ATP synthase C subunit signature III 


12.82 6.63e-17 113-138 


1 AAC 


IroUUlO / j 


Aconitase C-terminal domain 


25.70 4.86e-34 947-996 




TDT3 A A 1 AO A A 


Aconitase (aconitate hydratase) 


37.52 1.17e-13 452-500 


1 AQQ 


TDDHA 1 AO A A 


Aconitase (aconitate hydratase) 


37.52 5.06e-23 392-440 


1 HOC 


TDDAA 1 AQAD 


Aconitase (aconitate hydratase) 


18.77 4.32e-18 665-688 


1 AQQ 


TDDAA1 ATfiP 


Aconitase (aconitate hydratase) 


18.16 3.65e-12 710-724 


iuy© 


r>T3 AA/11 C A 

JrK.UU4J.DA 


Aconitase family signature I 


13.25 7.00e-10 323-336 


1098 


PR00415C 


Aconitase family signature III 


14.58 2.75e-13 376-389 


1 AAC 


DDAA/1 1 CA 


Aconitase family signature IV 


14.53 2.93e-15 390-405 


1098 


PR00415E 


Aconitase family signature V 


9.37 1.53e-12 452-465 


1098 


PR00415F 


Aconitase family signature VI 


11.36 6. 14e-12 466-479 


1098 


PR00415G 


Aconitase family signature VII 


14.02 2.35e-14 548-562 


1098 


PR00415H 


Aconitase family signature VIII 


13.87 4.79e-12 613-624 


1098 


PR00415I 


Aconitase family signature IX 


14.82 8.43e-15 675-688 


1099 


PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 92-109 


1100 


PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 188-205 


1101 


T\i% A 1 A Of! XT 

PR01088H 


Na+/H+ exchanger isoform 6 (NHE6) 
signature VIII 


16.89 1.00e-08 142-159 


1 1 09 


pp on? sod 


Transmembrane four family signature IV 


1 A AA 1 /JO- AA iZC A1 

14.90 2.43e-09 65-91 


1104 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.56e-10 124-161 


1104 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 9.65e-09 227-264 


1106 


IPB001983A 


Translationally controlled tumor protein 


14.49 1.43e-25 1-27 


1106 


IPB001983B 


Translationally controlled tumor protein 


15.99 4.43e-20 45-70 


1106 


IPB001983C 


Translationally controlled tumor protein 


16.91 1.41e-24 83-109 
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1106 


IPB001983D 


Translationally controlled tumor protein 


22.98 4.09e-28 123-163 


1107 


IPB000117D 


Kappa casein 


10.18 8.71e-09 513-547 


1108 


IPB001737A 


Ribosomal RNA adenine dimethylase 


27.11 1.86e-09 110-155 


1109 


IPB002048 


EF-hand family 


7.91 1.00e-09 42-54 


1110 


IPB000272 


ATP 1 G 1/PLM/MAT8 family 


14.36 8.64e-23 99-134 


1110 


IPB003403A 


Herpesvirus immediate early protein 


21.25 6.59e-09 39-61 


1112 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 


15.12 6.29e-09 127-148 


1112 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


16.87 7.09e-09 127-148 


1112 


PR00261F 


Low density lipoprotein (LDL) receptor 
signature VI 


15.46 3.91e-09 127-148 


1112 


PR00764B 


Complement C9 signature II 


12.47 1.36e-ll 122-142 


1115 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11.56 4.32e-10 216-227 


1115 


PR00245A 


Olfactory receptor signature I 


10.98 6.70e-10 190-201 


1115 


PR00245B 


Olfactory receptor signature II 


13.73 2.03e-09 227-239 


1115 


PR00245C 


Olfactory receptor signature III 


14.65 3.54e-10 274-290 


1116 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11.56 8.36e-09 5-16 


1116 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 6.19e-09 355-371 


1116 


PR00245A 


Olfactory receptor signature I 


10.98 3.16e-ll 165-176 


1116 


PR00245B 


Olfactory receptor signature II 


13.73 1.77e-09 16-28 


1116 


PR00245B 


Olfactory receptor signature II 


13.73 9.07e-10 202-214 


1116 


PR00245C 


Olfactory receptor signature III 


14.65 2.96e-15 249-265 


1116 


PR00245D 


Olfactory receptor signature IV 


9.34 8.58e-12 309-318 


1116 


PR00245E 


Olfactory receptor signature V 


8.96 2.20e-15 356-367 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 1.92e- 10 300-315 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 1.92e-10 565-580 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 2.83e-ll 1213-1228 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 3.90e-09 143-158 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 4.77e-09 448-463 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 7.83e-ll 1336-1351 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8.02e-10 1093-1108 


1119 


IPB001862A 


Membrane attack complex 
components/perforin/complement C9 


12.54 8. 92e- 12 964-979 


1119 


IPB001928 


Endothelin and related toxins 


14.31 3.00e-09 949-972 


1119 


IPB002861B 


Reeler domain 


10.50 5.95e-l 1 970-998 


1119 


IPB002861C 


Reeler domain 


23.17 1.00e-12 957-1011 


1119 


IPB002861C 


Reeler domain 


23.17 3.45e-09 136-190 


1120 


IPB000079 


Nucleosomal binding domain found in 
HMG14 and HMG17 


8.09 1.64e-37 18-48 


1120 


PR00925A 


Nonhistone chromosomal protein HMG17 
family signature I 


5.67 2.29e-19 18-32 


1120 


PR00925B 


Nonhistone chromosomal protein HMG17 
family signature II 


3.75 3. 18e-16 34-46 


1120 


PR00925C 


Nonhistone chromosomal protein HMG17 
family signature III 


5.23 7.45e-09 47-57 


1120 


PR00925D 


Nonhistone chromosomal protein HMG17 


7.38 2.00e-14 66-76 
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family signature IV 




1121 


PR00219C 


Synaptobrevin signature III 


f r\ 1 1 c £i _ r\c\ on i a o 

6.91 1.56e-09 89-108 


1124 


IPB001304A 


C-type lectin domain 


17.98 8.88e-19 203-227 


1125 


IPB002896F 


Herpesvirus glycoprotein D 


21.08 9.22e-09 374-409 


1125 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 5.15e-09 323-360 


1126 


PR01007I 


Flagellar hook-length control protein 
signature IX 


ct n A r\ f (~\ — a A A zT"*T A A A 

8.84 9.69e-09 467-490 


1131 


IPB000008D 


C2 domain 


14.83 3.05e-ll 1008-1026 


1131 


IPB000738A 


WHEP-TRS domain 


22.67 8.02e-09 1 068-1093 


1131 


PR00360A 


C2 domain signature I 


15.18 8.36e-09 682-694 


1132 


IPB001135C 


NADH-ubiquinone oxidoreductase 49Kd 
chain 


25.46 9.79e-09 72-122 


1133 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11 c /r i i a „ io 101 mo 

11.56 2.20e-12 181-192 


1133 


IPB000276D 


Rhodopsin-like GPCR superfamily 


rv A e\ s~ aa_ 1 A 1 A A n £.f\ 

9.40 6.09e-10 344-360 


1133 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


19.23 7.37e-10 334-360 


1133 


PR00245A 


Olfactory receptor signature I 


1 A AO O CA^ 1 A 1 CC 1 </C 

10.98 o.DUe-lU oo-loo 


1133 


PR00245B 


Olfactory receptor signature II 


13. /J o.zue-13 iyz-zu4 


1133 


PR00245C 


Olfactory receptor signature III 


14.65 4.10e-13 I3y-2jj 


1133 


PR00245D 


Olfactory receptor signature IV 


r\ i a n 11 ooo iao 

y.34 /.o3e-i2 zyo-3u/ 


1133 


PR00245E 


Olfactory receptor signature V 


o.yo 4. / /e-lZ 34D-3DO 


1134 


IPB000425A 


MIP family 


A n *7 Ha 11 00 1 n*7 

y.3 / /.33e-i i yz-iu / 


1134 


IPB000425B 


MIP family 


11 Q/l 1 11 11^ 17^ 

Z2.y4 I./De-31 1Z3-1/3 


1134 


IPB000425C 


MIP family 


1 A ^T*7 1 1Q« ia i/iA 1^/1 

10.0/ /.Zoe-lU Z4U-Z34 


1134 


IPB000425D 


MIP family 


o.lO y.O/e-lj 3U/-3ZD 


1134 


PR00783A 


Major intrinsic protein family signature I 


1 1 01 K <;flo 1/1 OQ 1 A1 

11. oZ D.DUe-14 oo-IU/ 


1134 


PR00783B 


Major intrinsic protein family signature II 


K 1/1 1 £1"*a 1 O 1 07 1^1 

1j.34 i.o/e-iy 1Z/-1DI 


1134 


PR00783C 


Major intrinsic protein family signature III 


1 1 IO 1 H£.a. 1 A 1 1 go 

13. 3Z Z. /oe-lU 104-1 S3 


1134 


PR00783E 


Major intrinsic protein family signature V 


1 1 C O 1 1 D AO IOC 1 < A 

Io.Zj o.lie-uy izo-idu 


1134 


PR00783E 


Major intrinsic protein family signature V 


1 1^ Q 1 la. 1 1 07^ IAS 

Io.Zj o.i3e-ll ZZo-Z4o 


1134 


PR00783F 


Major intrinsic protein family signature VI 


12.30 L53e-14 308-328 


1136 


IPB002897A 


Monocarboxylate transporter 


■\ sz n a AA„ n i 1 o /IT 

16.12 4.90e-z3 lo-4 / 


1136 


IPB002897B 


Monocarboxylate transporter 


24.20 4.30e-32 72-126 


1136 


IPB002897C 


Monocarboxylate transporter 


•11 y-t A AS" 11 1/11 1 Z"0 

13.63 4.46e-13 143-168 


1136 


IPB002897C 


Monocarboxylate transporter 


13.63 7.38e-12 177-202 


1136 


IPB002897D 


Monocarboxylate transporter 


30.44 5.85e-27 296-348 


1136 


IPB002897E 


Monocarboxylate transporter 


32.06 9.59e-24 407-458 j 


1137 


IPB000454 


Eubacterial and plasma membrane ATP 
synthase subunit C 


27.64 9.85e-ll 94-147 


1137 


PR00122A 


Vacuolar ATP synthase l6kDa subunit 
signature I 


10.78 6.00e-18 30-54 


1137 


PR00122B 


Vacuolar ATP synthase l6kDa subunit 
signature II 


10.11 4.32e-26 56-80 


1137 


PR00122C 


Vacuolar ATP synthase l6kDa subunit 
signature III 


10.28 9.40e-32 104-130 


1137 


PR00122D 


Vacuolar Air synthase loicjja subunit 
signature IV 




1137 


PR00124C 


ATP synthase C subunit signature III 


12.82 7.58e-ll 128-153 


1138 


IPB002220A 


Dihydrodipicolinate synthetase 


16.79 9.29e-09 41-53 


1138 


IPB002220B 


Dihydrodipicolinate synthetase 


14.56 6.84e-14 68-89 


1138 


IPB002220C 


Dihydrodipicolinate synthetase 


25.64 1.23e-24 108-155 


1138 


IPB002220D 


Dihydrodipicolinate synthetase 


10.31 6.56e-10 164-175 


1138 


IPB002220F 


Dihydrodipicolinate synthetase 


12.81 6. 14e-10 234-247 
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1138 


PR00146D 


Dihydrodipicolinate synthase signature IV 


1 A C\0 O "7C\~* 111 /TO 1 OA 

14.98 9.79e-ll 163-loU 


1139 


IPB001734A 


Sodiumrsolute symporter family 


14.69 5.76e-16 51-67 


1139 


IPB001734B 


Sodiumrsolute symporter family 


18.26 6.63e-27 166-206 


1139 


IPB001734C 


Sodiumrsolute symporter family 


18.21 7.75e-23 924-953 


1140 


IPB000074C j 


Apolipoprotein A1/A4/E 


22.23 1.58e-09 385-422 


1140 


IPB002607B ! 


Hydratase/decarboxylase 


29.50 3.64e-10 242-290 


1140 


PR00122B 


Vacuolar ATP synthase 16kDa subunit 
signature II 


10.11 7.56e-09 493-517 


1142 


IPB000897A 


GTP-binding signal recognition particle 
(SRP54) domain 


9.15 9.87e-09 611-630 


1143 


IPB000219A 


Dbl domain (dbl/cdc24 rhoGEF family) 


10.71 3.00e-09 156-165 


1143 


IPB000276C 


Rhodopsin-like GPCR superfamily 


8.03 1.41e-10 278-289 


1143 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 2.17e-10 324-340 


1143 


PR00237A 


Rhodopsin-like GPCR superfamily 
signature I 


9.81 8.92e-ll 63-87 


1143 


PR00237B 


Rhodopsin-like GPCR superfamily 
signature II 


12.45 4.08e-09 96-117 


1143 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 5.50e-12 140-162 


1143 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 8.46e-09 226-249 


1143 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 9.33e-13 268-292 


1143 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


19.23 6.04e-15 314-340 


1143 


PR01157A 


P2 purinoceptor signature I 


9.05 3.52e-ll 110-122 


1144 


PR00049D 


Wilm's tumour protein signature IV 


0.00 2.07e-09 7-21 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 1.1 Oe- 11 266-290 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 1.92e-ll 265-289 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 2.23e-ll 270-294 


1145 


IPB00Q135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.00e-09 262-286 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.25e-10 271-295 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.55e-09 272-296 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 5.09e-09 260-284 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.07e-10 267-291 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.09e-09 259-283 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.40e-12 268-292 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.63e-10 263-287 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.98e-ll 269-293 


1145 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 9.24e-12 264-288 


1145 


IPB000637B 


HMG-I and HMG-Y DNA-binding domain 


14.21 8.64e-09 272-290 
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(A+T-hook) 




1145 


IPB001202B 


WW / rsp5 / WWP domain 


12.18 6.40e-15 387-40 1 


1145 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 8.65e-09 258-293 


1145 


IPB001580F 


Calreticulin family 


2.93 8.65e-09 277-286 


1145 


PR00403A 


WW domain signature I 


19.41 5. 03e-10 373-386 


1145 


PR00403B 


WW domain signature II 


13.50 5.33e-15 387-401 


1148 


PR00360B 


C2 domain signature II 


11.64 1.29e-ll 279-292 


1150 


IPB000685G 


Ribulose bisphosphate carboxylase, large 
chain 


10.38 5.23e-09 45-99 


1150 


PR01237A 


Tumour necrosis factor c (lymphotoxin- 
beta) signature I 


6.39 8.27e-09 38-61 


1152 


IPB000276A 


Rhodopsin-like GPCR superfamily 


11 C/^ T i n oio O/l A 

11.56 7.63e-10 238-24y 


1152 


PR00245A 


Olfactory receptor signature I 


m no 1 o a,-, i a o 1 o no 

10.98 1.30e-10 212-223 


1152 


PR00245B 


Olfactory receptor signature II 


10 *7o c cn« 11 O/tA 0*C1 

13 . /3 D.DUe-11 Z4y-zoi 


1152 


PR00245C 


Olfactory receptor signature III 


1 A £iC £ C7„ 1 C Of|£ Q 1 O 

14.65 o.o/e-lD zyo-3 1 2 


1152 


PR00245E 


Olfactory receptor signature V 


O A O A /I ~ 11 A AO A 1 A 

8.96 2.04e-13 403-414 


1152 


PR00534A 


Melanocortin receptor family signature I 


1 O T7 *7 CO- AO 1 *7 1 1 Q Q 
12.// /.DZe-Uy 1/1-lOJ) 


1153 


IPB001680 ! 


G-protein beta WD-40 repeats 


1A /II 1 AA^. 11 /tC/C /I /C*7 

10.43 l.OUe-i 1 4DO-40 / 


1153 


PR00320B 


G protein beta WD-40 repeat signature II 


n oo *7 cc-» 11 /iC/i 
12.82 /.DOe-11 4D4-40O 


1153 


PR00320C 


G protein beta WD-40 repeat signature III 


n ->o /i ai^ 11 >i -cr >i /|£Q 
12.32 4.43e-l 1 4j4-40o 


1153 


PR00962A 


Lethal(2) giant larvae protein signature I 


11 01 O C*7^. 01 1*7 oc 

12. 91 i.D/e-22 1 /-3D 


1153 


PR00962B 


Lethal(2) giant larvae protein signature II 


12.01 i.D2e-Zy olU-JJ2 


1153 


PR00962C 


tt it t /'f^\ • it j_ • • j_ ^ ^ TTT 

Lethal(2) giant larvae protein signature III 


O AO £L A A^ O/TO 1QO 

o.U3 o.4Ue-Z3 302-3oZ 


1153 


PR00962D 


Lethal(2) giant larvae protein signature IV 


AA1 A 1^C^OO/1C1 /1*7/1 

y.yi 4. i oe-zo 4d 1-4/4 


1153 


PR00962E 


Lethal(2) giant larvae protein signature V 


0 01 o £.n ^ 01 ^01 C/tO 
o.ol i.O/e-21 Dil-j4y 


1153 


PR00962F 


Lethal(2) giant larvae protein signature VI 


13.40 8.50e-24 568-587 


1153 


PR00962G 


Lethal(2) giant larvae protein signature VII 


14.51 6.54e-29 609-633 


1153 


PR00962H 


Lethal(2) giant larvae protein signature VIII 


13.53 3.53e-23 639-658 


1153 


PR00962I 


Lethal(2) giant larvae protein signature IX 


12.48 5.13e-23 708-727 


1155 


IPB002027A 


Amino acid permease 


18.88 8.82e-ll 185-213 


1155 


IPB002027C 


Amino acid permease 


19.67 8.31e-21 385-423 


1155, 


IPB002027D 


Amino acid permease 


22.00 6.79e-21 466-505 


1157 


PR01436C 


NADH-ubiquinone oxidoreductase chain 2 
signature III 


9.53 7.75e-09 82-100 


1158 


IPB001428B 


dUTPase 


18.14 1.00e-14 107-125 


1158 


IPB001969A 


Eukaryotic and viral aspartic protease active 
site 


16.37 5.71e-09 195-211 


1158 


PR00783C 


Major intrinsic protein family signature III 


13.32 5.26e-17 31-50 


1160 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 9.42e-10 1750-1760 


1160 


IPB000034A 


Laminin B 


on n 1 oc^x in a oi coo 

22.21 1.85e-19 487-522 


1160 


IPB000034A 


Laminin B 


oo oi o nn« ia 11C/C i mi 

22.21 2.y9e-10 1356-1391 


1160 


IPB000034A 


Laminin B 


OO 11 O O /I „ 1*7 0*7C A1 A 

22.21 3.84e-17 875-910 


1160 


IPB000034A 


Laminin B 


OO 01 O 1 O „ 1 A 1 TIO 1 *7/TO 

22.21 8.13e-10 1733-1768 


1160 


IPB000034A 


Laminin B 


OO 01 O C/C^ AA 1 O/CO 1 OAO 

22.21 o.Doe-uy lZoi-lzyo 


1160 


IPB000034A 


Laminin B 


OO OI A A A ^ AA COO C/CO 

22.21 9.44e-0y 533-D05 


1160 


TnnAArtAI A T~> 

IPB000034B 


Laminin B 


91 9? 1 9^p-14 6S4-609 


1160 


IPB000034B 


Laminin B 


21.22 3.25e-31 1426-1464 


1160 


IPB000034B 


Laminin B 


21.22 3.33e-12 1287-1325 


1160 


IPB000034B 


Laminin B 


21.22 4.77e-10 1757-1795 


1160 


IPB000034B 


Laminin B 


21.22 4.99e-18 601-639 


1160 


IPB000034B 


Laminin B 


21.22 5.45e-09 511-549 


1160 


IPB000034C 


Laminin B 


12.97 3.09e-ll 508-526 


1160 


IPB000034C 


Laminin B 


12.97 3.45e-09 319-337 
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i i /cn 

1 160 


1PB000034C 


Laminin B 


12.97 4.00e-l 1 443-461 


1160 


TT>"D A AAAO A /"l 

1PB000034C 


Laminin B 


12.97 4.05e-12 1377-1395 


1 1 /ca 
1 160 


T"D"D AAAA1 A t~* 

1PB000034C 


Laminin B 


12.97 6.52e-13 554-572 


1 1 /ca 
1 160 


TOTD AAAAO A 

1FB000034C 


Laminin B 


12.97 6.56e-20 1754-1772 


1 1 *CA 

1 160 


TTJT5AAAAO A 

IPB000034C 


Laminin B 


12.97 6.73e-09 1284-1302 


1 1 <ca 
1160 


1PB000034C 


Laminin B 


12.97 8.01e-09 598-616 


1 1 iCA 

1 lou 


1PB000034C 


Laminin B 


12.97 9.09e-ll 651-669 


1160 


1PB000034C 


Laminin B 


12.97 9.31e-16 1653-1671 


1 1 /CA 

1 160 


TT5"D AAAC/C 1 

1PB000561 


EOF-like domain 


4.89 5.50e-09 1754-1762 


1 1 £T A 

1 160 


TDT5AA1 AA1 T\ 

1PB001001D 


r\\T A 1 TTT 1a 1 

DNA polymerase III, beta chain 


14.35 8.29e-09 2372-2395 


1 1 £!A 

1 160 


1PB001169K 


Integrin beta, C-terminus 


27.45 8.93e-ll 1691-1733 


1 1 /TA 

1160 


IPB001271 


Mammalian defensin 


19.97 5.91e-09 1728-1756 


1 1 /^A 

1 160 


TTTHAA 1 *7*7 A /—I 

1PB001774C 


Delta serrate ligand 


18.25 8. 16e-09 545-587 


i 1 /ca 
1160 


Trin AA1 OO/T A 

1PB001886A 


Laminin N-termmal (Domain VI) 


8.51 8.27e-09 124-131 


1160 


IPB001886B 


Lammin N-termmal (Domain VI) 


15.20 5.11e-21 286-308 


1 1 /CA 

1160 


TTIT5AA1 C O /Ti^ 

1PB001886C 


Laminin N-terminal (Domain VI) 


24.54 1.00e-09 1284-1323 


1 1 £TA 
1 160 


1PB001886C 


Laminin N-terminal (Domain VI) * 


24.54 1.00e-ll 1703-1742 


1 1 cc\ 

1160 


TnrjAAl oOZTi 

1PB001S86C 


Laminin N-terminal (Domain VI) 


24.54 1.23e-10 508-547 


i 1 cr\ 
1160 


TT>T>AA1 OO^A 

1PB001886C 


Laminin N-terminal (Domain VI) 


24.54 1.53e-14 1754-1793 


1160 


IPB001886C 


Lammin N-terminal (Domain VI) 


24.54 2.97e-33 319-358 


1160 


IPB001886C 


Laminin N-terminal (Domain VI) 


24.54 3.28e-13 391-430 


1160 


IPB001886C 


Laminin N-termmal (Domain VI) 


24.54 4.19e-10 554-593 


1160 


1PB001886C 


Lammm N-termmal (Domain VI) 


24.54 5.67e-10 1423-1462 


1 160 


TT>r>AA1 nn/z-i 

1PB001886C 


Laminin N-termmal (Domain VI) 


24.54 6.33e-ll 598-637 


1160 


1PB001886C 


Lammin N-termmal (Domain VI) 


24.54 7.49e-10 1377-1416 


i 1 cr\ 
1160 


Tnn AA1 OOZTI 

1PB001886C 


Lammm N-termmal (Domain VI) 


24.54 8.11e-ll 651-690 


1 1 *CA 

1 160 


1PB001886D 


Laminin N-termmal (Domain VI) 


12.51 1.71e-09 439-457 


1 160 


IPBOUI086D 


Laminin N-terminal (Domain VI) 


12.51 3.65e-17 1699-1717 


1 1 A 

1160 


1PB001886D 


Laminin N-terminal (Domain VI) 


12.51 8.95e-10 1750-1768 


1 1 <A 
1 160 


TDDAA1 0 O /CT"\ 
lPB0018b6D 


Lammm N-termmal (Domain VI) 


12.51 9.05e-22 387-405 


1 1 £A 

1 160 


TDDAA 1 OO /CD 

!PB0018b6b 


Laminin N-terminal (Domain VI) 


10.90 1.21e-13 1282-1298 


1 i ezr\ 
1160 


1PB001886E 


Lammm N-termmal (Domain VI) 


10.90 1.47e-09 317-333 


1 1 ^A 

1160 


1PB001886E 


Lammm N-termmal (Domain VI) 


10.90 4.82e-16 1375-1391 


1 1 /CA 

1160 


TAT3AA1 OO^TD 

1PB001886E 


Laminin N-terminal (Domain VI) 


10.90 5.97e-09 649-665 


1160 


IPB001886E 


Lammm N-termmal (Domain VI) 


10.90 6.74e-09 441-457 


1160 


1PB001886E 


Laminin N-terminal (Domain VI) 


10.90 7.43e-13 1752-1768 


1 1 cc\ 

1 160 


TDDAA1 0 O 

1PB001886E 


Laminin N-terminal (Domain VI) 


10.90 7.61e-ll 596-612 


1160 


TT»r>AA1 OOifr 

IPB001886E 


Lammin N-terminal (Domain VI) 


10.90 8.35e-ll 506-522 


1160 


IPB001886E 


T" "x T j_ • 1 /'T~x • T rr\ 

Lammm N-termmal (Domain VI) 


10.90 9.50e-20 552-568 


1160 


1PB003884C 


Factor I membrane attack complex 


13.00 3.22e-09 1403-1421 


1 1 /TA 

1160 


TltiAAAl 1 A 

PR00011A 


Type III EGF-like signature I 


14.05 1.68e-14 1416-1434 


1 160 


T»T1 AAA 1 1 A 

PR00011A 


Type III EGF-like signature I 


14.05 1.69e-ll 644-662 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 3.88e-15 1277-1295 


1160 


PR00011A 


m TTT 'HOT' 1 ""I • a T 

Type III EGF-like signature I 


14.05 3.93e-16 1370-1388 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 3.95e-13 436-454 


1 1 fn 

1 LOW 


pDAAAl 1 A 

.ri\.UUUl 1A 


Type III EGF-like signature I 


1 /I AC -C AA„ AA CA1 £Zf\C\ 

14. Oj j.00e-0y jy 1-609 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 5.05e-12 1318-1336 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 5.43e-ll 695-713 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 6.82e-10 501-519 


1160 


PR00011A 


Type III EGF-like signature I 


14.05 8.83e-16 547-565 


1160 


PR00011B 


Type III EGF-like signature II 


13.08 1.00e-09 591-609 


1160 ' 


PR000UB 


Type III EGF-like signature II 


13.08 1.66e-10 695-713 


1160 


PR00011B 


Type III EGF-like signature II 


13.08 4.18e-ll 644-662 
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1 LOU 


r\K.UUUl lr> 


l ype ill JbGF-like signature 11 


13.08 4.35e-15 1370-1388 


1 1 ACl 

1 IOU 


DDAnni 1 TJ> 
rKUUUl 115 


T" 1 , ™ „ TTT O /"IT? 1-1 — a tt 

Type III EGF-like signature II 


13.08 5.57e-12 436-454 


1 1 

1 IOU 


T>r> AAA 1 1 r> 

rKUUU 1 1 B 


np. M „ TTT T7 /~i T? 1*1 • < TT 

Type III EGF-like signature II 


13.08 5.71e-17 547-565 


1 IOU 


JrKUUUl lr> 


np w TTT T"?/"~1T7 1-1 * i TT 

lype 111 EGF-like signature II 


13.08 6.46e-12 1416-1434 


1 1 AC\ 

l IOU 


T>T> AAA 1 1 "D 

JrKUUUl lr> 


1 ype III EGF-like signature II 


13.08 7.52e-13 1277-1295 


1 IOU 


"DD AAA1 1 T5 

JrKUUUl lr> 


TTT T7 /""i T7 1 J 1 ♦ » yt 

1 ype III EGF-like signature II 


13.08 8.48e-ll 1318-1336 


1 1 

1 IOU 


DD AAA1 1 

rKUUUl 1C 


1 ype 111 bur-like signature III 


25.79 2.62e-09 651-679 


1 1 

1 IOU 


DT) AAA 1 1 /~< 

x'KUUUl 1G 


r T^-w ~~ TTT TJi^T? 1*1 * > TTT 

lype 111 EGF-like signature III 


25.79 6.53e-10 1423-1451 


1 IOU 


r\KUUUl ID 


lype 111 EGF-like signature IV 


12.12 1.18e-14 1370-1388 


1 1 f\d 


rMvUUUl 1LI 


lype in bor-like signature IV 


12.12 3.63e-10 695-713 


1 1 £fi 
I IOU 


pT> AAA1 1 T> 

r*KUUUl ID 


lype 111 EGF-like signature IV 


12.12 4.96e-14 501-519 


1160 


PR00011D 


Type III EGF-like signature IV 


12.12 6.01e-09 591-609 


i IOU 


DT? AAA1 1 r\ 

r'KUUUI ID 


r T , _. ~ TTT T~? T7 1 J 1 „ * , j TIT" 

lype 111 EGF-like signature IV 


12.12 7.65e-10 644-662 


1 1 £fl 
1 IOU 


rKUUUl ID 


T 1 -^^. TTT "D^T7 1-1 ' x TT 7 

1 ype III EGF-like signature IV 


12.12 8.25e-17 547-565 


1160 


PR00011D 


Type III EGF-like signature IV 


12.12 8.33e-13 436-454 


1 1 /T A 

1160 


PR00011D 


Type III EGF-like signature IV 


12.12 8.92e-14 1416-1434 


1160 


PR00011D 


Type III EGF-like signature IV 


12.12 9.17e-13 1277-1295 


1160 


PR00011D 


Type III EGF-like signature IV 


12.12 9.23e-09 1747-1765 


1160 


PR00877D 


Plant PEC family metallothionein signature 
IV 


6.45 1.89e-09 1308-1315 


1161 


1PB000034C 


Laminin B 


12.97 6.42e-10 205-223 


1 1 <c i 
1 lol 


TDDAAA 1 CO 

1FBU00152 


Aspartic acid and asparagine hydroxy! ation 
site 


8.86 1.00e-10 158-173 


1161 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 9.29e-09 196-211 


1161 


IPB000359A 


Cystine-knot domain 


23.24 4.86e-10 129-153 


1161 


IPB000561 


EGF-like domain 


4.89 9.25e-10 46-54 


1161 


IPB001762A 


Disintegrin 


23.93 9.65e-09 95-135 


1161 


IPB001774C 


Delta serrate ligand 


18.25 1.71e-31 37-79 


1 1 /Z 1 

llol 


TT»T» A A 1 T~1 A /~1 

IPB001774C 


Delta serrate ligand 


18.25 9.17e-12 68-110 


1161 


IPB001862F 


Membrane attack complex 
components/perforin/complement C9 


29.39 2.65e-09 170-217 


1101 


IrBUUloozr 


Membrane attack complex 
components/perforin/complement C9 


29.39 8.53e-09 132-179 


1 lOl 


TT)D A A 1 oorrj 

IrBUUloouli 


Laminin N-terminal (Domain VI) 


10.90 9.67e-10 44-60 


1 1 ^1 
1101 


nn aaai n A 
r'KUUUI UA 


lype 11 bGr-like signature I 


12.91 1.27e-09 143-154 


1 101 


DD AAAI /IP 

IrKUUUlUG 


""P 1 , _ TT Tj/^TT 1 — 1 . „ ' , . j TTT 

lype 11 EGF-like signature III 


6.98 2.54e-09 163-173 


1 lOI 


DO AAAI 1 A 

JrKUUUl 1A 


1 ype ill bor-like signature 1 


1 A A y^" OO A A OA f <—i 

14.05 6.88e-09 39-57 


11^1 
1101 i 


"DT? AfiA1 1 T5 

r^KUUUl IB 


Tir^a TTT TT / ' T7 1 -* 1 - ^. j.T JL ... L i tt 

lype 111 bGr-like signature 11 


13.08 7.88e-10 39-57 


1161 


PR00011D 


Type III EGF-like signature IV 


12.12 4.57e-12 39-57 


1 1 ^ri 
llol 


PR01217G 


Proline rich extensin signature VII 


4.02 7.79e-09 221-246 


1162 


IPB000471A 


Interferon alpha, beta and delta family 


27.36 2.86e-34 45-98 


1 1 

1162 


PR00266A 


Interferon alpha and beta subunit signature I 


13.41 9.59e-14 67-79 


1163 


IPB000975D 


Interleukin-1 


24.45 5.55e-09 59-98 


1163 


IPB000975E 


Interleukin-1 


28.12 9.80e-09 103-142 


1163 


PR00264A 


Interleukin-1 precursor family signature I 


18.63 1.00e-08 62-82 


1 1 fi^ 




Interleukin- 1 receptor antagonist precursor 
IL-1RA signature III 


10.33 4.o4e-ll 65-82 


1163 


PR01360F 


Interleukin-1 receptor antagonist precursor 
IL-1RA signature VI 


14.44 3.11e-12 124-142 


1164 


IPB000072 


Platelet-derived growth factor (PDGF) 


13.51 4.00e-26 75-97 


1164 


IPB003498F 


Probable DNA packing protein, C-terminus 


23.31 7.55e-09 154-193 


1165 


IPB000975E 


Interleukin-1 


28.12 3.57e-12 95-134 


1165 


PR00264A 


Interleukin-1 precursor family signature I 


18.63 9.85e-09 55-75 
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1165 


PR00264C 


; : — 

Interleukin-1 precursor family signature III 


i o in A on» 16 1 08 191 


1165 


PR01357F 


Interleukin-1 alpha/beta precursor family 
signature VI 


17.87 7.15e-10 108-123 


1165 


PR01360E 


Interleukin-1 receptor antagonist precursor 
IL-1RA signature V 


9.69 9.33e-13 95-115 


1165 


PR01360F 


Interleukin-1 receptor antagonist precursor 
IL-1RA signature VI 


1 A A A Q Q/; Q 1Q 1 1/C 1 n A 

14.44 y.ooe-lo 110-1.34 


1166 


IPB000276C 


Rhodopsin-like GPCR superfamily 


(S.kjd o.oue-li 


1166 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


1 a iA * n^o \ & 97 ai 


1166 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


ly.Zj) O.OUC-IJ f\J~Z7\J 


1166 


t»t» a a A o it? 

PR00427E 


Interleukin-8 receptor signature V 




1166 


PR00427F 


Interleukin-8 receptor signature VI 




1166 


"T»T"> A A A 1 T/""* 


Interleukin-8 receptor signature VII 


10 47 S 5^p-17 QS-1 14 


1 1 C£L 

lloo 


PKUUj /3JJ 


interieuKin oJts receptor signature i v 


15 06 1 85e-18 114-129 


1167 


T1TJ A A /I 1 /I A 


r aimitoyi protein inioesierase bignaLui e i 


10 61 ^ 44e-l1 26-48 


1167 


rKUU414r> 


-raimitoyi protein tnioesierdsc bignaiure 11 


1 1 87 6 7Qe-25 66-85 


1167 


T»T> A A >1 1 A {~*- 

PR0U414C 


Jraimitoyl protein tnioesterase signature jlii 


n 1 Q0p-?S 106-122 


1167 


TIT* f\f\ A 1 A T"\ 

PR00414D 


Palmitoyl protein tnioesterase signature i v 


1 7 99 9 1 ^e-97 1 S 1 - 1 70 


1167 


PR0l)4141i 


raimitoyi protein tnioesterase signal ure v 


17 10 1 43e-27 1 85-204 


1167 


PR00414G 


Palmitoyl protein thioesterase signature VII 


12.98 2.36e-24 258-276 


1168 


IPB000215A 


Serpins 


ii 01 4 1 8p-90 7^-Q6 


1168 


IPB000215B 


Serpins 


0 87 Q 68^-OQ 1 81-1 


1168 


IPB000215C 


Serpins 


1 ^ OA 7 11 <=»-1 1 908-999 


1168 


TnnAAA^ 1 r\ 

IPB000215D 


Serpins 


1 ^ 1^ 1 88<=-1 6 ^10-^^6 


1168 


IPB000215E 


Serpins 


i« qfi i 8 ^01-41 S 


1170 


tt»t» r\r\ 1 T71 

IPB001271 


Mammalian defensin 


10 07 ^49^.91 10^-1^1 


1172 


TT»T» AAAO C7 

IPB000387 


Tyrosine specific protein phosphatase and 
dual specificity protein phosphatase family 


10 77 S 8^e-11 4^0-440 


1172 


PR00700A 


Protein tyrosine phosphatase signature I 


^ O^ 1 OO^ 10 9Q^-^09 


1172 


PR00700B 


Protein tyrosine phosphatase signature II 


17 fl£ ^ 7Hp„9 1 ^1^_^^^ 
1 /.UO ^. /Uc-Zrl J IJ-jjj 


1172 


PR00700C 


Protein tyrosine phosphatase signature III 


io on o 0^f*-16 ^00-416 


1172 


PR00700D 


Protein tyrosine phosphatase signature IV 




1172 


PR00700E 


Protein tyrosine phosphatase signature V 


1£ ^O 1 Q^/a 10 467-489 


1172 


PR00700F 


Protein tyrosine phosphatase signature VI 




1172 


PR01371E 


Salmonella/Y ersinia modular tyrosine 
phosphatase signature V 


11 71 9 1 1 1> 0Q 4^7-448 

1 1. / 1 z. i ie-ui? / -< +*+o 


1187 


IPB000008C 


C2 domain 


23.37 2.44e-09 61-100 


1187 


1PB002642A 


Lysophospholipase catalytic domain 


18 17 1 6Q^-11 1SS-180 


1187 


IPB002642B 


Lysophospholipase catalytic domain 


1 1 84 4 18^-1 S ^81-407 

li.O^f ^.JOC-lJ JOJ'HU/ 


1187 


IPB002642E 


Lysophospholipase catalytic domain 


18 1 Q 6 01 p-1 0 ^00 ^14 


1187 


TT\T^I AAA /" ,4 S\ f~\ 

IPB002642G 


Lysophospholipase catalytic domain 


1/t 1 1 & 70p 10 678-796 
jt 1 , 1 1 O. /Zc-lv 0/0-/ZO 


1187 


PR00360B 


C2 domain signature II 


11^/18 19 88-1 01 


1188 


PR00860B 


Vertebrate metallothionein signature II 


o.yu / . i ac-vy jz-uj 


1189 


IPB000566B 


Lipocalin and cytosolic fatty-acid binding 
protein 


8.91 9.53e-09 122-132 


1189 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


17.66 1.32e-23 285-319 


1189 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


17.66 9. 12e-15 247-281 


1189 


PR00179A 


Lipocalin signature I 


13.97 2.13e-10 38-50 


1189 


PR00179B 


Lipocalin signature II 


7.67 4.91e-12 122-134 


1189 


PR00179C 


Lipocalin signature III 


17.26 4.60e-09 150-165 


1189 


PR00759A 


Basic protease (Kunitz-type) inhibitor 
family signature I 


14.67 5.88e-09 228-242 
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1189 


PR00759B 


Basic protease (Kunitz-type) inhibitor 
family signature II 


12.35 3. 8 le- 10 294-304 


1189 


PR00759B 


Basic protease (Kunitz-type) inhibitor 
family signature II 


12.35 8.71e-12 25o-2oo 


1189 


PR00759C 


Basic protease (Kunitz-type) inhibitor 
family signature III 


1 <-\ An £~ f\r\ n 1 A 1f\A HQ 

12.43 o.UUe-14 oU4-o iy 


1189 


PR01215A 


Alpha- 1 -microglobulin signature I 


1 A AC 1 9^o 1 C ^9 A^ 
14. Uo D.ZdQt 15 jZ-^j 


1189 


PR01215B 


* * * -4 • 111* * A. - TT 

Alpha-l-microglobulm signature II 


1 A O O T ftrto 1 ^ A1 ^ Q. 
14. Zo /.UUe-lD 4 /-Do 


1189 


PR01215C 


Alpha- 1 -microglobulin signature III 


0 99 1 &'\e± 1 Q Q1 _1 C\fi 

o.zz j.oje- iy ? i- 1 \jo 


1189 


PR01215D 


Alpha- 1 -microglobulin signature IV 


1 9 OQ 1 9T 1 1 ^ 1 ^9 
1Z.OO l.Dje-Z / 11j-1jZ 


1189 


PR01215E 


Alpha-l-microglobuhn signature V 


n on 1 fZ&a, OA 1 A^ 1 (\f\ 
lZ.oO 1.0oe-Z4 14D-100 


1189 


PR01215F 


Alpha- 1 -microglobulin signature VI 


1A -31 Q CQp 9« 1 TO 109 

lu.oi y.Dye-zj i/j-iyz. 


1189 


PRO 1273 A 


Invertebrate colouration protein signature I 


1 A AC 9 cq 0 AQ 'X'Z AQ 
14. Uo d.j Je-Uy DD-Hy 


1200 


IPB001594 


DHHC-type Zn-fmger 


ni 01 n Ala 91 QQ 1 A'X 


1201 


IPB001909 


KRABbox 


17.37 9.61e-26 35-69 


1202 


IPB001909 


KRAB box 


17.37 9.61e-zo Jj-oy 


1203 


IPB000961A 


Protein kinase C-terminal domain 


1 00 C f"\T ~ AA T/1 1 AC 

16.82 5.97e-09 /4-lus 


1204 


IPB000001D 


Kringle 


11.31 3.92e-15 81-97 


1204 


IPB000126A 


Serine proteases, V8 family 


1 1 tc n ci ^ 1001 A/C 

11.75 9.51e-12 ol-yo 


1204 


IPB000177K 


Apple domain 


10 in 0 rrc^. 11 OQ 1 1 C 

13.19 3.55e-ll 53-1 ID ! 


1204 


IPB001254A 


Serine proteases, trypsin family 


9.98 o.06e-lo 51-9/ 


1204 


PR00722A 


Chymotrypsm serine protease family (SI) 
signature I 


12.06 3.3 7e-l / 5Z-9 / 


1208 


PR00179B 


Lipocalin signature II 


T £7 9 irio AQ 191 1 99 

/.o / /.oue-uy izi-idj 


1208 


PR01174F 


Retinol binding protein signature VI 


1 1 7£ Q lAo, AQ 1 9A 1 9£ 

1 1 . /o y. J4e-uy izu-1.50 


1208 


PR01254A 


Prostaglandin D synthase signature I 


12.32 9.79e-09 31-54 


1208 


PR01275B ! 


Neutrophil gelatinase lipocalin signature II 


n nT 1 „ 1 1 0 A /I A 

9.02 3.1oe-l 1 39-49 


1209 


IPB001007C 


von Willebrand factor, type C repeat 


20.89 5.97e-10 34-55 


1212 


PR00019A 


Leucine-rich repeat signature I 


-1 1 t"* /■» ^ r\ ^ 1 '1 1 TT 1 AA 

11.72 2.50e-12 177-190 


1212 


PR00019A 


Leucine-rich repeat signature I 


11.72 7.82e-09 390-403 


1212 


PR00019B 


Leucine-rich repeat signature II 


1 -t A r\ 1 s~ A 1 1 1 T /f 10T 

11.42 1.64e-13 174-187 


1212 


PR00019B 


Leucine-rich repeat signature II 


11.42 9.18e-ll 387-400 


1213 


IPB000834B 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


13.51 2.50e-17 103-117 


1213 


IPB000834C 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (M14) family 


17.20 2.80e-15 172-188 


1213 


IPB000834D 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


18.95 4.72e-12 199-225 


1213 


IPB000834E 


Zinc carboxypeptidases, carboxypeptidase 
A metalloprotease (Ml 4) family 


9.80 2.15e-09 228-242 


1213 


PR00765B 


Carboxypeptidase A metalloprotease (M14) 
family signature II 


14.48 1.39e-15 99-113 


1213 


PR00765C 


Carboxypeptidase A metalloprotease (M14) 
family signature III 


10.88 1.82e-10 179-187 


1213 


PR00765D 


Carboxypeptidase A metalloprotease (M14) 
family signature IV 


14.06 9.45e-12 233-246 


1214 


PR01536C 


Interleukin-1 receptor type I and type II 
family signature III 


1 0 0? 7 00e-09 67-90 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 1.30e-ll 56-93 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.22e-15 44-81 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.29e- 19 38-75 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.74e-09 20-57 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.82e-15 68-105 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 3.05e-10 92-129 
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1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA 7 77^ 1 1 OA A7 

11.40 j.zze-ii zo-od 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 Afi 1 1Ae 1 A QQ 17A 

11.40 j.dqq- m oy-izo 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 /I A 7 7Ao, 10 07 AO 

1 1 .4o J.ooe- iz jz-oy 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 /IA 7 C7» 1 A A1 5 A 
1 1 .40 D\D /e- 14 4 /-o4 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 A A A 1 A» 1^71 105 

1 1 .4o 4. loe-i D /1-lUo 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 /I A A 1 7#a OH 1 1 7 

i i.4o 4. i /e-io ou-i i / 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 A£. A IQa. 1^/11 76 
1 1.40 4.Zoe-l j 41- /o 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 /I A A 1/1 AC. 101 
1 1 .40 4.oZe- 14 0D- 1 UZ 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA A A</=» 1 < ^.A 57 
1 1 .40 4.uDe- ID DU-o / 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 /IA C ^/Co 1 7 A7 QQ 

li.4o D.Doe-ij oz-yy 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 /IA C < Aa 1/1 Q-3 1 OA 

11.40 D.D0e-14 oo-iZU 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA ^ Q1*=» 1 A 7G AA 

1 1 .40 D.y ie- 14 zy-oo 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA 7 1 A 1 ^7 
11.40 / .U /e- 1U yD- LOZ. 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AG* 7 AA<= 1 A 5A 1 07 
11. 40 /.40c- 10 oO- IZj 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA 1 1 A 77 1 IA 

11.40 /.jye-io //-114 


1221 


TTinAAAOfir A 


Fibrillar collagen C-terminal domain 


1 1 A A 5 OQp 17 CO OA 

l i.4d o.zye-iz Do-yu 


1221 


IPB000S85A 


Fibrillar collagen C-terminal domain 


1 1 A A 5 QAo 1 A *\G OA 

11.40 o.Joe-14 3y-yo 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA Q AAp» 1 Q 7^. 77 

ii. 40 y.uue-iy 


1221 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1 AA Q A7^ 1 A 7A 111 
1 1.40 y.4Ze-14 /4-111 


1221 ! 


IPB000885B 


Fibrillar collagen C-terminal domain 


ifl if 1 fiflfl OA 77 1 7^ 

ly.iD i.uue-zu /z-izd 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


iv.iD i.zze-io jj-oo 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


ifl K 1 /1 7o 1 Q CI 1 HA 

ly.iD i.4/e-io ji-iU4 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1Q K 1 Clo 01 AO 107 

ly.iD i.Die-zi oy-izz 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 n 1 c i n a a oa i£. on 

iy.13 i./4e-zu 30-oy 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


ia 1 r i n /1 1 A OA 1/17 

ly.iD i.y4e-iu yu-i4o 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


ia if 1 CiOa. f\C\ 1 C AO 

iy.i_) i.yoe-uy id-oo 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


i A 1^ o oi „ -|/| OA 

ly.lD Z.Zie-14 Z/-oU 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in K 1 n"2ja 1 7 75 1 'X 1 

iy. i!) 3.uje-i / /o-iji 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


m K Q 77c» 1 C 7/1 77 

ly.iD J.ZZe-lJ Z4-// 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


m 1« 7 O/l/a 17 QA 177 

ly.lD i.o4e-li o4-lo/ 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


m ico O/i /io nc 

ly.iD j.yje-Z4 4Z-yo 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 Q 1 < A 1 Ao 1 c an 57 

ly.iD 4.ioe-io 3U-oo 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 O 1 C A lOo 77 7C 175 

iy. id 4. 3ze-Z3 o-izo 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in K ^ 7Aq 7A AA 117 

iy. id 4. /ue-zu ou-i i ~> 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


ia K ; A7«» 7A A^ 05 

ly.iD D.u/e-zu 4D-yc 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in ic A /1A<a AO 7 ^ A 

iy. id o.4ue-uy j-do 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1Q1^AA1*= 15 A7 11A 

iy. id o.oie-io oj-iio 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in ic 7 A7«a 77 «7 1 in 

iy. id /.oze-zz d /- 1 iu 


1221 


tt\t\ r\f\f\ a a c tt\ 

IPB000885B 


Fibrillar collagen C-terminal domain 


in k 7 A7<=> AO Q AO 

ly.iD /.oje-uy y-oz 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


10 K 7 71q no 1 O A^ 

ly.iD /. / ie-uy iz-od 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1Q 1^ 7 51p» 01 00 

ly.iD /.oie-zi jy-yz 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in k 7 O/i q I Q AA 1 1 Q 

ly.iD /.o4e-iy oo-i iy 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


in K 7 O<o 17 15 71 

ly.iD /.yDe-io i o- / 1 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 n 1 c o on a 1 a no 1 /i/c 

ly.iD o.oye-iu yi-i4o 


1221 


IPB000885B 


Fibrillar collagen C-termmal domain 


i n 1 c n die*, oo c/i i no 

ly.iD y.u/e-zz D4-iu/ 


1ZZ1 


T'D'nnnnsR^'R 
lrt>\J\JyJooDD 


Th"i r"\t*1 1 1 Q f r»/"\11 r» rr/an +ot*m i n o 1 rlAtnoiri 

FiDniidr coiiagen v^-ierniiiicu tionidin 


19 15 9 16e-12 81-134 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.41e-13 21-74 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.88e-20 48-101 


1221 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.91e-12 87-140 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.00e-ll 40-74 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.42e-09 55-89 


1221 


IPB001073A 


Complement Clq protein 


22.14 1.67e-ll 79-113 


1221 


IPB001073A 


Complement Clq protein 


22.14 2.97e-09 43-77 
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112.1 


TDDAH 1 f\71 A 

LrrJUUlU /iA 


— — ; — — ; 

Complement Clq protein 


99 14 1 Q1-195 


1 11 1 

1221 


TT>T") r\r\ -> r\TJ A 

lrBUU 10/3 A 


Complement Clq protein 


99 14 % 7Sp-10 11 

ZZ. 1*+ J. I JC"lu O 


1221 


TT1TDAA1 All A 

1PB0010/3A 


Complement Clq protein 


99 14 /I OQ^-OQ RS-1 1Q 
zz. ih t +.\Ji»c -\jy oj" l \.y 


1 OO 1 

1221 


TDDAA1 A 


Complement Clq protein 


99 14 A 4Sp-1 I 14-68 


1 11 1 

1221 


TTIOAA1 AT3 A 

1PBU010/3A 


Complement Clq protein 


99 14 A fiOp-19 04.-1 9ft 

ZZ. It < T.UUC"li y~r IZrO 


1221 


1PB001073A 


Complement Clq protein 


99 14 ^ SOf»-1 1 SIR- 199 

ZZ.H J.JUC"!! OO-IZZ 


1 ii i 
1221 


TUT1AA1 AT3 A 

1J-*\HU01U/3A 


Complement Clq protein 


99 14^ f\lp 1 1 1£\- 110 
ZZ. 1-t- J.O /c-JL 1 /O-llU 


1111 
1221 


1PB001073A 


Complement Clq protein 


97 14 7 1 Hp 10 -\9_&£ 


1 lO 1 

1221 


TDDAA 1 AT^ A 

lPr>UUlU/3A 


Complement Clq protein 


99 14 7 11p-1 1 S9-1 1 6 


1221 


TDDAA1 ATI A. 

lPr>UUiU/3A 


Complement Clq protein 


99 1 4 7 ->1 p-1 1 17-7 1 


1221 


IPB001073A 


Complement Clq protein 


22.14 7.83e-ll 97-131 


1221 


Tnnf\A1 AT) A 

1PB001073A 


Complement Clq protein 


77 14 2 17p 10 -\ 8 Q9 

zz. i*f o. i /e- iu -)o-y/ 


1221 


IPB001073A 


Complement Clq protein 


22.14 8.59e-09 49-83 


1221 


IPB001073A 


Complement Clq protein 


70 1/1 O Qlo lO 1 AO 11 A 

zz. 14 y.oie-13 


1221 


IPB001073B 


Complement Clq protein 


20.88 7.75e-28 150-184 


1221 


IPB001073C 


Complement Clq protein 


1 1 o C £Z 1 11 /I 11') 

13.07 8, 56e-16 214-233 


1221 


IPB001073D 


Complement Clq protein 


7.60 7.68e-09 251-260 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


i /•*■ ii i aa 1 o oi n/i 

26.12 L09e-13 82-134 


1221 


IPB001442A 


C-termmal tandem repeated domain in type 
4 procollagen 


1^" 11 1 O A „ AA A1 1 /I I 

26.12 L39e-09 91-143 


1221 


IPB001442A 


C-termmal tandem repeated domain in type 
4 procollagen 


i^r 111 /ti~ 1 /t i k 

26.12 L42e-lo 64-116 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


i/i** iii/ii n o 1 o o 

26.12 2.41e-17 31-83 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 2.47e-16 76-128 


1221 


IPB001442A 


C-terminal tandem repeated domain m type 
4 procollagen 


1ZT 111 "70 „ 1 <-7 £"»~7 1 1 A 

26.12 2.73e-17 67-119 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


^✓"1^/^Or\ nA ilA 1 A 1 

26.12 2.89e-20 49-101 


1221 


IPB001442A 


C-terminal tandem repeated domain m type 
4 procollagen 


26.12 3.97e-09 10-62 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.03e-16 79-131 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.79e-20 46-98 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.80e-17 25-77 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.80e-17 28-80 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.82e-21 40-92 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 4.96e-12 13-65 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


r\ 1 <■*» f CI ^* AA OO 1 A f\ 

26.12 5.85e-09 88-140 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.56e-l 8 34-86 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.77e-14 22-74 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.39e-23 37-89 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 


26.12 7.55e-14 16-68 
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4 procollagen 




1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.99e-20 70-122 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.07e-22 43-95 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.30e-23 61-113 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.58e-20 58-110 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.09e-15 19-71 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.18e-19 55-107 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.37e-12 85-137 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.56e-18 73-125 


1221 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.59e-24 52-104 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.00e-ll 104-124 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.49e-09 47-67 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.85e-09 107-127 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 2.82e-09 41-61 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 2.82e-09 86-106 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.31e-09 71-91 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.55e-09 56-76 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.68e-09 50-70 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 3.92e~09 68-88 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 5.14e-09 89-109 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 5.24e-10 101-121 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 6.01e-10 38-58 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 6.72e-09 59-79 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 7.81e-09 35-55 


1221 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 9.15e-09 44-64 


1221 


PR00007A 


Complement C1Q domain signature I 


20.64 7.47e-17 143-169 


1221 


PR00007B 


Complement C1Q domain signature II 


15.63 5.26e-14 170-189 


1221 


PR00007C 


Complement C1Q domain signature III 


16.13 2.07e-14 214-235 


1221 


PR00007D 


Complement C1Q domain signature IV 


9.66 3.91e-10 249-259 


1223 


IPB000053H 


Thymidine/pyrimidine-nucleoside 


12.28 9.76e-09 28-37 
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phosphor ylase 




1223 


IPB000095E ! 


PAK-box /P21-Rho-binding 


17.62 4.19e-ll 275-320 


1223 


IPB000494C 


Epidermal growth-factor receptor (EGFR), 
L domain 


24.40 3.60e-3l 261-307 


1223 


IPB000494D 


Epidermal growth-factor receptor (EGFR), 
L domain 


in a a *"7 frt„ oa too ni 

19.44 7.5Qe-29 3zz-3 /l 


1223 


IPB000494E 


Epidermal growth-factor receptor (EGFR), 
L domain 


21. OD Z.4oe-Z4 5 /y-4 14 


1223 


IPB000861F 


PKN/rhophilin/rhotekin rho-binding repeat 


1 £ ? Kp 1 1 9£R 199 


1223 


IPB000959B 


POLO box duplicated region 


K/CO 1 QQ~ 10 9^1 -9Q 1 


1223 


TTkT* AAAA/"1 J~\ 

IPB000961C 


Protein kinase C-terminal domain 


1 K A Q £ OQp 1 'X 97A IfiR 


1223 


IPB001245A 


Tyrosine kinase catalytic domain 


99 A< 7 1 Q/3 9£ 9£7 107 


1223 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 5.24e-23 325-363 


1223 


IPB001359H 


Synapsin 




1223 


IPB001359H 


Synapsin 


22.58 6.85e-09 777-827 


1223 


IPB001772C 


Kinase associated domain 1 


20.66 7.93e-13 26z-z9z 


1223 


IPB003306E 


WIF domain 


25.51 5.35e-12 379-424 


1223 


IPB003527C 


MAP kinase 


14.70 2.48e-14 259-307 


1223 


PR00109A 


Tyrosine kinase catalytic domain signature I 


12.56 5. 14e-10 237-250 


1223 


PR00109B 


Tyrosine kinase catalytic domain signature 
II 


11.07 2.61e-18 274-292 


1223 


PR00109C 


Tyrosine kinase catalytic domain signature 
III 


11.86 1.87e-10 324-334 


1223 


PR00109D 


Tyrosine kinase catalytic domain signature 
IV 


17.61 7.12e~21 343-365 


1223 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 5.09e-20 388-410 


1223 


PR01217B 


Proline rich extensin signature II 


4.82 6.09e-09 927-943 


1224 


PR00453A 


Von Willebrand factor type A domain 
signature I 


ll. 78 3.l5e-09 50-67 


1225 


IPB000734 


Lipase 


10.25 8.13e-09 164-178 


1227 


IPB001304A 


C-type lectin domain 


17.98 3.48e-10 41-65 


1227 


IPB001304B 


C-type lectin domain 


10.18 5.95e-09 99-111 


1227 


PR01504F 


Pancreatitis-associated protein signature VI 


12.72 6.45e-09 153-167 


1237 


IPB002181A 


Fibrinogen beta and gamma chains C- 
teiminal globular domain 


18.44 2.32e-10 244-260 


1237 


IPB002181B 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


20.16 9.57e-10 271-307 


1237 


IPB002181C 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


15.87 1.00e-08 294-306 


1237 


IPB002181D 


Fibrinogen beta and gamma chains C- 
terminal globular domain 


zy.ib j.i4e-iy jiz-^jz 


1257 


IPB000483 


Leucine rich repeat C-terminal domain 


11 10 1 ^Qt* HQ A91 A77 
ll.io l.Joe-Uy 4ZJ-4J / 


1257 


PR00019B 


Leucine-rich repeat signature II 


11/17 9. ^^*=. 1/1 1 OA 1 17 
1 1 ,4Z o.ooe- 1U IZ4-IO/ 


1257 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.33e-10 316-329 


1263 


IPB001762A 


Disintegrin 


T7 AO C /IC„ 1^7 AIL A AHA 

23.93 6.45e-l/ 434-4/4 


lzoo 


lroKjUL /OZA 


Disintegrin 


9^ 0^ 6 47e-09 416-456 


1263 


IPB002870A 


Reprolysin family propeptide 


12.22 3.3 le-10 63-79 


1263 


IPB002870B 


Reprolysin family propeptide 


24.73 6.56e-14 109-147 


1263 


IPB002870E 


Reprolysin family propeptide 


11.90 2.50e-ll 341-353 


1263 


PR00289A 


Disintegrin signature I 


14.29 2.65e-13 450-469 


1263 


PR00873D 


Echinoidea (sea urchin) metallothionein 
signature IV 


8.25 7.35e-09 608-626 


1264 


IPB002168A 


Lipolytic enzymes 


11.40 5.07e-ll 139-153 
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1264 


IPBOO2I60I5 


: ; — : * "~ 

Lipolytic enzymes 


oa oo c 1 /|~ ia irn oaj) 

3U.oo J.14e-iy 10 /-204 


1 o/r a 
1264 


1PB00216oC 


Lipolytic enzymes 


Q in a nn«. t/c ok Tin 
y. 1 1 4.0Ue-lo zio-ZZy 


1264 


1PB002160L> 


Lipolytic enzymes 


1Q Q< £L 00«. tO 1<CO 101 

ly.oD o. /l)e-lo 309-393 


1264 


lrr{UUz4oyi 


Dipeptidyl peptidase IV, N-terminus 


1 n QQ K OOo AQ oi 1 100 

iu.yy o.zye-uy zn-zzy 


1264 


T1T> A1 A/1 AD 

.rK01040.t> 


Tyrosyl-tRNA synthetase signature II 


1/1 <o n (\c\ex no o/ic o/cn 

14. oy /.uoe-oy 245-zoo 


1266 


TnT>AAAOci A 

IJP.B000753A 


Cluster in 


1Z.ZU o.4je-l o Z-lo 


1266 


tt*ti r\r\r\'~7 c 0 ti 

IPB000753B 


Clusterin 


1 1 A 1- * 1 aa^ An in 01 
11.42 l.U0e-40 jy-oj 


1266 


IPB000753D 


Clusterin 


ir o a 1 nn« /ia i /to on/i 
15.34 1.00e-40 163-204 


1266 


T"n"n Anniro t~> 

IPB000753E 


Clusterin 


13.24 1.00e-40 230-279 


1266 


IPB000753F 


Clusterin 


10 On 1 AA^. /I A T1 A 

17.89 1.00e-40 310-361 


1266 


IPB000753G 


Clusterin 


1 O Ol C\ CC„ O /t T7A OAA 

1 / . /3 9.55e-24 3/2-390 


1267 


IPB000215A 


Serpins 


13.01 9.14e-18 107-130 


1 O/CO 

1267 


IT1DAAAO 1 C Z" 1 

IPB000215C 


Serpins 


1 1 OA T OOn K oi 1 OOC 

13. yO /.ooe-lj 21 1-ZZj 


1 O/TO 

1267 


1PB00021 5D 


Serpins 


15.35 3.74e-17 314-340 


1 O^O 

1267 


IPB000215E 


Serpins 


15.36 6.6oe-16 401-425 


126o 


TDDAAA11 C A 

1PB00021 D A 


Serpins 


13.01 y.i4e-lo 125-14© 


1268 


T15DAAA11 CD 

1PB000215B 


Serpins 


o ot i ooa 1 1 oi o om 
y.o/ l.oZe-11 Zlo-230 


126b 


1PB000215C 


Serpins 


13. y0 7.ooe-15 247-261 


1268 


T"r»r»nnn'"» 1 rT» 

IPB000215D 


Serpins 


15.35 3.74e-17 350-376 


1268 


IPB000215E 


Serpins 


15.36 7.19e-18 437-461 


1273 


IPB001307A 


Rhodanese signatures 


11 AO O/Tyl— 1A 1A OA 

11.98 8.64e-10 10-20 


1273 


IPB001307B 


Rhodanese signatures 


1 £Z AA O OA„ 1/1 /l O £"1 

16.09 3.o9e-14 48-61 


1273 


1PB001307C 


Rhodanese signatures 


•fc 11 1 TO„ 1COO 1A1 

15.11 l.Zoe-15 82-101 


1 273 


TT>T>AA1 O A*7T** 

IPB001307D 


Rhodanese signatures 


1 £ O'V A f\C\~ O/C 1 1 A 11C 

15.o / 4.U0e-26 110-135 


1273 


tnnnAi oaoo 

1PB001307E 


Rhodanese signatures 


11 1 n c i in ion im 
13. iy 5.3oe-10 loO-iyi 


1273 


tt"»t~) r\r\ 1 0 r\>~t r~> 

IPB001307G 


Rhodanese signatures 


11 HI Oil-. 1/COiTI o^i 

1 1. /3 y.3 le-16 251-263 


1274 


1PB000001D 


Krmgle 


11 11 /IOC-. 1/1 1/1T 1/C1 

11.31 4.95e-14 14/-163 


1 T7/I 

12/4 


TDDAAAAA1 Z"" 1 

IPBOOOOOICj 


Kringle 


OO OO A A/Zn 1 1 ICO OOO 

zy.zy 4.4oe-ii z5y-2o/ 


1 T7 /I 

12/4 


TWDAAAAA 1 T T 

1PB000001H 


Kringle 


io O/i o oa„ in ooo inn 

12.24 z.ooe-io zyy-3oy 


1274 


IPB000126A 


Serine proteases, V8 family 


1 1 7C n 1C„ 1 A 1 /IO 1 ^o 

11.75 9.25e-10 147-162 


1 OO/i 

1274 


ynr) AAA1 nr/ 

IPB000177K 


Apple domain 


11 ino iOn 11 1/in 101 
13 . 1 9 2. 1 be- 1 1 149- 1 o 1 


1274 


IPB000177L 


Apple domain 


11.33 9.92e-10 178-216 


1274 


IPB000177N 


Apple domain 


10.17 5.19e-24 289-323 


1274 


IPB000177O 


Apple domain 


14.39 9.00e-16 324-352 


1274 


IPB001254A 


Serine proteases, trypsin family 


9.98 3.37e-16 147-163 


1274 


IPB001254B 


Serine proteases, trypsin family 


15.01 5.50e-16 297-320 


1274 


IPB001254C 


Serine proteases, trypsin family 


16.54 4.86e-20 327-346 


1274 


IPB002093D 


BRCA2 repeat 


8.99 9.77e-09 8-46 


1274 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 3.88e-15 148-163 


1274 


PR00722B 


Chymotrypsin serine protease family (SI) 
signature II 


12.69 3.74e-09 203-217 


1274 


PR00722C 


Chymotrypsin serine protease family (SI) 
signature III 


10.74 1.69e-14 296-308 


1279 


PR00704I 


Calpam cysteine protease (C2) family 
signature IX 


f\ A t~% A 1 A 1 A 1 A C 1 ni 

9.42 4.14e-14 145-173 


1284 


IPB001706 


Ribosomal protein L35 


21.01 1.00e-18 119-157 


1285 


IPB001706 


Ribosomal protein L35 


21.01 1.00e-18 119-157 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 1.43e-ll 399-424 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 2.50e-20 259-284 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 3.25e-19 231-256 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 5.50e-19 343-368 


1287 


IPB000822 


Zinc finger, C2H2 type 


14.67 6.54e- 18 203-228 
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SEQ 
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Results* 


lZo 1 


tooaaaooo 
IrBUUUoZZ 


z,mc ringer, CZriZ type 


1 A fkl Q CAo n qk o/iA 
Ih-.O/ o.3Ue~15 513-54U 


1 007 

12o7 


tod r\ nn o o o 
lrB0U0822 


/-mc linger, C2H2 type 


1/1 /^T o /Co ^ io ooo on 

14.0/ o.oZe-lo Z0/-3IZ 


1 O 

12o7 


irBuU0822 


Zinc linger, C2H2 type 


1 A c~i c\ 1 n^. 00 1 oe 0 aa 
14. 0/ y.lUe-22 1 /3-2UU 


lZo/ 


tooaaaooo 
lrBUUU82Z 


z,inc linger, C2P12 type 


1/1 Q 1 An OO 171 on/C 

14. 0 / y. lUe-ZZ d / l-39o 


1 o oo 
lZo / 


TtJDAA 1 OOC 

IPBUU1275 


DM DNA binding domain 


1 a 10 0 oe^ 10 oen ano 
19. 1 / Z. /3e~lZ 539-390 


lZo / 


TDDAA1 OOe 

IrBUUlZ /3 


DM DNA binding domain 


in n 0 C7„ 11 ooe i~\ a 
19. 1/5.3 /e-11 Z /3-514 


1 O OO 

iZo / 


TDDAA1 OOe 

lrBUUlZ/3 


DM DNA binding domain 


1Q 1*7/1 O./l^ 11 1Q1 T3A 

ly.l/ 4.34e-ll 191-Z5U 


1 O 07 


TDDAA1 OOe 

lrBUUlZ/3 


DM UNA binding domain 


10 ion /cn« 10 O/io 00/c 

iy.i / y.oue-iz Z4/-Zoo 


1 OC7 
IZo / 


TDDAA1 OOe 

lrBUUlZ/3 


DM DNA binding domain 


in IT Q QOc» 10 0.O1 -JOA 

19. 1 / y.o /e-iz 55 1-5 /U 


1 OC7 
IZo / 


IrrJUU ly U9 


T/D A T> Krvv 

JSJ\Ad dox 


l/.j/ 3.55e-ZZ 10-3U 


1 O QO 


"DO f\C\(\A O A 


C2H2-type zinc finger signature I 


O Q/1 O ^Qo IO "3Q< /1AO 

y.94 z.39e-iz 3yo-4uy 


1 OOO 

IZo/ 


DDAAA/I <? A 

rKUUU4oA 


C2H2-type zinc finger signature I 


Q CkA O CCo AQ Q/JA OCQ 

y.y4 z.ooe-uy 54U-535 


1 O OO 
IZO / 


"DO AAA/1 O A 

JrKUUU4oA 


C2H2-type zinc finger signature I 


Q Q/1 Q 77o 1/1 Q/CO OOI 

y.y4 5. / /e-14 500-501 


1 O 07 


ooaaa/io a 

PR0UU48A 


C2H2-type zinc finger signature I 


n A/i e 1 e<-. i /i oo/i ono 
y.94 3.13e-14 Zo4-zy/ 


1 O C7 

1287 


T">r» nnn A O A 

PR00048A 


C2H2-type zinc finger signature I 


n C\A £. z:?^ nn 010 ooe 
9.y4 o.o5e-U9 5ia-5Zd 


1 o 

1287 


rRU004oA 


C2H2-type zinc finger signature I 


n c\A n Tin 1 n oe^r o/rn 
9.94 o.73e-lU 256-269 


1 007 

Izo/ 


DTIAAA/IO A 

PRUU048A 


C2H2-type zinc finger signature I 


n A/i n o.*c«. 10 oaa 010 
9.y4 9.5oe-15 ZUU-Z15 


1 O OO 

IZo / 


"DO AAA/1 OD 

rRUU048B 


C2H2-type zinc finger signature II 


e eo 1 aa« 10 100 1 ao 
3.3Z l.UUe-lZ loo- 19 / 


1287 


oo aaa/1 on 

PR0Q048B 


C2H2-type zinc finger signature II 


e eo 0 ic„ 1 1 10/I oao 
5.52 3.23e-ll 3o4-393 


1 O C7 

1287 


riT) AAA /I OT) 

PR00048B 


C2H2-type zinc finger signature II 


e eo A AA^. A A oe/r O/CC 

5.52 4.U0e-U9 356-365 


1 O OO 

1287 


t-jt-> AAA A OD 

PRU0048B 


C2H2-type zinc finger signature II 


e eo /i O/C^. 1 a O/i/i oeo 

5.52 4.o6e-lU 244-253 


1 O OO 

12o/ 


OO AAA/1 OO 

PKUUU4oB 


C2H2-type zinc finger signature II 


e eo e en« nn oaa oaa 
3. 32 3.3Ue-U9 3UU-3U9 


1 O 0*7 

1287 


OO AAA/! O T"> 

PR00048B 


C2H2-type zinc finger signature II 


c CO O rtA« 10 000 001 

5.5Z /.UUe-lZ Z/Z-2ol 


1 o m 

izyz 


TOO) AAA AO A 

1PBUUUU34C 


Laminin B 


1 0 no 0 0 1 nn e/i oo 
12.9/ /.3 le-U9 34- /Z 


i o no 

izyz 


TOO AAAe/C 1 

1PBUUU561 


Jc/Or-like domain 


/i on 0 no,-* nn e/i /co 
4.o9 o.U/e-U9 34-oZ 


1292 


1PB001774D 


Delta serrate ligand 


1 n 00 e ni « aa oa c c 
19.23 5.91e-U9 2U-66 


1 on /l 

1294 


TT1DAA1 AAOT3 

IPB001007B 


von Willebrand factor, type C repeat 


in no 0 on« AA AO 1 A1 

1U.U3 2.o9e-U9 92-1U1 


i one 
1Z93 


TOO AAAOOO 

lPBUUUo22 


z,inc linger, CZH2 type 


1/1 ^O 1 AAa 10 /IOO /ICO 

14.0/ l.UUe-lo 433-430 


1 IOC 


tdd nnn ooo 
IrBUUUoZZ 


/.mc tmger, CZHZ t^e 


i/i ^0 1 AAo 00 /ine /ion 
14. 0/ l.UUe-ZZ 4U3-43U 


1 ooe 
1Z93 


too nnn ooo 
IPBUUUoZZ 


z,inc linger, CZrlZ type 


1/1 ^O 1 OAa OO 0O1 1A£. 

14. 0/ i.9Ue-ZZ 3Z1-54D 


1 ooe 
1Z93 


tdd AAAOOO 

IPBUUUoZZ 


Zinc linger, CZrlZ type 


i/i ^0 /i aa q ia o^e oon 
14.0/ 4.UUe-lU Z63-Z9U 


1 one 
1Z93 


TOO AAAOOO 

IPBUUUoZZ 


Zmc linger, CZMZ type 


1/1 ^O /1 O/C^. 1A AQCi e 1 /I 

14. 0/ 4.ooe-lU 4o9-3 14 


1 OOe 

izy3 


TOO AAAOOO 

IPBUUUoZZ 


zinc linger, czriz type 


1/1 oe<3 10 000 /ino 
14. 0/ o.Z3e-iy 5 / /-4uz 


1 one 
1Z93 


tooaaaooo 
IPBUUUoZZ 


z,inc tmger, uzriz type 


14.0/ /. /3e-ZU 40 1-400 


1 one 


TOO AAAOOO 

IPBUUUoZZ 


7,„. /^*OTJ"0 Umn 

zanc ringer, CZrlZ type 


1/1 ^O C CA Q 1 a O/IQ OO/I 

14.0/ o.3Ue-iy 549-5/4 


i one 
129 J 


TOOAA1 OOO 

1PBUU1 ZZZ 


trllb zinc ribbon domain 


O/l /^O O /,iC Q AA /tOO /1/CQ 

Z4.03 3.ooe-U9 455-469 


i one 
1293 


tooaai ooe 

1PBU01275 


UM UJN A binding domain 


1A IO O Oe^-v AA OAA O/IO 

19.1 / 5.o3e-U9 3U9-54o 


1 one 

1295 


TT1DAA1 

1PB001275 


DM DNA binding domain 


in 10 a 00^, in 000 00^: 
19.1 / 4. / /e-lu 33 /-5 /6 


i one 
1295 


TTJDAA 1 nc 

1PB0U1275 


DM DNA binding domain 


IO IO e 00«. AA /IOI /1/CA 

19.1/ 3.ZZe-U9 4Z1-46U 


1 o ne 
1295 


TTiO AA 1 OOe 

1PB0U1275 


DM DNA binding domain 


in 10 0 /io.r* iaoao /loo 
19. 1 / o.4Ze-lU 393-432 


1 one 
1293 


tooaai ooe 
1PBUU1Z /3 


DM DNA binding domain 


10 10 0 T7a 1A/1/IA /IOO 

iy.i / y.z/e-iu 44y-4oo 


i o ne 

1295 


TOOAAI AAA 

1PBU01909 ! 


IvKAB box 


IO OO A Cfln OO 1A A A 

17.3/ 9.5Ue-32 10-44 


1 o ne 

1295 


T>r> AAA/1 O A 

PR00048A 


C2H2-type zinc finger signature I 


A C\A 1 AA« 1AOTO OOI 

9.94 l.UUe-lU 318-331 


i one 
1295 


OOAAA/IO A 

PRUUU48A 


C2H2-type zinc finger signature I 


A O/f 1 AArt IO /AO /lie 

9.94 l.UUe-13 4U2-415 


1 one 

1295 


OOAAA/IO A 

PRUUU48 A 


C2H2-type zinc finger signature I 


a A/1 1 /C /I 10 0/1^ oen 
9.94 i.o4e-13 346-339 




rivUi/uT'OA 


C2H2-type zinc finger signature I 


i?.-/*T J. / oc-iz 0 / *+- -?o / 


1295 


PR00048A 


C2H2-type zinc finger signature I 


9.94 5.85e-14 458-471 


1295 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.92e- 14 486-499 


1295 


PR00048A 


C2H2-type zinc finger signature I 


9.94 8.62e-14 430-443 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 1.50e-09 418-427 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.50e-09 474-483 


1295 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.79e-10 334-343 


1295 


PR00048B 


C2H2-type zinc finger signature II | 


5.52 8.50e-09 390-399 
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1296 


IPB000822 


— . — 

Zinc finger, C2H2 type 


1A An 1 AQp 152 ICid-IOQ 


1296 


IPB000822 


Zinc finger, C2H2 type 


1A An 1 0^» lid AAA AAQ 
14. 0/ J.Z~e-14 44H— H-Oy 


1296 


IPB000822 


Zinc linger, C2H2 type 


1 A An 1 1Hcl 1 9. OOA lAK 

14.0/ d . / / e- 1 o zzvj-zh- 3 


1296 


IPB000822 


Zinc linger, C2H2 type 


1 A & O^o 1Q 0A9. 071 


1296 


IPB000822 


Zinc linger, C2H2 type 


\A £H £> Hit* 01 A\Cx AAl 


1296 


IPB000822 


Zinc linger, C2H2 type 


1A 1 C\(\t± OA 01 A 

1m-. o/ /.uue-zu z /o-ju i 


1296 


IPB000822 


Zinc finger, C2H2 type 




1296 


IPB000822 


Zinc finger, C2H2 type 


i a c orio oi 1 oo oio 
14. o/ o.zUe-zj lyz-zi / 


1296 


IPB000822 


Zinc linger, Cztiz type 


1A An s Q/^q 01 ^^o i^n 
14. o / o.ooe-zi jjz-jj / 


1296 


IPB000822 


Zinc finger, C2H2 type 


1 A A1 53 *C A~ 1 Q AO0 AQ7 
14.0/ o.DUe- li? 4 / A-^ty 1 


1296 


IPB000822 


Zmc linger, C2H2 type 


A A An 52 /^0<* 1 52 ^AO ^0^ 

14. o / o.oze'io jUv jaj 


1296 


TTVT"» AAAOIO 

IPB000822 


Zinc ringer, CzrLz type 


1/1 ^O Q 1 01 ^52^ 

14. o / y. loe-zi jou-joj 


1296 


TT»TIAA1 OTC 

IPB0G1275 


DM DNA binding domain 


in 17 1 07- inAS52«^07 


1296 


IPB001275 


DM DNA binding domain 


iy.1/ zj/e-uy J/0-41J 


1296 


1PB0012/5 


DM DNA binding domain 


10170 Q^p-1 o o^vi-ifn 


1296 


TTIHAA 1 A '"7 .C 

IPB001275 


DM DNA binding domain 


1Q 17 1 QAe* 00 0^^-07^ 

iy.1/ J.7OC-U7 zjo-z/j 


1296 


TTinAAl one 

IPB001275 


DM DMA binding domain 


in 17 < i 00 4^0-471 


1296 


IPB001275 


DM DNA binding domain 


10 17 < ^52i=»-10 0O52_047 


1296 


IPB001275 


DM DNA binding domain 


iq 17 < ca p i n AC\A Adl 


1296 


IPB001275 


DM DNA binding domain 


1Q 17 c 7£e» 1 1 AAC\ AQQ 

iy. i / j. /oe- 1 1 40u-*+yy 


1296 


IPB001275 


DM DNA binding domain 


in 17 c nn 0 no i on 0 1 Q 

iy. i / D.yue-uy icu-ziy 


1296 


IPB001275 


DM DNA binding domain 


1Q H £ 7Qq 1 1 Q/152 1527 

iy. i/o. /ye-i 1 j^ooo / 


1296 


IPB001275 


DM DNA binding domain 


1 O 1 7 52 IHp 1 0 lOO ^^0 


1296 


TTHT1AA1 AAA 

IPB001909 


KRAB box 


17 in ^o^ in a a ^o 


1296 


PR00048A 


C2H2-type zinc finger signature I 


Q OA 1 Ifio OO 017 o^o 

y.y4 i.ooe-uy zi /-zou 


1296 


PR00048A 


C2H2-type zinc finger signature I 


O QA O n^o 10 0A^ 0^52 

y.y4 z.uoe~iz z4D-zdo 


1296 


PR00048A 


C2H2-type zinc finger signature I 


O QA O O/^o io im 11 A 

y.y4 z.uoe-iz jui-014 


1296 


PR00048A 


C2H2-type zinc ringer signature I 


Q QA 1 Q^o 10 071 052^ 

y.y4 j.ooe-iu z / j-zoo 


1296 


PR00048A 


C2H2-type zinc finger signature I 


QO/I /I 71o 19 AH 40^ ^ 

y.y4 4. / le-iz 4i j-4zo 


1296 


PR00048A 


C2H2-type zinc finger signature I 


Q QA A QAa 1 1 10Q 1AO 

y.y4 4.ooe-io jzy-o4Z 


1296 


PR00048A 


C2H2-type zinc finger signature I 


y.y4 j.3Ue- 1 j Doj-oyo 


1296 


PR00048A 


C2H2-type zinc finger signature I 


Q Q/l ^ ^0<a 1 A^Q 4520 

y,y4 D.Due-iJ 4oy- i foz 


1296 


TIT* AAA O A 

PR00048A 


C2H2-type zinc finger signature I 


o o/i < oia in AQ7 'Jin 
y.y4 o.yie-iu 4y/-3iu 


1296 


T^TI AAA /I fl A 

PR00048A 


C2H2-type zinc finger signature I 


Q QA A 1 Ao 111 52Q 000 

y.y4 o.i4e-iJ ioy-zuz 


1296 


T*T* AAA A CI A 

PR00048A 


C2H2-type zinc ringer signature I 


Q OA A 14<=» 1^ ^o*; 
y.y4 o.i4e-u jzj-jjc 


1296 


TVT» AAA A Ci A 

PR00048A 


C2H2-type zinc finger signature I 


Q QA ^ 520*= 10 1^7 ^70 

y.y4 o.oze- iz jj i -d ikj 


1296 


r*T» AAA A on 

PR00048B 


C2H2-type zinc finger signature II 


cci 1 nn P in 0^1-070 
3.Z3Z i .vjue-iu zoi-z /U 


1296 


T»T"» AAAyl OD 

PR00048B 


C2H2~type zinc finger signature II 


<: <9 i nn P 10 0520-OQ52 

J.jZ 1 .UUC-lu Z07 Z70 


1296 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.29e-10 205-214 


1296 


PR00048B 


C2H2-type zinc finger signature II 


< ^O A $2^o in a:ii ^77 


1296 


PR00048B 


C2H2-type zinc finger signature II 


^ ^O < ^Ho 10 117 10/^ 


1296 


PR00048B 


C2H2-type zmc finger signature II 


COZ D.DUe-iU 4oD-4y4 


1296 


PR00048B 


C2H2-type zinc finger signature II 


J.3Z 3.DUe-l 1 J4D-OD4 


1296 


PR00048B 


C2H2-type zinc finger signature II 


c co nn^ in oio O/io 
5.52 o./ye-lu 23J-242 


1296 


PR00048B 


/-SA T TO 2. * XT * j TT 

C2H2-type zinc ringer signature II 


c co *7 , 7Cn i i /ion /no 
j.jz / . / J e- 1 1 4zy-4oo 


1299 


TriO AAA 1 1 A 


Neutral zinc metallopeptidases, zinc- 
binding region 


5 1 00p-1 1 1 R52_1Q52 


1299 


IPB001818B 


Matrixin 


26.48 4.63e-13 87-128 


1299 


IPB001818C 


Matrixin 


24.38 4.54e-09 134-179 


1299 


IPB00181SD 


Matrixin 


14.91 3.36e-25 182-213 


1299 


IPB001818E 


Matrixin 


8.86 4.26e-10 222-235 


1299 


IPB002870E 


Reprolysin family propeptide 


11.90 2.00e-09 190-202 


1299 


PR00138A 


Matrixin signature I 


12.54 9.25e-10 37-50 
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1 900 


ppnm isr 

JTISAJU 1 joD 


— . — : : ~ 

Matrixm signature II 


14.84 4.51e-ll 106-121 


19QQ 
Lz,yy 


JrJtvUU 1 joU 


Matrixin signature III 


20.07 8.86e-ll 132-160 




ppnni ion 


Matrixin signature IV 


14.57 1.86e-21 188-213 


1 900 


risJjkJ L joH 


Matrixin signature V 


7.10 9.77e-ll 222-235 


1299 


PR00480B 


Astacin family signature II 


14.35 1.14e-09 183-201 




Ir'oUUjUUors 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.09e-ll 150-187 


i 


TPRHH9 <C^7Qr^ 
LrsjKjyJZD fy\^ 


Domain of unknown function DUF25 


13.50 3.89e-28 75-102 


lJUJ 




Domain of unknown function DUF25 


ll. 55 7.92e-l2 128-139 




TPROn9<;7QTH 


Domain of unknown function DUF25 


12.14 9.31e-13 149-159 


1 ^07 

LJ\J I 


TPRfirni^n 


Kazal-type serine protease inhibitor family 


31.78 4.09e-16 28-68 


1 ^07 


ppnn9QnT? 

i J\.UVJZ!7U.D 


Kazal-type serine protease inhibitor 
signature n 


16.63 1.26e-10 39-50 


1308 


TPR0014< £ 59fr 


Hepadnaviral P protein C-terminal domain 


17.22 3.38e-09 52-72 


LJ KJZr 


TPR001 971 


Mammalian defensin 


19.97 3.42e-21 139-167 


1312 


TPR0 A 9041 A 

If DUU^W 1 A 


GTP-binding nuclear protein Ran family 


11.12 9.3oe-12 63-107 


1312 


PR0ft44Q A 
r L\\j\JH i tyj\ 


Transforming protein P2 1 ras signature I 


12.48 2.57e-16 63-84 


1312 


PR00440R 


Transforming protein P2 1 ras signature II 


14.39 4.81e-09 86-102 


1312 


PR 00440P 


Transforming protein P2 1 ras signature III 


15.70 8.26e-12 103-125 


1312 


PR 00440"n 


Transforming protein P21 ras signature IV 


10.27 6.50e-13 167-180 


1312 


PR 0044QP 


Transforming protein P21 ras signature V 


13.39 2.04e-13 201-223 


1314 


TPR000899 


Ainc linger, i^zxiz type 


14.67 1.69e-18 165-190 


1^14 

1J 1*T 


TPR0nnS99 

11TX5UUU0XZ 


Ziinc linger, i^-Zriz type 


14.67 1.75e-19 305-330 


1314 


TPR0DnR99 


zznc imger, L/Zriz type 


14.67 2.93e-17 277-302 


1314 


TPRnoos99 


Ziinc imger, LZnz type 


14.67 3.08e-18 193-218 


1314 


TPR0nf)ft99 


zjinc imger, v^zriz type 


14.67 6.40e-16 221-246 


1314 


TPRn00S99 


zanc imger, L-Zriz type 


14.67 o.63e-14 333-358 


1314 


TPR00nS99 


zinc linger, LZnz type 


14.67 9.31e-18 249-274 


1^14 

LJ 1*T 


TPROfH 975 

XrDUUXZ / J 


DM DNA binding domain 


19.17 2.88e-12 321-360 


1J it 


TPRD01 975 
lrDuwiz / j 


DM DNA binding domain 


19.17 3.43e-10 209-248 


1314 


TPR0D1 975 

Lx DUU 1Z / J 


DM DIVA binding domain 


19.17 4.16e-10 265-304 


1314 


TPR001975 

J.JT JJUU 1 Z# / J 


i^ivi uisi\ Dinumg uomam 


in 1 T A O /C „ 1 1 T>'7 on/ 

19.1/ 4.o6e-ll 237-276 


1314 


TPR001975 

LlLj\J\J LZ, 1 J 


DM DNA binding domain 


19.17 5.26e-10 293-332 


1314 


TPR001 975 

Lr DVKJ LZ. 1 J 


uivi lain a Dinamg domain 


19.17 6.66e-ll 181-220 


1^14 

LJ 1*T 


PR 000.48 A 


C2H2-t3^pe zinc finger signature I 


9.94 2.23e-10 302-315 


1314 


PP0004S A 


C2H2-type zinc finger signature I 


9.94 2.29e-13 330-343 


1314 


PR 000452 A 


C2H2-type zinc finger signature I 


9.94 2.93e-l3 218-231 


1114 

LJ it 


PP 000452 A 


C2H2-type zinc finger signature I 


f\ $f\ A A O y^* -I/Art 

9.94 4.86e-13 190-203 


1114 

LJ It 


PR 0004 R A 


C2H2-type zinc finger signature I 


9.94 5.26e-ll 274-287 


1314 


PR 00048 A 


C2H2-type zinc finger signature I 


9.94 0.14e-13 162-175 


1114 

LJ 1*+ 


PR 00045? A 


C2H2-type zinc finger signature I 


9.94 8.07e-13 358-371 


1114 

LJ l*r 


PR0004S A 


C2H2-type zinc finger signature I 


9.94 8.S8e-09 246-259 


1114 

LJ 14 


PR000453R 


C2H2-type zinc finger signature II 


5.52 2.29e-l0 178-187 


1314 


PR0004SR 


^znz-type zinc linger signature 11 


d.dz 3.0 /e-10 318-327 


1115 

LJLJ 


TPR00000^ 


Vertebrate metallothionein, family 1 


IT /II 1 O 1 /-\ n -i/-»/^ 

13.41 1.58e-10 78-123 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13 41 1 Qze-OQ ^1 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.06e-10 63-108 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.65e-09 28-73 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 2.93e-09 5-50 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.07e-12 72-117 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.57e-09 82-127 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 4.86e-09 83-128 


1315 | 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.06e-10 56-101 
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1315 


IPBOOOOOo 


r 7} — TP : — ? — r i — i 

Vertebrate metallothionein, family 1 


1 1 41 5 06p-1 0 77-1 99 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


1 J.ti J.IYJC - 11 fO'/l 


1315 


tt»t*> r\t\r\c\c\ £1 

IPB000006 


Vertebrate metallothionein, family 1 


ii^i < 652p-OQ 16-81 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


n A1 5 Q fie- 09 48-01 


1315 


TTinAAAAA/T 

IPBOOOOOo 


Vertebrate metallothionein, family 1 


1141 6 ^0p-1 1 87-1^9 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 41 6 I^p-09 95-70 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 41 6 Q7p-0Q 01-118 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


H Al 7 48f> 10 15 60 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


1141 £ 90p-1 1 101-146 

1J).*+1 0.^uC"ll 1U1 ItU 


1315 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11418 44p-00 71-1 1 8 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 8. 81e-09 61-106 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


11 41 Q QAf* 1H 11 78 
1.3.41 o.o4c-lU 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 9.38e-12 34-79 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


11 /ii o coo no Q1 116 
13.41 y.oze-uy yi-130 


1315 


IPB000006 


Vertebrate metallothionein, family 1 


ii /ii a Q/io in 1 fii 1 An 
13.41 y.yue-iu iuz-i4/ 


1315 


IPB000359B 


Cystine-knot domain 


1 n o^r o /i c=» 1 a 1 in 1/11 

ly.zo 3.4oe-iu izy-i4/ 


1315 


IPB000359B 


Cystine-knot domain 


ly.zo o.34e-uy yu-ius 


1315 


IPB000437 


Prokaryotic membrane lipoprotein lipid 
attachment site 


ia '7T2c» in 1A 11 


1315 


IPB000437 


Prokaryotic membrane lipoprotein lipid 
attachment site 


11 oi nil,, in on oi 

13. o3 y.3ie-iu zU-31 


1315 


IPB000726B 


Glycoside hydrolase family 19 


< lO ^ 1 1 Q 1 53 


1315 


IPB000726B 


Glycoside hydrolase iamily 19 


^ 1Q *7 Qido 11 1 52 7Q 

3.3y /.o4e-ll lo-Zo i 


1315 


IPB001007C 


von Willebrand factor, type C repeat 


lO 53Q 4 77/= HQ 74 OS 

zu.&y 4.z /e-uy /H-yj 


1315 


TlVnftrtl 1 /'AT/' 

IPB001169K 


Integrin beta, C-terminus 


07 4s i nn^ in in ^9 


1315 


IPB001 169K 


Integrin beta, C-terminus 


974S9 91p-11 R 1 _ 1 91 


1315 


TT1T)AA1 1 £LC\J/~ 

IPB0011o9K 


Integrin beta, C-terminus 


97 4S 4 04f»-00 1 S-60 


1315 


IPB001169K 


Integrin beta, C-terminus 


77 4S 4 77p 1 1 1-4S 
Z/.HO *f . / /C"lJ J"H-J 


1315 


IPB001169K 


Integrin beta, C-terminus 


77 4S7 11i= 11 11-SS 


1315 


IPB001271 


Mammalian defensin 


1 Q 07 1 00*=> 10 1 9-40 


1315. 


IPB001762A 


Disintegrin 


■71 Q1 7 94#= OQ 7S-1 1 S 


1315 


IPB001762A 


Disintegrin 


71 01 7 41*=» 00 05-1 IS 


1315 


IPB001762A 


Disintegrin 


91 01 5 41f»-0Q 1 14-1 54 


1315 


IPB002174A 


Furin-like cysteine rich region 


10 51 1 7Qp-1 1 5-16 


1315 


TTITIAA1 1 *7 A A 

IPB002174A 


Furin-like cysteine rich region 


10 51 1 OOr-00 56-S7 


1315 


TT>T"> nno 1 *7 A A 

IPB002174A 


Furin-like cysteine rich region 


10 51 1 10p-10 19-4^ 


1315 


TTITlArtl 1 T A A 

IPB002174A 


Furin-like cysteine rich region 


lO 51 Q 16p 19 9-11 


1315 


rnnAAi/lAjl A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 1 OOp-OS 1 5-48 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


19 441 11f»-11 199-155 
IZ.Ht- 1. lie- 11 1ZZ"1JJ 


1315 


TTITIAA1 A C\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


19 44 1 67p-19 68-101 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 1 79p 14 74 1 07 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


11 44 1 75** 1 1 4Q 527 
J.Z.44 i./je-ii ^ty-oZ 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


1 O 44 1 OOf» 19 11 44 
1Z.44 Z,UUe-iZ 11-^ 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


1 1 44 1 (Y7e* 1 1 64 07 

iz. 44 z.u /e- 1 1 04-y / 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


11 44 1 11*=» 11 54 527 

lz.44 z.joe-iz / 


ins 


TPRnA14Q4 A 


T^prntin lii <yh Qiilfivr R9 rvrntpin 


12.44 2.85e-17 78-111 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.42e-09 98-131 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.52e-09 79-112 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.00e-15 117-150 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.60e-10 21-54 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.65e-09 102-135 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.93e-ll 5-38 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.00e-15 83-116 
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1315 


tt»T"> A AVO A f\ A A 

IPB002494A 


: — rr-; 77- — : 

Keratin, high sulfur B2 protein 


19 44 6 flip- 10 61-06 


1315 


TT*n /\/\0 A O A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 6 11p-00 6-10 


1315 


TT»T"»/\ A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 6 S6<=»-1 7 71-1 06 i 

IZ.^H' O.jOp-1 / / j-lUD 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


1 9 Ad 6 11 >- 111 6-40 
1Z.44 O. / fC-LD 10-4:7 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


19 AA 7 94p-1 0 1 1 8-1 SI 

1.Z.44 /.ZHC"1U llO-Ul 


1315 


IPB002494A 


Keratin, high sulfur B2 protein 


1 9 AA 1 f\l(*- 1 9 1 1 1-14.6 

1Z.44 /.O/C-l-6 1 lJ-l*rO 


1315 


TT»T»AA'^ A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 8 OOp- 10 119-1 4S 

lZ.*rH O.uUC-iy 1 lZ-lH-J 


1315 


TTiT»AAn A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 AA 8 1 6f*-1 0 SQ-09 

1Z.44 O.10C-1U 


1315 


1PB 002494 A 


Keratin, high sulfur B2 protein 


19 Ad 8 S6p-14 Q1-196 


1315 


TT5T5AAO/1 ClA A 


Keratin, high sulfur B2 protein 


19 44 8 57e-1Q 88-191 


1315 


TTVT»AAO AC\A A 

IPB 002494 A 


Keratin, high sulfur B2 protein 


19 4.4 R 6Sp-0Q 6Q-109. 


1315 


TT*» T*» f\f\*^ Al~\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 8 67p- 1 0 84- 1 1 7 


1315 


TTtnAAO A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


19 44 0 fl^p-flQ 80-199 


1315 


TTlTtAAl A r\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


1 9 44 0 71 p-00 1 0-41 


1315 


inn /"\rvo A r\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


19 44 0 80p-1 0 1 01-1 16 


1315 


m ti r\ n o Af\ A T~i 

IPB002494B 


Keratin, high sulfur B2 protein 


10 S8 1 97p-10 141-1SS 


1315 


1PB002494B 


Keratin, high sulfur B2 protein 


10 S8 1 09e-00 107-191 


1315 


TTVTJ A AO A A A "D 

1PBUU2494B 


iveraxm, nign sunur dz proxem 


10 SR 4 40e-10 111-145 


1315 


TTVD A AO A A A TD 

lrB002494rs 


rLeratm, nign sunur dz protein 


10 S8 0 1 Oe-19 1 02-1 1 6 


1315 


TT1T3AAO A A A TD 

1PB002494B 


Keratin, high sulfur B2 protein 


10 S8 0 10p-19 09-106 


1315 


mriAAi A A A /~* 

1PB002494C 


Keratin, high sulfur B2 protein 


14 46 1 00p-08 10-69 


1315 


TTiTl A AO A A A Z" 1 

IPB002494C 


Keratin, high sulfur B2 protein 


14 46 1 S9p-1 S 86-190 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 1 S0p-00 14-S7 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 1 7Qp- 1 0 8-S 1 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 9 1 69-1 OS 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 9 66p 11 6 1 - 1 04 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 1 IQf* 1 7 1 01 -144 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 1 46p-1 6 1 1 0-1 SI 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 1 8flp- 1 0 80- 191 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 d 71 p_9S 81-194 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 4 76p- 1 0 67- 110 


1315 


TTin A AO A A A 

IPB0Q2494L, 


Keratin, high sulfur B2 protein 


14 46 S 1 1p-1 0 1 1 1 -1 S4 


1315 


Tfin A AO A A A /~* 

1PB002494C 


Keratin, high sulfur B2 protein 


14 46 S 1 1p-1 0 S9-0S 


1315 


TT»n A AO A A A /"* 

1PB002494C 


Keratin, high sulfur B2 protein 


14 46 S 84e-1 8 76-1 1 0 


1315 


TT1DAAO A A A 

lPB0024y4C 


Keratin, high sulfur B2 protein 


14 46 6 49p-1 0 S 1 -04 


1315 


TTin A AO A A A 

IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 7 00p- 1 S 1 OS- 1 48 


1315 


ttj n A AO /f A /I Z" 1 

IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 7 4.7p-1 0 RS-1 98 


1315 


mri a ao At\ A /~i 

IPB 0 02494 C 


Keratin, high sulfur B2 protein 


14 46 7 69p-1 S 06-1 10 

I4.*+U /.UZrC-U 7U 1J7 


1315 


mn AAA A f~\ A /~\ 

IPB002494C 


Keratin, high sulfur B2 protein 


1 d 46 7 64p- 1 6 66- 1 00 


1315 


TTIDAAO A A A /~t 

IPB002494C 


Keratin, high sulfur B2 protein 


14 46 7 08p-00 0S-1 18 
I'+.H'U / ,7oc "\jy y+)~ ljo 


1315 


mn A AO A A A 

IPB002494C 


Keratin, high sulfur B2 protein 


1 4 46 8 07p-94 01 - 1 14 


1315 


TT4TIAAO A A A 

IPB002494C 


Keratin, high sulfur B2 protein 


14 46 8 1 1p-1 9 1 06-140 


1315 


TT»T1 A AO A A A 

IPB002494C 


Keratin, high sulfur B2 protein 


14 46 8 44p-91 71-1 14 

I4.*tU O.ttC'Z.1 / 11 It 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 8 4Sp-1 1 79-1 1 S 
lt«40 O.tJC-ll /Z-l 1J 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14 46 8 7flp-14 00-1 11 


1 11 s 


tptioo94Q4P 


TTprivHr* Vucrh ^ulfivr R9 tvrn.tp'i'n 


14.46 9.00e-ll 46-89 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.30e-14 100-143 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.65e-10 75-118 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.82e-ll 77-120 


1315 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 9.S3e-09 39-82 


1315 


IPB003571B 


Snake toxin 


18.08 1.20e-09 22-45 


1315 


IPB003854B 


Gibberellin regulated protein 


19.26 9.5 le-10 45-83 


1315 


PR00858B 


Crustacean metallothionein signature II 


5.93 3.39e-09 49-67 
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13 ID 


"DT> AAQCQD 

rKAJUoJoB 


— — — — ; ; — 

Crustacean metallothionein signature II 


C AO O OO,-, AO 1 C 

j.y3 /.zze-uy 0-33 


1-51 r 
13 IJ 


"D"D aaqo/1 r~\ 
rKUUo /4C 


Fungi-IV metallothionein signature III 


A OO 1 OC« AO A Q 

4.3 / 3. /De-Uy 4o-6z 


1 3 ID 


JrKUUo /0B 


Nematode metallothionein signature II 


1 ^ A1 e» AO QG 1 AO 

/.00 D.uie-uy oy-iuz 


1 0 1 
13 10 


TT>"D AAAAA/C 

IrBUUUUUO 


Vertebrate metallothionein, family 1 


1 O A 1 1 IQo 1 A C'? 1 AO 

13.41 i.zye-iu j /-iuz 


13 10 


IrrJUUUUUO 


Vertebrate metallothionein, family 1 


1 O A 1 A 1 O ^ AA 17 OO 

13.41 4. ize-uy 3 /-oz 


Id 10 


IrBUUUUUO 


Vertebrate metallothionein, family 1 


IO /II C A 1 a AA co ao 

13.41 3.4ie-uy jz-y/ 


1310 


"TDDAA AAAiC 

lrBUUUUU6 


Vertebrate metallothionein, family 1 


1 o /j 1 c cr\— AO O O OO 

13.41 5.5ye-uy 3o-o3 


1316 


T"T*D AAAAA/C 

IPB000006 


Vertebrate metallothionein, family 1 


1 O A 1 C /TO„ AA £ZO 1 1 O 

13.41 5. 68e-U9 68-113 


1 O 1 /C 

13 lo 


TT>D AAAAAj£ 

IrBUUUUUO 


Vertebrate metallothionein, family 1 


1-1 At c OA~ 11 An AO 

13.41 D./Ue-ll 4/-yz 


1310 


TT>DAAAAA/C 

lrBUUUUU6 


Vertebrate metallothionein, family 1 


/t 1 f 1 A OO IOO 

13.41 5./4e-lU o/~13z 


1 1 1 /c 
13 10 


TDD AAAAA/C 

IrBUUUUUO 


Vertebrate metallothionein, family 1 


11 /f 1 c AA 1 0 /co 

13.41 J.o/e-Uy Io-03 


1 1 1 /C 

1310 


I"DD AAAAA/C 

IrBUUUUUO 


Vertebrate metallothionein, family 1 


13.41 O.zUe-il 00-133 


1310 


TTJDA AAAA/T 

IrrJUUUUUO 


Vertebrate metallothionein, family 1 


1 1 A 1 £Z £Zf\„ f\C\ 1 O /CO 

13.41 6.6Ue-U9 17-62 


13 10 


TTJT5 AAAAA/C 

lrBUUUUU6 


Vertebrate metallothionein, family 1 


10 /II O O C ^ AA OO £iO 

13.41 o.35e-U9 23-68 


1310 


rDDAHAAA/C 

IrrJUUUUUO 


Vertebrate metallothionein, family 1 


IO. /I 1 O AA TO IOI 

13.41 o.3De-Uy /o-lz3 


1310 


TTJD AAAAA^ 

IrrJUUUUUO 


Vertebrate metallothionein, family 1 


IO /I1 O A Cn 1A AO 1 07 

13.41 o.4je-lU 9z- 13 / 


1 O 1 

13 lo 


TDD A AAAA/T 

1FBUUUU06 


Vertebrate metallothionein, family 1 


1 O /i 1 O HA„ 1 1 OO ^O 

13.41 o.7Ue-ll 22-67 


1310 


TDD AAAAA/C 

IrrJUUUUUO 


Vertebrate metallothionein, family 1 


IO /! 1 O AA„ AA ^70 

13.41 o.yye-uy 33- /o 


1310 


TDD A AAA A£ 

lrr$UUUUU6 


Vertebrate metallothionein, family 1 


IO /I 1 A K„ AA A1 OO 

13.41 9.45e-U9 43-bo 


1310 


i t-» t-» nnm cad 

LPB0U0359B 


Cystine-knot domain 


19.26 l.UUe-14 9U-1U8 


1310 


lrBUUU359B 


Cystine-knot domain 


1A O^T 1 tZ/Z*. AA /I C ^TO 

19.26 1.56e-U9 45-63 


1316 


IPB000359B 


Cystine-knot domain 


19.26 2.97e-09 80-98 


1316 


1PB000359B 


Cystine-knot domain 


19.26 5.78e-09 120-138 


1310 


1PB000359B 


Cystine-knot domain 


19.26 6.25e-12 20-38 


1310 


lrBUUU35yB 


Cystine-knot domain 


19.26 8.31e-09 55-73 


1316 


TDDO/IAO f AD 

1PB000359B 


Cystine-knot domain 


19.26 9.16e-09 25-43 


1 O 1 /T 

1316 


1PB000359B 


Cystme-knot domain 


1 r\ o/T rv 1 /T „ rvD Ac 110 

19.26 9. 16e-09 95-1 13 


1316 


TnDAAATim 

1PB00U726B 


Glycoside hydrolase family 1 9 


C OA O /I „ AA01 Ol 

5.39 8.46e-09 21-31 


1310 


TTJDAAAA^At? 

1PBUUU9/Ur 


Developmental signaling protein, Wnt-1 
family 


OO A1 C O /I „ AA CA 1 AO" 

23 A3 5.94e-09 59-107 


1316 


1PBUU1271 


Mammalian defensin 


1A AO 1 0 c 1 r» OC CO 

19.97 1.35e-10 25-53 


13 10 


TDD A A 1 001 

IrBUUlz/1 


Mammalian defensin 


1A AO 1 OO^ 1A A/1 100 

iy.y/ l.ooe-iu y4-izz 


13 10 


TDDAA1 OOI 

IrBUUlz/1 


Mammalian defensin 


1AAOOA/l^ 1AOC 1AO 

iy.y / z.y4e-iu /d-iu3 


13 10 


TDD AA 1 OOI 

IrBUUlz/1 


Mammalian defensin 


1A AO A ATn AA CA OO 

iy.y / 4.y3e-uy du-/o 


1310 


TDDAA 1 OO 1 

IrBUUlz/i 


Mammalian defensin 


1A AO A A0 0 AA AC 1 OO 

iy.y / 4.y3e-uy y5-iz3 


1 o 1 ^ 
13 10 


TDDAA1 T7 1 

IrBUUiz/l 


Mammalian defensin 


1 A AO C O/l-a 1A O/l CO 

iy.y / 5.Z4e-lU z4-5z 


1 O 1 /C 

1310 


TDDAA 1 1T1 

1PBUU1271 


Mammalian defensin 


1A AO C OC^. AA OA CO 

19.97 5.25e-U9 3U-58 


13 10 


TDDAA 1 OOI 

1PBUU1271 


Mammalian defensin 


1 A AO C CO« AA OC 1 1 O 

19.97 5.5oe-Uy 85-113 


101/ 
1316 


TDDAA 1 T71 

1PBUU1271 


Mammalian defensin 


1 A AO O lOn 1AOA 1 no 

iy.y/ /. ioe-iu ou-iuo 


1316 


TDDAA1 T71 

1PBUU1271 


Mammalian defensin 


1 A AO O OA„ AA /CC AO 

19.97 /.22e-U9 65-93 


1316 


TDDAA 1 n 1 

1PB001271 


Mammalian defensin 


1A AO O OC^ 1A A A /CO 

19.97 /.35e-lU 4U-68 


1 O 1 /T 

1316 


TDDA/II T7 1 

1PBUU1271 


Mammalian defensin 


1 f\ no O A/C^. 1 A OA 1 1 O 

19.97 o.U6e-10 89-117 


1316 


TT> T"» AA1 1 

IPB001271 


Mammalian defensin 


1 r\ do o od^« nA oc /fo 

19.97 8.20e-09 35-63 


111/; 

1316 


TDT5AA1 ni 

1PB001271 


Mammalian defensin 


-ir\ r^<-T o t 1 „ 1 1 i^- 

19.97 8.31e-ll 15-43 


1316 


TT»r>AA1 T71 

1PB001271 


Mammalian defensin 


1f\ OO O 0£T~ AA O yl 1 lO 

19.97 8.36e-09 84-112 


13 lO 


TDDAA 1 OOI 

IrBUUIZ/ 1 


Mammalian defensin 


1 o on $ <Qt* 1 n oq /ro 
iy.y/ o.Dye-iu 3y-o / 


1316 


IPB001271 


Mammalian defensin 


19.97 8.85e-09 64-92 


1316 


IPB001271 


Mammalian defensin 


19.97 8.94e-10 49-77 


1316 


IPB001271 J 


Mammalian defensin 


19.97 9.35e-09 74-102 


1316 


IPB001271 


Mammalian defensin 


19.97 9.67e-09 34-62 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 1.97e-09 94-127 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 2.23e-18 53-86 


1316 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 2.85e-13 108-141 
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Description 


Results* 


ID LO 


TPT3AA9/IQ/1 A 
lr Jt5 0 OZ4y 4 A 


■ — — ; ~~~ : 

Keratin, high sulfur B2 protein 


1Z.44 Z.ooe-iy 4o-ol 


1 11 £ 
ID LO 


ir J3 u uz4y 4 1\ 


Keratin, high sulfur B2 protein 


10 A A o 0/t«. AO AC\ Ol 

iz.44 z.y4e-uy 4y-oZ 


1 11 £ 

1 D LO 


TPT3 AAl/lQ^d A 
lr.fcS00Z4y4A 


Keratin, high sulfur B2 protein 


1Z.44 j.jje-lZ D-DO 


1 1 1 £ 
LD LO 


Irl3UUZ4y4A 


Keratin, high sulfur B2 protein 


10 AA 1 IQo 1^ 11 A£l 

1Z.44 JJoe-lD 1 o-4o 


1 1 1 £ 
LD LO 


TPT3AA9/1Q/1 A 
Ir D U UZ4y 4-rV 


Keratin, high sulfur B2 protein 


lOAA 1 A A a. 10 1Q ^1 

1Z.44 j.44e-lZ lo-Dl 


1 11 £ 
LD LO 


lr.t>UUZ4y4A 


Keratin, high sulfur B2 protein 


1 O A A 1 ^7q 1 O Q A 1 

1Z.44 j.o/e-lZ 0-41 


1 1 1 £ 


1 r xi U UZ 4 y 4/\ 


Keratin, high sulfur B2 protein 


10 A A 1 T\ o. 10 Q1 1 O/C 

iz.44 d. I le-iy yj- izo 


1 11 £ 
LD LO 


tptiaa9AQ/1 a 

lr rs U UZ4y 4 A 


Keratin, high sulfur B2 protein 


1 0 AA A >10a AO c a c~J 

LZAQ 4.4oe-Uy D4-0 / 


1116 
ID LV 


TPRA09ilQJ.A 

1 Jr £5 U UZ^t-y Hf\. 


iveraiin, nign sunur dz protein 


1 9 AA A 6Q** 1A11Q 1<1 
IZ.44 4.0oe-lU 11o-1j1 


1116 




jveraun, nign sunur r>z proiem 


1 9 A A A 91 ex 17 OQ ^1 
IZ.44 4. / ie-1 / Zo-0 1 


i n 6 

IjIO 




Keratin, high sulfur B2 protein 


19/1/1/1 COo 1 A 1A K"7 
LZ.H-4 4.oye-lU Z4-D / 


1116 
LD LO 


TPRAA9j1QJ. A 
lr r>UUZ4y4/Y 


Keratin, high sulfur B2 protein 


in AA A C7cx AO f^A Ol 

iz.44 4.y /e-uy o4-y / 


1116 


ir rJUUZ4y4.A. 


Keratin, high sulfur B2 protein 


in a A A QQo 1 a no /zn 

LZA*t 4.yye-iu zy-oz 


1 i 1 6 


tpt3aa94G/i a 

lrr>00Z4y4A 


Keratin, high sulfur B2 protein 


lO AA *Z f\A*> 11 A1 *7/C 

IZ.44 D.U4e-13 4o- /O 


in*; 

LD LO 


TPT3AAO/1Q/1 A 
lr\DO0Z4y4A 


Keratin, high sulfur B2 protein 


in AA G 1 An 1 H i o n 1 


1116 
1 D LO 


TPT3AA9/tQ/l A 
lr.DUOZ4y4J'\ 


Keratin, high sulfur B2 protein 


m aa c o/i« nn cn no 

iz.44 j.o4e-uy Dy-yz 


1 1 1 6 


tpr aao/Iqa a 

lr\t5UUZ4y4A 


Keratin, high sulfur B2 protein 


IZ.44 j.yze-14 oo-lZl 


1116 
LD LO 


TPT3AAO/tQ/l A 
Lr r>UUZ4y4/\ 


Keratin, high sulfur B2 protein 


n yi/i r /To ^ 1 c oo 11/: 
IZ.44 o.oJe-lj oi-ilo 


1116 


TPT2 AA9/1 OA A 
lrr300Z4y4A 


Keratin, high sulfur B2 protein 


in aa £. nn ^ 10 tzo 1 ai 
Iz.44 o.77e-li bo- 101 


1 1 1 £ 
1 j 10 


TPT3AAO/1Q/1 A 
lr J300Z4y4A 


Keratin, high sulfur B2 protein 


12.44 7.22e-17 73-106 


1116 
IjIO 


TPT3AAOAQ/1 A 
lr±5UUZ4y4/\ 


Keratin, high sulfur B2 protein 


10 /t/i *7 cc^ nn m 

iz.44 /.joe-oy iy-DZ 


1116 
IjIO 


tpt* a A9/IQ/1 a 

lr r>UUZ4y4/\ 


Keratin, high sulfur B2 protein 


1 0 /i /i 0 ^„ k 1 no ii/: 
IZ.44 /.Oje-lj 103-IjO 


1116 
1 j lO 


TPT3AA9/1G/1 A 
lrrJUUZ4y4A 


Keratin, high sulfur B2 protein 


10 /f a n oin 10 00 
1Z.44 /.oze-lo Zj-DO 


1 j 10 


TPt3AA9/1Q/1 A 
lrJ300Z4y4A 


Keratin, high sulfur B2 protein 


in a a 0 n*7^ 1 c 0 n 1 
IZ.44 o.O/e-lO 3o-yi 


1116 
1 j 10 


tpraao/iq/i a 
lr r>UUZ4y4A 


Keratin, high sulfur B2 protein 


1Z.44 o.33e-lo oj-yo 


IjIO 


lr\DUUZ4y4A 


Keratin, high sulfur B2 protein 


12.44 o.94e-17 98-131 


11 1 A 
1 j lO 


TPR AAOAQA A 
lr.D00Z4y4A 


Keratin, high sulfur B2 protein 


1 0 /i /i n no ^ nn 1 n 00 

iz.44 y.o3e-oy 39- /Z 


1 11 A 
IjIO 


TPT1AA9/1Q/i A 
lr Jt>UUZ4y4A 


Keratin, high sulfur B2 protein 


1 0 /i a n 1 1 a no 111 
IZ.44 y.loe-14 /o- 111 


1116 
LD LO 


TPP: A A9/1QA A 
lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


in aa c\ /cn« in on no 

iz.44 y.oye-io ©y-izz 


1116 
1 j lO 


TPRAA9/1Q/1 A 
lr I3UUZ4y4A 


Keratin, high sulfur B2 protein 


10 /i/i 0 i&a. i/i 00 
iz.44 y. /oe-14 33-00 


1116 
1 J 1U 


lr JjUUZ*ti74r3 


iveratm, nign sunur r>z protein 


lU.Do l.Zoe-il ZZ-30 


LD LO 


lrr>UUZ4y4r> 


Keratin, high sulfur B2 protein 


in ro 0 nHo. AO C7 1A1 

io. jo z.z/e-oy 0 /-ioi 


1116 

LD LO 


TP"RAA9,AQilR 
lrrSUUZ4y4.D 


Keratin, high sulfur B2 protein 


in co 0 /rn Q i n no i n^ 

10.3 0 3.oye-io yz-ioo 


1116 
1 J lO 


TPT3AA94Q/1P: 

lr.t>00Z4y4rJ 


Keratin, high sulfur B2 protein 


in co c oo,-! 1 1 nn m 

10. Do D.ZZe-l 1 //-yl 


1116 
IjIO 


TPR AA9 AO AT* 
lrr>UUZ4y4r3 


Keratin, high sulfur B2 protein 


in co c 7Q Q 1 1 00 O/C 
10. Do j. /oe-l 1 /Z-00 


1116 


TPT3AA9AGAT3 
irrJUUZ4y4l3 


Keratin, high sulfur B2 protein 


in co /z noo. nn co /C/c 

10. do o.o/e-oy DZ-OO 


1116 
1 j lO 


TPT3AA9/1G/1PJ 

lrr$UUZ4y4r> 


Keratin, high sulfur B2 protein 


in co iC on«->. 1 1 0*7 ci 
10. Do O.Z0e-ll 3/-D1 


1116 
1 j lO 


tpp>aao/iq/1 t> 
lrr500Z4y4rS 


Keratin, high sulfur B2 protein 


inco/T/C/t,-. in 00 4 1 
10.DO O.o4e-10 Z/-41 


1116 
1 j lO 


TPQAA9/1 G/IT3 
lr JbJUUZ4y4r> 


Keratin, high sulfur B2 protein 


m co iC tzA<-% in no 111 

10. Do o.o4e-io y/-iii 


1116 
13 lO 


lrr>UUZ4y4r> 


Keratin, high sulfur B2 protein 


in co m a in 1 no 101 

10. Do o.yie-io 10/-1Z1 


1116 
IjIO 


TPP> A f\OAQA Ti 
lrr500Z4y4JtS 


Keratin, high sulfur B2 protein 


inco/cm« in ^ro nn 
10. Do o.yie-10 0Z-/0 


1116 
LD LO 


TPT3AAO/1Q/1T3 

lr.D00Z4y4.fc> 


Keratin, high sulfur B2 protein 


in co 0 nO/-> nn 110 io^r 
10. Do /.Ooe-Oy llZ-lzo 


1116 
1 j lO 


TPTi A AO/1 Q/1 "D 
lrrJUUZ4y4ri 


Keratin, high sulfur B2 protein 


1 a c 0 0 nOn nn cn 01 
10. Do /.Ooe-oy o/-oi 


1116 
1 J lO 


TPPj a m A O A "D 

lrr500Z4y4r> 


Keratin, high sulfur B2 protein 


in co n oCn 10 An £. 1 

10. Do y.ZDe-lZ 4/-01 


1116 
IjIO 


TP"Q A AO/1 Q/1 T3 

lrr>00Z4y4J3 


Keratin, high sulfur B2 protein 


incon/i/in 11 110 in 

10. do 9.44e-l 1 I1/-131 


1316 


TPR0094Q4P 


T^p^rcitin 1rio"li Qillfiir R9 r\rrvff*in 
JVC! dU.ll, lllgll DU11U1 XJZ, pitJLClll 


14. 40 i.uue-uo LD-Do 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.18e-l 155-98 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.22e-17 101-144 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 L36e-ll 95-138 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.54e-16 1-44 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.60e-31 41-84 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.94e-26 26-69 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.76e-13 25-68 
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Results * 


1316 


IPB002494C 


: r^— i 7? — ^ : 

Keratin, high sulfur B2 protein 


1A AG 1 A^p 10 90-A1 
I'r.H'O 3.*+3e-JLv/ ZU-Oj 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AA 1 A7<= 99 ^A QQ 
14. 40 J.^/c-iiZ J\j-yy 


1316 ! 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AG A HQp 1 9 A*\ 5252 
14. 40 4.Uye-lZ HO-oo 


1316 j 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AG A 1 Qt* 19 1 1 A 1^0 

14.40 4. rye- iz iio-ijy 


1316 j 


IPB002494C 


Keratin, high sulfur B2 protem 


1 A AG A 9Ae» 1Q 91 ALA 
14.40 4.ZuC- 1 o Zl-04 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A A A Ala OA CI 1 9 A 
14. 40 4.4/e-x4 ol-lZ^f 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A£. A <Qo 1 1 A AQ 

14. 4o 4.oye-i / o-4y 


1316 


IPB002494C 


Keratm, high sulfur B2 protein 


1 A A£ A Qf\a 9Q Hfk 1 1 Q 

14.40 4.oUe-Zo /o-iiy 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A£. A Q/li= 1A 1 A53 
14. 4o 4.cS4e-lU OD-lUo 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A Af* A Q^o 1 H fs.V\ 1 CV\ 

14. 4o 4.yoe-iu OU-1LO 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AG. A Q^/=> AQ 1 AA 1 A"\ 

14. 40 4.ioe-uy LW-IHD 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14. 40 _).Uie-l3 yKJ-LJj 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


\A Af* ^ SHp-?^ ^1-74. 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AG 5 ^Ap 1 A ^ ^ 75? 


1316 


1PB002494C 


Keratin, high sulfur B2 protein 


1A AG *\ D 80-1 9^ 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


i a AG G A9f»-10 191-1 fiA 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


14. 40 O. JUt Zj H-D-Oy 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A 6 s9p 91 01-1^A 
14. 40 0.jZ6"Z1 yL"\.D t r 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AA A AA^ 9A OA 1 ^0 
14. 4D 0.04e-Z0 


1316 


TTin AA(^ A f~\ A 

IPB002494C 


Keratin, high sulfur B2 protein 


14.40 o.yie-ii o3-izo 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 /I /t< 7 A/l» 99 111 1 ^A 
14.40 /.U4e-ZZ 111-134 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AG 7 1 9o 1 O 70 1 1 "3 

14.40 /.ize-iu /U-113 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AG 7 <7*> 11 1A ^Q 

14. 40 /.o/e-3j io-3y 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AA 7 7^o 98 9.G 1 90 
H.HO /./j6-Z(5 oD-IZ!7 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AA R Q/Ip 1 0 7- < 4 1 1 S 
14. 40 o. 346-11/ /3-lio 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 ylA S ^1*» 11 ^A Q1 
14.40 i5.33e-13 3U-yj 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /I A C 7f|o on 1 1 ^A 
14.40 o. /Ue-ZU 1 1-34 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AA S2. 7A& AQ 10 ^1 
14.40 O. /4e-Ui7 1U-33 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


\A AG Q QCp 91 1A 7Q 
14. 40 «S.oc5e-Z3 30-/y 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 AG Q QOa 9Q AA 1 OQ 

14.40 o.ooe-zo oo-iuy 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /1A Q 1 Ho 1 A 1A 91 
14.40 y.3Ue-lU 3U-/3 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /1A Q 19» 99 S 1 OA 

14.40 y.3Ze-Z / 3 1-V4 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 ylA Q 1C*a 91 71 1 1A 
14.40 y.3oe-Z3 / I- 114 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AA Q ^7*» 17 1 AA 1 AQ 

14.40 y.3 /e-i / ivo- ity 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A AG Q AQ 1A<J 1A2 

14.40 y.3cse-uy 1U3-140 


1316 


IPB002494C 


Keratin, high sulfur B2 protein 


1A A A Q AA*» 9^ A1-10A 

14.40 y.oue-z3 oi-iv'H- 


1316 


TT^TlAAA A f\ A i 

IPB002494C 


Keratin, high sulfur B2 protein 


1 A AA QQ1(» 1 A A0-R1 
14. 40 z?.zt ie~l\J W-oj 


1316 


IPB003888D 


FY-rich domain N-terminus 


9/1 9A G '\ Go AQ 1 00 1 A9 

Z4. /u o.ioe-uy lUU-l^Z i 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


11/11 1 HAp 052 152 521 
13.41 l.UUe-UO 3o-o3 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


11/11 1 1 52*» OQ 1 A 4 ? 91 O 
13.41 1. IOC-US' 103-Z1U 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


11 Ai" 1 952<a AQ <2^ 11A 
13.41 l.Zoe-Uy o3- 13U 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


11 A1 1 1Q/* 10 110 17S 

13.41 i.3ye-iu 13U-J./3 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


11/11 1 AGt> HQ AO 1 A^n 
13.41 1.40e-Uy 0U-1U3 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


1 O /I 1 1 CCa AQ 9A 1 1 < 

13.41 1.33e-uy /U-113 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


1 1 A 1 O OOa AQ 1 9A 991 i 

13.41 z.zye-uy 1/0-zzi 


1117 
1 j 1 / 


tproooooa 


\T pL-r+fzYvmtf^ m F»t£i 1 1 of Tii on i»i n fVi m W\i 1 

y CriCLM. ttLw IlJ.dttllUl.lllVJl.lwl.il, lttlllil_y 1 


13.41 2.35e-10 64-109 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.11e-09 69-114 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.32e-10 104-149 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 3.94e-09 239-284 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 4.12e-09 125-170 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.32e-09 201-246 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.59e-09 190-235 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 5.68e-09 175-220 
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1317 


Tnn a a a a azt 

IPB000006 


Vertebrate metallothionein, family 1 


IT C /TO„ AA 1 A £ZA 

13.41 5.68e-09 19-64 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 A 1 tZ O /I ~ 1 A O 1 A iff 

13.41 5.84e-10 210-255 


1317 


tt» t"» a a a a a 

IPBOOOOOo 


Vertebrate metallothionein, family 1 


A 1 c OT« AA OA 1 AC 

13.41 5.87e-09 80-125 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 /) 1 C C\A~ 1 A 1 1A OK 

13.41 5.94e-10 1/0-215 


1317 


TTk"D r\(~\r\r\r\ tz 

IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 /I 1 C A/I ^ 1A1AA1/1C 

13.41 5.94e- 10 200-245 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 yl 1 ZT O O „ AA 1 O/T 111 

13.41 6.23e-09 186-231 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


1 1 At C 00„ AA 1 iCA AAC 

13.41 o.ooe-09 160-205 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


1 O At 1 1 A,-> 1A AAC T7A 

13.41 /.39e-10 225-2/0 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 At 7 /ll^ AA 11C 1/CA 

13.41 7.43e-09 115-160 


1317 


Tnr» A Ai"V A A /T 

IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 /I 1 T ylA^ A A ff 1 AA 

13.41 7.43e-09 55-100 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 A 1 T^O^. 1 A A/1 /C A 

13.41 /.ooe-10 24-oy v 


1317 


T"n"n a a a a A f 

IPBOOOOOo 


Vertebrate metallothionein, family 1 


13.41 7.97e-10 215-260 


1317 


fni^ AA AAA/" 

IPB000006 


Vertebrate metallothionein, family 1 


11 At O Of „ A A 1 C £Zf\ 

13.41 8.35e-09 15-60 


1317 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 8.35e-09 45-90 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family 1 


11 /I 1 o /TO _ AA A 1 /I A 

13.41 8. 62e-09 95-140 


1317 


TTiT* A A A A A/" 

IPBOOOOOo 


Vertebrate metallothionein, family l 


1 o /I 1 r\ 1 O _ 1/^ 1<^A t £ E 

13.41 9.13e-10 120-165 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family l 


11 /ii f\ AA AAC If A 

13.41 9.27e-09 205-250 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family l 


1 yf 1 /~\ O ^7 „ AA OA/C Of 1 

13.41 9.27e-09 206-251 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family l 


11 yl 1 A /I C „ AA 1 A^ ni 

13.41 9.45e-09 126-171 


1317 


TTITIAAAAA^ 

IPBOOOOOo 


Vertebrate metallothionein, family l 


11 At A ^1 „ 1A1/1 OA 

13.41 9.61e-10 34-79 


1317 


IPBOOOOOo 


Vertebrate metallothionein, family l 


ii /i 1 r\Ai— . a a t ct 1 a^t 

13.41 9.91e-09 151-196 


1317 


TnnAAAAAit 

IPBOOOOOo 


Vertebrate metallothionein, family l 


11 At AA1^. AA1A "~1 A 

13.41 9,91e-09 29-/4 


1317 


IPB000359B 


Cystine-knot domain 


19.26 1.28e-09 128-146 


1317 


TT* H A AA^ £* AT"» 

IPB000359B 


Cystine-knot domain 


lAO/TO OA„ 11 1ZT1 101 

19.26 2.80e-l 1 163-181 


1317 


T'nT^AAA'i c at> 

IPB000359B 


Cystme-knot domain 


1A 1^" 1 CA„ AA 1/11 1 1 

19.26 3.53e-09 243-261 


1317 


THTl AA A1 /7 AT"* 

IPB000359B 


Cystine-knot domain 


1 A 1 X" /I 10_ AA H /< A 

19.26 4.38e-09 22-40 


1317 


t r"fcT~\ a a a «*■* r at-* 

IPB000359B 


Cystine-knot domain 


19.26 6.06e-09 248-266 


1317 


TTkT»AAA1 C i~\T~\ 

IPB000359B 


Cystine-knot domain 


1A 1 /T Zf 1 A ~ AA AA1 111 

19.26 6.34e-09 203-221 


1317 


TTinAAAl £ ATI 

IPB000359B 


Cystine-knot domain 


1 A 1 SZ £~ /■')„ II 1 AO A 1 £ 

19.26 6.63e-12 198-216 


1317 


TT»T» AAA1 C AT> 

IPB000359B 


Cystine-knot domain 


1 A 1 /T O Of „ 1 1 /^1 OA 

19.26 7.75e-12 62-80 


1317 


TT»T"» nnAO C ATI 

IPB000359B 


Cystine-knot domain 


1A 1 /T O 00„ AA 11 AA 

19.26 8.88e-09 72-90 


1317 


TnT*AAAA/"TP 

IPB000967E 


Zinc finger NF-X1 type 


11 OO 1 OT„ AA 111 If 1 

21.88 3.73e-09 111-151 


1317 


THT* A A A A /"TP 

IPB000967E 


r-r • _ _c X TT? "V1 J 

Zinc finger NF-X1 type 


11 OO £L A/Z~ AA OA 1 1A 

21.88 6.46e-09 89-129 


1317 


IPB00H69K 


Integrin beta, C-terminus 


0*7 AC C TO„ AA 1 1 /C ICO 

27.45 5.38e-09 216-258 


1317 


IPB001271 


Mammalian defensin 5 


1A f\<~I 1 AA_ AO t- 1 OA 

19.97 1.00e-08 61-89 


1317 


IPB001271 


Mammalian defensin 


1 /A /*\^7 1 A f\ f\r\ 1 01 1 r\ 

19.97 1.49e-09 132-160 


1317 


IPB001271 


Mammalian defensin 


1 r\ t\r-t 1 OO 1 A ill OA 

19.97 1.88e-10 42-70 


1317 


IPB001271 


Mammalian defensin 


-t r\ A 1 /■* AA *t A O AA 1 

19.97 2.15e-09 193-221 


1317 


IPB001271 


Mammalian defensin 


19.97 2.31e-09 163-191 


1317 


IPB001271 


Mammalian defensin 


1 f\ f\*t 1 jl ^"J _ AA 1/11 IT/A 

19.97 2.47e-09 242-270 


1317 


IPB001271 


Mammalian defensin 


1 A A *~~t A OA AA "f *1 A /" 

19.97 2.80e-09 157-185 


1317 


TTTiT"* A A 1 A i 

IPB001271 


Mammalian defensin 


1 1 I^T 11A 1Af/* O/l 

19.97 3.29e-10 56-84 


1317 


TTir»AA 1 1 O 1 

IPB001271 


Mammalian defensin 


1 />*7 /I '"T/T _ AA 1 1 1 A A 

19.97 4.76e-09 162-190 


1317 


IPB001271 


Mammalian defensin 


*f A A^T ^ A A _ A A A f F A 

19.97 5.09e-09 26-54 


1317 


IPB001271 


Mammalian defensin 


1 A A^T C A A 1 A 1/IO 11/* 

19.97 5.09e-12 148-176 


LD 1 / 


TDDAA1 H 1 

lrBU012/l 


Mammalian defensin 


iQ a^ c oio aa A/1 


1317 


IPB001271 


Mammalian defensin 


19.97 6.25e-ll 57-85 


1317 


IPB001271 


Mammalian defensin 


19.97 6.29e-10 51-79 


1317 


IPB001271 


Mammalian defensin 


19.97 6.47e-10 52-80 


1317 


IPB001271 


Mammalian defensin 


19.97 6.65e-10 158-186 


1317 


IPB001271 


Mammalian defensin 


19.97 7.05e-09 227-255 


1317 


IPB001271 


Mammalian defensin 


19.97 7.22e-09 167-195 


1317 


IPB001271 


Mammalian defensin 


19.97 7.53e-10 17-45 
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Results* 


1 d 1 / 


TTJ"DA A1 T71 

1FB0012 /l 


Mammalian defensin 


1 r\ f\T i~t i-t ■»/\iA , *"» t >i £• 

19.97 7.71e-10 147-175 


ion 

13 1 1 


tdtd aa 1071 
1FB001271 


Mammalian defensin 


19.97 7.87e-09 202-230 


inn 
131 / 


TT1"D A A 1771 

1FB001271 


Mammalian defensin 


1 n n*r 0 o/*\ nn 1 00 011 

19.97 8.20e-09 183-211 


ion 
13 1 / 


TDDAni 771 

1FB001271 


Mammalian defensin 


19.97 9.18e-09 192-220 


ion 
131 1 


TT>T5 A A 1071 

1PB001271 


Mammalian defensin 


19.97 9.35e-09 81-109 


1317 


TTIT5 A A 1TT1 

1PB001271 


Mammalian defensin 


19.97 9.67e~09 123-151 


1317 


1PB001762A 


Disintegrin 


23.93 4.88e-09 91-131 


1 O 1 ^7 

1317 


TT1T>AA1 TZTO A 

IPB001762A 


Disintegrin 


23.93 7.86e-12 99-139 


131 / 


TDT5 A AO A C\A A 

1PB 002494 A 


Keratin, high sulfur B2 protein 


10 a A -i r\r\ 1 a t f 1 t r\ a 

12.44 1.00e-14 161-194 


ion 
131 / 


TT>TD A AO A C\ A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12 1.41e-18 201-234 


1 O 1 "7 

131 / 


Tuonno AC\ A A 

1FB002494A 


Keratin, high sulfur B2 protein 


12.44 l.ooe-17 166-199 


ion 
131 / 


TDDAAO AC\ A A 

1FB002494A 


Keratin, high sulfur B2 protein 


10 A A 1 TO — 1/"VOOO r\ s~ r 

12.44 1.72e- 10 232-265 


131/ 


1FB 002494 A 


Keratin, high sulfur B2 protein 


10 a A 1 oo_ 1 c *\ r\ s~ T^/^i 

12.44 L88e-15 196-229 


1317 


TDDAAO A C\ A A 

1FB002494A 


Keratin, high sulfur B2 protein 


10 if a 0 i/^ -1/"^*/% n 0 

12.44 2.16e-16 50-83 


ion 

1317 


TnnAno,m/i a 

1FB002494A 


Keratin, high sulfur B2 protein 


10 a a 0 0 zr rtrt 0 r> 

12.44 2.35e-09 5-38 


1317 


TT1T!> A AO /I A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 2.44e-12 80-1 13 


ion 
13 1 / 


TDDAA7/1A>1 A 

1FB 002494 A 


Keratin, high sulfur B2 protein 


10 /I/IO A A -1 A 1 A/" ion 

12.44 2.44e-14 106-139 


nn 
1317 


TOT* A AO A A A A 

1FB 002494 A 


Keratin, high sulfur B2 protein 


IO A O ✓"'-7 1^" 101 1 S~ A 

12.44 2.67e-16 131-164 


nn 

1317 


IPB002494A 


T7" J * 1*1 1 r* f>» « . • 

Keratm, high sulfur B2 protein 


12.44 2.67e-16 146-179 


nn 

1317 


IPB002494A 


1 • 1*1 1 r T"»0 j- * 

Keratin, high sulfur B2 protein 


12.44 2.84e-10 251-284 


1317 


IPB002494A 


IT J- * 1*1 1 f> T"fcO 1 • 

Keratin, high sulfur B2 protein 


12.44 3.12e-17 40-73 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 3.42e-09 97-130 


ion 
131/ 


TTJT3 A AO A A A A 

1FB 002494 A 


Keratin, high sulfur B2 protein 


12.44 3.42e-13 15-48 


nn 
13 1 / 


TTJDAA7/IA/1 A 

1FB002494A 


Keratin, high sulfur B2 protein 


12.44 3.43e-19 171-204 


ion 
131/ 


TTITD A AO A A A A 

1FB002494A 


Keratin, high sulfur B2 protein 


12.44 3.43e-19 206-239 


nn 
1317 


TTlDAAO/IA/i A 

1FB002494A 


Keratin, high sulfur B2 protein 


12.44 3.57e-16 25-58 


ion 
13 1 / 


TTinAAo.in.t a 

1FB 002494 A 


Keratin, high sulfur B2 protein 


12.44 3.81e-09 41-1 A 


nn 
13 1 / 


TT»"D AAO A A A A 

1FB002494A 


Keratin, high sulfur B2 protein 


10 vi/i 0 n^" in»-»/-»y^ ^£"n 

12.44 3.86e-19 226-259 


ion 
1317 


TDDAAI A C\A A 

1FB002494A 


Keratin, high sulfur B2 protein 


12.44 3.88e-15 176-209 


ion 

1317 


THDAAO/1A/I A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 4.00e-09 81-114 


1317 


TTJTD A AO A A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 4.23e-13 21 1-244 


nn 

1317 


TTitS AAO A A /I A 

1PB002494A 


Keratm, high sulfur B2 protein 


12.44 4.36e-14 45-78 


i o n 
13 1 / 


TTIDAAO A C\ A A 

1PB002494A 


Keratin, high sulfur B2 protein 


*i r*% A A A A A 1 n 1 n 1 a 1 i 

12.44 4.44e-17 181-214 


ion 
13 1 / 


TDD AAO /I A/1 A 

1PB 002494 A 


Keratin, high sulfur B2 protein 


10 A A A *T 1 ITIn ✓"O 

12.44 4.71e-17 30-63 


nn 
13 1 / 


TT5TDAAO/1 C\ A A 

1PB002494A 


Keratin, high sulfur B2 protein 


10 A A A f~i*1 n.f\ 110 1 A c 

12.44 4.87e-09 112-145 


nn 
13 1 / 


TTJ"D AAO A A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


1 O A A A On 10 111 1 A A 

12.44 4.89e-12 111-144 


1017 
13 1 / 


TT1DAAO/1A/1 A 

1FB002494A 


Keratin, high sulfur B2 protein 


10 a A c r\f\ ini<"\i on/i 

12.44 5.00e-19 191-224 


nn 
13 1 / 


1PB002494A 


Keratin, high sulfur B2 protein 


12.44 5.27e-13 96-129 


ion 

13 17 


TTIT"» A AO A A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 5.35e-09 177-210 


nn 
1317 


TT>T"> f\ /"VO A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.40e-10 26-59 


1017 
1317 


IPB002494A 


Keratin, high sulfur B2 protein 


•1 r\ A A c r f\ 1 £~ 1 n 1 *i A 

12.44 5.50e-15 101-134 


ion 

1317 


TT»T»AA^ j< n A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.71e-ll 151-184 


1317 


IPB002494A 


Keratm, high sulfur B2 protein 


12.44 5.73e-13 60-93 


1317 


IPB002494A 


Keratm, high sulfur B2 protein 


12.44 6. 14e- 16 35-68 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.28e-14 75-108 


1117 
ID L / 




Keratin, high sulfur B2 protein 


10 A A /T1 „ AA OOO" r\ 

12.44 6.61e-09 237-270 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66e-16 136-169 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66e-16 65-98 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.66e- 16 70-103 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.71e-09 36-69 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.73e-18 156-189 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 6.79e-ll 221-254 


1317 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7.00e-l 120-53 
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TT\ 
IU 


Database 
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Description 


Results* 


13 1 / 


TDDAAO/1Q/1 A 

l.rr>UUZ4y4A 


. — . 

Keratin, high sulfur B2 protein 


1 O A A *7 1 1 „ 11 1 n/ 1.CA 

12.44 7.1 le-1 1 126-159 


13 1 / 


lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


10 A A *7 1 r _ i /- 1 A 1 -t c A 

12.44 7.17e-16 121-154 


13 1 / 


TDDAAO/tA/t A 

lr£>UUz4y4A 


Keratin, high sulfur B2 protein 


11 A A "7 O O _ 1 <"7 o 1 Z" A C\ 

12.44 7.22e-17 216-249 


1 1 O" 

1-51/ 


TDDAAO/1A/1 A 

lrr>UU/4y4A 


Keratin, high sulfur B2 protein 


1A A A r-1 A A — 1 A 1 O O 1 A 

12.44 7.44e-12 186-219 


13 1 / 


lrr$UUz4y4A 


Keratin, high sulfur B2 protein 


-i a A c o ~ a a Tif i aa 

12.44 7.58e-09 76-109 


1 1 1 "7 
131/ 


tdd aao /i a a a 
lrrJUU24y4A 


Keratin, high sulfur B2 protein 


Irt A A r-i rjrj^ AA 1 Ol /-»-!/- 

12.44 7.77e-09 182-215 


1-117 

13 1 / 


TDDPlAO/IA/l A 

lrJt>UU24y4A 


Keratin, high sulfur B2 protein 


12.44 7.97e-09 142-175 


i m 7 
1.51/ 


lrr5UUz4y4A 


Keratin, high sulfur B2 protein 


12.44 8.00e-12 236-269 


1 1 7 
131/ 


TPDA AO AQA A 
Lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


1 O A A O r\^7„ 1 /C c c oo 

12.44 o.07e-16 55-88 


1j1 / 


TPDAAO/IQ/t A 

lrr$UUZ4y4A 


Keratin, high sulfur B2 protein 


10 A A O A ^7* 1 A O A iC T7f\ 

12.44 8.47e-10 246-279 


I J 1 / 


TPDAAO/1 G/1 A 

lrr$UUZ4y4A 


Keratin, high sulfur B2 protein 


lO A A O zrT„ 1 A £ OA 

12.44 8.67e-10 6-39 


ni7 
13 1 / 


lrr>UUz4y4A 


Keratin, high sulfur B2 protein 


12.44 8.93e-ll 141-174 


1 1 1 7 

1-51/ 


TPDAAO/1 Q/1 A 

lrr5UUZ4y4A 


Keratin, high sulfur B2 protein 


12.44 9.13e-15 231-264 


131/ 


TPDAAO/1A/1 A 

lrhJUUz4y4A 


Keratin, high sulfur B2 protein 


12.44 9.33e-12 241-274 


m7 
l j l / 


TPDAAO/1Q/1 A 

lrrJUUZ4y4A 


Keratin, high sulfur B2 protein 


10 /I /l A O 1 „ Art O A"7 O /I A 

12.44 9.81e-09 207-240 


i j i / 


TPDAAO/IG/I A 

lrr>UUZ4y4A 


Keratin, high sulfur B2 protein 


10 /l /I A A A „ AA O.C IIO 

12.44 9.90e-09 85-118 


m7 


TPDAAO/1Q/1D 

lr^r>UUZ4y4r> 


Keratin, high sulfur B2 protein 


1A CO 1 AA„ AA 1 iCA 1^7/1 

10.58 1.00e-09 160-174 


1 j 1 / 


TP"RAAOzlQ,dR. 
Lrr> KJ UZ4y4Jt> 


Keratin, high sulfur B2 protein 


1ACO 1 /ICrt lO 1 AA OA/1 

lU.Do 1.4De-lz iyu-z04 


1117 


lr^l3UUZ4y4J3 


Keratin, high sulfur B2 protein 


1A CO 1 /|f- 10 OOC OIA 

10.58 1.45e-12 225-239 


1117 
13 1 / 


TPD A AO A O /I D 

lJrJb>UUZ4y4JB 


Keratin, high sulfur B2 protein 


1 A CO 1 /TT „ AA OA 1 AO 

10.58 1.63e-09 89-103 


1117 
1 J 1 / 


TPDAAO/1 0/ID 

lJrr>UUZ4y4r> 


Keratin, high sulfur B2 protein 


lACOOOC^ 1 A IOC 1 O A 

10.5b 2.75e-10 125-139 


1117 
1 3 1 1 


TPDAAO/1 A/ID 

lrr>UUZ4y4-t> 


Keratin, high sulfur B2 protein 


1ACOO 11~ 11 C A /- Ci 

10.58 3.1 le-1 1 54-68 


1 "3 1 7 

13 1 / 


TP "DA AO A A/I'D 

lx J r>UUZ4y4r> 


Keratin, high sulfur B2 protein 


1ACOO 01 11 lOA 10/1 

10.58 3.81e-l 1 120-134 


1117 

1 J 1 / 


TDD A AO A A AID 

lrr>UUz4y4r5 


Keratin, high sulfur B2 protein 


1A CO C yt 1 1 A A /TO 

10.58 5.64e-ll 49-63 


1117 
1 J 1 / 


ll J r>Uuz4y4h> 


Keratin, high sulfur B2 protein 


1 A C O C /rn„ AA 10A 1A/1 

10.58 5.69e-09 180-194 


1117 

131/ 


TDD A AO /l A /I'D 

lrr>UUz4y4B 


Keratin, high sulfur B2 protein 


10.58 5.69e-09 215-229 


1117 
131/ 


TPDAAO/IG/I D 

lri3(JUZ4y4r> 


Keratin, high sulfur B2 protein 


1A ^o o ^ _ 1A ir\ oo 

10.58 5.84e-10 19-33 


1117 
13 1 / 


TPDAAO/1 A/ID 

lrr>UUz4y4r5 


Keratin, high sulfur B2 protein 


1 A C O AO 11 1 1 z^rt 

10.58 5.92e-ll 155-169 


1117 
131/ 


TT>DAAO/1 A/ID 

lrr5UUz4y4JD 


Keratin, high sulfur B2 protein 


10.58 6.07e-09 69-83 


1117 
131/ 


TPDAAO/1 A/ID 

l±'r>ULIZ4y4r> 


iveratm, nign sullur Bz protein 


10.58 6.58e-09 185-199 


1117 
131/ 


TPDAAO/1 A/ID 

lrr$UUz4y4r$ 


Keratin, high sulfur B2 protein 


1 A cct y* o _ rtr\ oo/*v oo /i 

10.58 6.58e-09 220-234 


1117 
131/ 


TPDAAO/IO/ID 

lrr> U UZ4y 4d 


Keratin, high sulfur B2 protein 


1 A CO zT A AA C A no 

10.58 6.96e-09 59-73 


1117 

13 1 / 


TPDAAO/1 A/ID 

lFoUUz4y4r> 


Keratin, high sulfur B2 protein 


1A CO *7 AO AA 1 CA 1 f A 

10.58 7.08e-09 150-164 


1117 

131/ 


TPDAAO/1 A/1 D 

lr'r>UUZ4y4Jb> 


Keratin, high sulfur B2 protein 


1 rV r" O T AO AA 1 Af OArt 

10.58 7.08e-09 195-209 


1117 

131/ 


TDD A AO A A A D 

lrr>UUz4y4r> 


Keratin, high sulfur B2 protein 


1 A CO "7 A*"7 AA ^n AO 

10.58 7.97e-09 79-93 


1117 

131/ 


TDD A AO /I H/ID 

lrr5UUz4y4r5 


Keratin, high sulfur B2 protein 


10.58 8.52e-10 170-184 


1117 

13 1 / 


TDD a AO /t a A D 

lrr>UUZ4y4JD 


Keratin, high sulfur B2 protein 


10.58 8.52e-10 205-219 


1117 

13 1 / 


tdd ri/n ^n/iD 
lrr>(JUz4y4B 


Keratin, high sulfur B2 protein 


10.58 9.25e-12 64-78 


1117 

13 1 / 


TDD A AO /I A /ID 

lrl3UUz4y4r> 


Keratin, high sulfur B2 protein 


10.58 9.33e-10 130-144 


1117 
13 1 / 


TDD A AO /I A A D 

lrr>UUz4y4r> 


Keratin, high sulfur B2 protein 


10.58 9.37e-09 94-108 


1117 
131/ 


TDD A AO /I A/I D 

lJrr>UUz4y4h> 


Keratin, high sulfur B2 protein 


10.58 9.62e-09 240-254 


1117 
13 1 / 


TDD A AO A A A 

!JrBUUz4y4C 


Keratin, high sulfur B2 protein 


1^ /ly"1 lO 01 1^/1 1 <"\^J 

14.46 1.12e-21 154-197 


1117 
131/ 


TP D A AO A A A 

lrDUUz4y4C 


Keratin, high sulfur B2 protein 


i a Aft i^f\ 1010 c s~ 

14.46 1.23e-18 13-56 


1117 
13 1 / 


TDD A AO ACS A 

lrr>UUz494C 


Keratin, high sulfur B2 protein 


14.46 1.35e-10 89-132 


1117 
131/ 


TDD A AO /I A /I /""* 

lrr>UUz4y4C 


Keratin, high sulfur B2 protein 


14.46 1.38e-12 22-65 


1317 




jverdiin, nign buiiui jdz proicin 


1/1/1^1 <Qo AO TO 11*; 

14.40 i.Dye-uy /z-iid 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.71e-20 224-267 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.73e-ll 27-70 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.91e-ll 37-80 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.11e-22 209-252 


1317 


IPB002494C j 


Keratin, high sulfur B2 protein 


14.46 2.13e-12 84-127 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.22e-10 123-166 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.22e-12 168-211 
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in 

INTO* 


Database 
entry lu 


Description 


Results* 


1117 

LJ L / 


LJT Lj\j\J JJ'~TZ''-r\^ 


iverdiiii, mgn suiiur rsz proiem 


1 A 1 OQo on 1 qa oil 
14.40 Z.ZyQ-Zy lqh-ZZ / 


1117 




jvcratin, nign suiiur oz protein 


14.40 Z.ole-lU 4Z-0D 


1317 




jveraun, iiign suiiur jdz proiein 


1 A O A<<=> 1 ^ OQ 1 AO 

14.40 z.4De-ij yy-i4z 


1317 


TPR0094Q4P 

J.JL iJuuZiT^y'Tvj 


jvciaLiii, iiigu ouiiur oz pioteiii 


14 AA 9 A Qc* 1 O n£ 

14. 40 z.4oe**iy JJ-/0 


1117 


TP"Rfl094Q4r 1 


iverann, nign suiiur dz proiein 


1/1 AC. O f A 0 1 A OIA OT~l 

14.40 z.DUe- 14 zJ4-z/ / 


1117 

LJ L / 


LIT LJ\J\jZ, c *y L f\^> 


jveraxin, nign. suiiur rsz protein 


1 A AC O Z^a. 1 c no 1 A 1 

14.40 z.jje-lj yo-l4i 


1117 


LTD \J UZ4y40 


rverann, nign suiiur r>z protein 


1 A AC 0 C7n on co im 

14. 4o z.j /e-zy jo-iui 


1117 

LJLf 


1 Jr jD \J UZ4y4 


jveraxin, nign suiiur dz protein 


1 A AC 1 CCi^. HO AH C\(\ 

14.40 z.oUe-uy 4/-yu 


1117 


TPR0094Q4^ , 


rwerdLiiij nign suiiur r>z protein 


1 A AfZ 1 £LA a 11 1 A1 -\ QC 

14.40 Z.o4e-l 1 143-loD 


1117 

LJ 1 / 


TPRflft94Q4P 

LIT D\J\JZt l ry-fK^ 


jverdim, nign suiiur r>z protein 


1 A AfL 1 Hflo Ol 1 TO OOO 

14.40 3.UUe-z3 1 ly-ZZZ 


1117 




iveraiin, nign suiiur dz protein 


1 a AC 1 Ana 11 m 1 n c 
14.40 ^.uye-l 1 133-1 /o 


1317 


TPR009404P 


ivciaiiii, nign suiiur jdz protein 


1/1 A(k 1 HO OO'X 0£.£. 

14.40 3.3oe-Uy ZZ3-ZOO 


1317 


TPR00^4Q4r 1 


jvcidim, nign suiiur dz protein 


1/1 A£L 'X TCo 11 OA/f OA "7 

14.40 j.joQ-Zj ZKjQ-Zht 1 


1317 


TPR009404P 


ivcidiin, nign suiiur r>z protein 


1/t AtZ 1 A A a. AO TJO OOI 

14.40 3.44e-Uy z3o-zol 


1117 


TPR0n94Q4r 1 
IJr JDUUZ4y4k^ 


Keratin, high sulfur B2 protein 


i/i ac 1 o£ n nn no oa 

14.4o 3.ooe-(Jy zld-zol 


1117 

LJ L / 


ijrr>uoz4y4v_^ 


Keratin, high sulfur B2 protein 


1/t ac 1 nn„ 11 \C\a r }">'~i 

14.46 3.90e-3z 194-237 


1117 


TPRn09404r' 


Keratin, high sulfur B2 protein 


1/1 ac i m^. in n cc 
14.40 3.y /e-lu Iz-DD 


1117 

LJ L / 


TPRn09404r < 

Lr LJ U vZ*T7 4 T^ 


Keratin, high sulfur B2 protein 


1/1 /1/C /l f\Cn 1 A 1 OO OO 1 

14.4o 4.Uoe-lU loo-z3i 


1117 


TPR0n94Q4r s 


Keratin, high sulfur B2 protein 


i a ac a iOn 11 no oo 1 

14.40 4. loe-1 1 17 o-zzl 


1117 

LJ I / 


TPRnn9zlQ4P 


Keratin, high sulfur B2 protein 


1 /t /l/C /I O /T„ 1 1 /T'-? 1 1 A 

14.46 4.3oe-ll 67-110 


1117 


TPRnn94Q4r i 


Keratin, high sulfur B2 protein 


1/1 /t/£ /i iCj~k oi 1 nn o/io 

14.40 4.3oe-zi lyy-242 


1317 


TPR0094Q4P 


jverdim, nign suiiur isz protein 


1/1 /I /l /I Oo. 1 O 1 A A 10"7 

14.40 4.4ye-lo 144-lo/ 


1317 


TPR009404P 


jtveidim, nign suiiur dz protein 


1/1 Ad A COq OO /lO OC 

14.40 4. D2e-2o 43-ou 


1117 


TPR009404P 


jverdim, nign suiiur r>z protein 


i/i ac A on OOO 00"0 1 
14.40 4. j je-2 / ZZy-ZIZ 


1117 

LJ 1 / 


TPR0094Q4r 1 


iveratm, nign suiiur £>z protein 


i a ac A c~i* in 1 co om 
14.40 4.6 /e-lU iDo-ZU 1 


1317 


TPR009404P 


xvcrdim, nign suiiur dz protein 


1/1 /t/^ /I C/1/a 10 1QQ O/l 1 

14,40 4.o4e-lz iyo-z41 


1117 

LJ X I 


TPR0094Q4P 


rvcidtm, nign suiiur dz protein 


1 /i /i ^ ^ o/i c oo i m i >t 
14.40 D.U4e-Uy 1U3-140 


1117 

LJ L 1 


TPR009404P 


jverdim, nign suiiur dz protein 


1/1 /t^ ^ in a ok 1 ^o 010 
14.40 D. lUe-zD loy-ziz 


1117 

LJ L I 


TPR009404P 


rvcrdiin, nign suiiur dz protein 


1/1 /I £ C 1 Oq Ifl Kl 1 AiT 

14.40 j. iye-iu Ij3-ly6 


1117 

LJ L 1 


^PR^094Q4^ , 


jveraun, nign suiiur dz protein 


1/1 /1/C c en a 1/t o/i in 
14.46 O.OUe-14 y4-13 / 


1117 

LJ L 1 


TPR009404r' 


iveidim, nign suiiur dz protein 


1 /i A k. ^ria iQ oo 1 o/c 
14.40 J.DUe-lo o3-lz6 


1117 
LJ L l 


TPR0094Q4P 


jverdtin, nign suiiur dz protein 


1 A AC C OC £TO 111 

14.40 D.DOe-zD Oo- 111 


1117 
LJ X / 


TPR^^9404^ , 


Keratin, high sulfur B2 protein 


1/t /i^c c nn« i/i o/i/i ioi 
14.40 D.yUe-14 244-2 o / 


1117 

LJ L / 


TPR0094Q4P 


jxeratin, nign suiiur dz protein 


1/1 AC C \ C^ 10 010 OZ. C 

14.46 o. loe-12 213-256 


1317 


TPR0094Q4P 


jverdim, nign suiiur dz protein 


1/1 /t£ < 1 /T„ 01 01/1 OCT 

14.40 0. 16e-zl Z14-Z0/ 


1117 

LJ L / 


TPRnn94Q4r* 

lJr±3UUZ4y4lw' 


Keratin, high sulfur B2 protein 


i/i ac c 1 n« oo i cn ono 
14.46 6.19e-3z 159-202 


1117 

LJ 1 / 


TPR0n94Q4r' 
IJrJjUUZ4y4l^ 


Keratin, high sulfur B2 protein 


1/1 C OC~ OO ~l 1 A 01*7 

14.46 6.z5e-z3 174-217 


1117 

LJ L f 


IJr J3 U UZ4 y 41*^ 


Keratin, high sulfur B2 protein 


1/1 /i/c ^ oc« inoon ooo 

14.46 6.35e-19 239-282 


1117 


li i3UUZ4y4L^ 


Keratin, high sulfur B2 protein 


1 A AC £1 f H„ 1 /" no 101 

14.46 6.57e-16 78-121 


1117 

LJ L I 


TPR009404P 


Keratin, high sulfur B2 protein 


1/1 AC C HAa. lO 100 1 TO 

14.40 6. /4e-lo 129-1 1 Z 


1117 

LJLf 


TPRnn9ziQAr i 

LLLD U UZ4 y *r 


Keratin, high sulfur B2 protein 


1 a ac c m ~ io /to mc 

14.46 o.yle-12 62-105 


1117 
LJl/ 


ijrjDUUZ4y4L^ 


Keratin, high sulfur B2 protein 


1 /I /I S~ C C\C^ 1^7 1 1 r\ 1 zTO 

14.46 o.9oe-17 119-162 


1117 

LJ 1 / 




Keratin, high sulfur B2 protein 


14.46 7.00e-ll 173-216 


1117 

LJ L / 


IJr O u UZ4 y 4 


Keratin, high sulfur B2 protein 


1/1 AC *7 AA- 10 100 ni 

14.46 7.00e-12 128-171 


1117 
LJL/ 


TPRnH9AQAP 
13 U UZ4y 4l~- 


Keratin, high sulfur B2 protein 


14.46 7.12e-10 208-251 


1317 


IPB002494C 


Keratin hi^h sulfur B2 rvrofein 


1/1 46 7 1 0f=»-9S 1 94-1 67 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.21e-16 3-46 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.30e- 18 73-116 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.43e-24 134-177 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.50e-23 53-96 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.55e-10 32-75 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.64e- 11 193-236 


1317 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 7.73e-10 148-191 
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1317 


TPR0094Q4P 

xx xj \J 1/Zi*t^ 1 t\^ 


xverdiin, nign suirur t>z protein 


1 A AC T ^7C _ OA 11/1 1 C~l 

14.46 7.75e-20 114-157 


1317 


1 X XJ KJ \J X, *T *T 


iveraiin, mgn suiiur r$z protein 


14.46 7.88e-23 189-232 


1 i17 


TPROIYMQAr 1 


Keratin, high sulfur B2 protein 


14.46 7.99e-20 109-152 


1317 


TPR009404P 


jverdxin, mgn suiiur t>z protein 


14.46 8.03e-12 163-206 


1317 


TPR0094Q4P 

xx X>v/VyZ. t TJ/ 4 TV^ 


jverdiin, nign suiiur uz protein 


1/1 /l /C O Af „ o 1 1/lr\ 1 An 

14.46 8.05e-31 149-192 


1317 


TPR0094Q4P 

J-X JJ VJ VX/'T J7'"T\^ 


xvciciLin, mgn suiiur r>z protein 


1 A A £Z O 1 C ~ OO 11 


1317 


TPR0094Q4P 


xvcraim, mgn suiiur rsz protein 


1 A /1/COOI^ TOO C1 

14.46 o.zze-lz 8-51 


1317 


TPFion?4Q4.r i 

XX iJvUi<4>''T\^ 


iveraxin, nign suiiur d/ protein 


1 A AC O 1C« 11 1 ✓T/i om 

14.46 8.25e-23 164-207 


1317 


TPR0094Q4P 


ivcraLiii, mgn suiiur dz protein 


14.46 o.oZe-Zl Zi-66 


1317 


TPR0024Q4C 


Kw PfOtm K»1fTM CI 1 1 4*1 It* T30 *^-t»/"vf-i»i"t-» 

rvciaiiii, iiigii suiiur \->jL protein 


1 A AC O Ola 1C 10 

14.40 o.ole-Zj lo-61 


1317 


TPR0074Q4P 

XX X-> \J WZ,*T J? 1 T\_/ 


xverdLin, mgn suiiur r>z protein 


1 /i /i c 0 r\ 1 c\c\ t~7 1 
14.46 o.91e-09 77-120 


1317 


TPR0074Q4P 


ivcrdiin, mgn suiiur dz protein 


14.46 o.yZe-l / 249-292 


1317 


TPR009404P 


xv.crdiin 5 mgn suiiur x>z protein 


1/1 /i iC c\ r\"Ti „ it 00 101 

14.46 y.Uze-1 / 00- 131 


1317 


TPR009404P 


xverdLin, mgn suiiur dz protein 


1 A AC C\ f\Ars. 1 A C7 1 A A 

14.46 y.U4e-lU D /-1 00 


1317 


TPR009404P 

XX xj\j\JZt'-rZ7"-r\^ 


xvcidim, mgn suiiur uz protein 


1 /I /I^C O Ozl« AO 1 A /Tl 

14.46 9.Z4e-09 19-62 


1317 


TPR007404P 


xverdim, mgn suiiur r>z protein 


1/1 /l iT A 11- in OAO ^% A c 

14.46 9.32e-13 203-246 


1317 


TPR0024Q4P 


i\.crd.Liii } nign suiiur i3z protein 


1 /l /I C A /I 1 _ OA /TO 1 A/T 

14.46 y.41e-Z0 63-106 


1317 


TPR0094Q4P 


xvcidiin, mgn suiiur dz protein 


1/1 AC Ci 1A nn 101 

14.46 9.43e-19 139-182 


1317 


TPR0074Q4P 


ivcidLiri, mgn suiiur r>z protein 


1/1 AC A CI „ OOOO Ol 

14,46 y.Dle-ZZ jo-ol 


1317 


TPR009404P 


iveraiin, nign suiiur r>z protein 


1/1 AC A 11„ 0 /T A 0 A1 

14.46 9.73e-26 48-91 


1317 


TPRD09404P 


jveratm, mgn suiiur ±>z protein 


1/1 /I /C A A ^ 1A 1AO 1 f 1 

14.46 9.74e-10 108-151 


1317 


TPR0094Q4P 

XX XJ vy 1(^7^ V_/ 


iveratin, mgn suiiur r>z protein 


1 A AC A Ol- 1 1 C1 /-\ /- 

14.46 9.82e-ll 52-95 


1317 


TPR0094Q4P 


Keratin, high sulfur B2 protein 


1/1 AC A OC~ OO 11A 1 

14.46 9.86e-28 219-262 


1^17 


TPR0094Q4P 


Keratin, high sulfur B2 protein 


1/1 /I /T A OA 1/T AO 1 O 

14.46 9.89e-16 93-136 


1317 


rPR0n9404P 


Keratin, high sulfur B2 protein 


1/1 a f r\ e\f\ 1 a t r\ a 1 a f—i 

14.46 9.90e-14 104-147 


1^17 

LJ I, / 


x xvUWo / Ox3 


Nematode metallothionein signature II 


7.66 9.76e-09 113-126 


1318 


XX XJ\J\J\J\J\J\J 


Vertebrate metallothionein, family 1 


11 /II 1 AA_ /\0 /I O O^T 

13.41 1.00e-08 42-87 


1318 


TPR000006 

XX XJW V/ \J\J\J \J 


v cneurdie metaiioimonem, iamny 1 


1Q /I 1 O 1 A „ AA 11C 1 1A 

13.41 2.10e-09 125-170 


1318 


XX DuvUUuU 


vericDrdie meiaiioimonem, iamny 1 


11 A I 0 »7yf _ AA 1 /I *-TA 

13.41 2.74e-09 34-79 


1318 


XX XJ\J\J\J\J\J\J 


vcneordLc meiaiiotmonem, iamny i 


11 /j 1 1 rn„ 10 ii/( 

13.41 3.59e-12 124-169 


1318 


TPROOOOOfi ! 

XX XJ\J\J\J\J\J\J 


v crLcDidie meiaiiotmonein, iamny i 


11 /ji /j 11 . 10 11A 1 /r /i 

13.41 4.21e-12 119-164 


1318 


IPB000006 

xx xj\j\j\j\j\j\j 


v crtcDidie meidiioimonem, iamny i 


11 /II /l f n„ AA 11 n 

13.41 4.58e-09 32-77 


1318 


XX JJVUUuUU 


v eneDidie meidiioimonem, iamny 1 


11 /II y< CH ^ AA A/1 1 1 A 

13.41 4.67e-09 94-139 


1318 


TPROOOOOfi 


vcrteDiaie metaiiotmonein, iamny I 


IO /11 C A /I ^ A A OO 111 

13.41 5.04e-09 88-133 


1318 


xx xj\J\J\J\j\JKj 


veneDraie metaiiotmonein, iamny 1 


11 A 1 C 11-. A A 1 A 1 1 A C 

13.41 5.13e-09 101-146 


1318 


xir xjuwvvv/O 


verieDraie metaiiotmonein, iamny I 


11 A I ^ 1 <C „ 1 A 1 AA 1 /I C 

13.41 5.16e-10 100-145 


1318 


xx xS>\J\J\j\J\J\J 


v eneprdie meidiiotnionem, iamny l 


1 O /I 1 C 1A„ 111 AA 1 C A 

13.41 5.20e-ll 109-154 


1318 


tprooooo^ 


vciLcuidLc nicidiioLiiionein, iamny i 


IO /I 1 c /in. 11 11A T n c 

13.41 5.40e-ll 120-165 


1318 


XX Jj\J\J\J\J\J\J 


Vertebrate metallothionein, family 1 


11 A I C CC ~ 1A11/1 1CA 

13.41 5.65e-10 114-159 


1318 


XJT £> \J \J \J \J \J u 


Vertebrate metallothionein, family 1 


11 yl 1 C CO ^ AA11A 1 r" r* 

13.41 5.68e-09 110-155 


1318 


TPROOOOOfi 

XX DUUUvUU 


verieDraie meiauoLnionem, iamny i 


1 1 yj 1 ^ IO- AA 1A/1 1/1A 

13.41 5.78e-09 104-149 


1318 


XX XJ\J\J\J\J\J\J 


Vertebrate metallothionein, family 1 


13.41 5.96e-09 55-100 


1318 




Vertebrate metallothionein, family 1 


11 A 1 ✓"Al 1 n AO 1>«o 

13.41 6.03e-l0 98-143 


1318 


TPRononn^ 


Vertebrate metallothionein, family 1 


13.41 6.50e-ll 35-80 


1318 


TPRnonnn^ 

lx OUUVJULfD 


Vertebrate metallothionein, family 1 


13.41 6.59e-l2 H5-160 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


1^416 07p-00 111 1 ^6 

IO. tl 0.27/c-U>' 1 1 1-1jO 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 7.61e-09 43-88 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 9.23e-10 83-128 


1318 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 9.67e-13 93-138 


1318 


IPB000254 


Cellulose-binding domain, fungal type 


18.11 6.86e-09 94-124 


1318 


IPB000359B 


Cystine-knot domain 


19.26 3,25e-12 147-165 


1318 


IPB000359B 


Cystine-knot domain 


19.26 5.29e-13 127-145 


1318 IPB000726A 


Glycoside hydrolase family 19 


14.41 9.88e-09 3-29 
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mo 
13 lo 


1PBUU0 /zod 


. _ . 

Glycoside hydrolase family 19 


539 3.40e-10 16-26 


1010 
13 lo 


1PB000726B 


Glycoside hydrolase family 19 


5.39 3.40e-10 9-19 


1 Q 1 O 

1318 


IFB00 1 Q07C 


von Willebrand factor, type C repeat 


20.89 4.27e-09 111-132 


1010 

1318 


1PB001 169K 


Integrin beta, C-terminus 


27.45 3.07e-09 118-160 


ill o 

1318 


1PB001 169K 


Integrin beta, C-terminus 


27.45 5.62e-09 25-67 


1318 


t t~» Tp. r\r\-\ i s~ fxr/ 

IPB001 169K 


Integrin beta, C-terminus 


27.45 8.91e-09 35-77 


1318 


ynriAA i <**»t i 

IPB001271 


Mammalian defensin 


19.97 5.91e-09 142-170 


1318 


IPB001271 


Mammalian defensin 


19.97 6.89e-09 107-135 


1318 


IPB001271 


Mammalian defensin 


19.97 8.41e-10 21-49 


loin 

1318 


1PB001762A 


Disintegrin 


23.93 3.03e-13 112-152 


I'll o 

1318 


1PB001762A 


Disintegrin 


23.93 4.18e-09 113-153 


1318 


TO"DAA1 0*^TO A 

1FB001762A 


Disintegrin 


23.93 5.60e-10 103-143 


1315 


TDDAA1 O^O A 

1PB001 /ozA 


Disintegrin 


23.93 5.60e-10 123-163 


13 18 


TAT~> A AO 1 O /I A 

IPB002174A 


Furin-like cysteine rich region 


30.51 1.97e-10 83-114 


1318 


1PB002174A 


Furin-like cysteine rich region 


30.51 3.92e-10 93-124 


1010 

1318 


TTJT5 A AO 1 *"l /I A 

IPB002174A 


Furin-like cysteine rich region 


30.51 8.20e-09 73-104 


1-11 o 

1318 


If>r>AA^ 1 'HA A 

IPB002174A 


Furin-like cysteine rich region 


30.51 8.88e-09 17-48 


1 Q 1 o 

1318 


IPB002174A 


Furin-like cysteine rich region 


30.51 9.27e-10 3-34 


1318 


TTinAAn /I A A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 l.lle-11 101-134 


1318 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 1.13e-17 130-163 


1318 


IPB 002494 A 


Keratm, high sulfur B2 protein 


12.44 1.31e-10 116-149 


1318 


TT1T5 A AO A A A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 1.78e-12 105-138 


1318 


IPB 002494 A 


Keratm, high sulfur B2 protein 


12.44 2.00e-12 25-58 


1 "2 1 Q 

1318 


Tiro AAO A C\ A A 

IPB 002494 A 


Keratin, high sulfur B2 protein 


12.44 2.16e-09 31-64 


1318 


TDDAA1/1A/1 A 

IPB 002494 A 


Keratin, high sulfur B2 protein 


12.44 2.63e-15 140-173 


1318 


T n n A AO A C\ A A 

IPB 002494 A 


Keratin, high sulfur B2 protein 


12.44 3.32e-18 115-148 


1318 


TTVIO A AO A C\A A 

1PB002494A 


Keratm, high sulfur B2 protein 


12.44 3.44e-16 135-168 


111 o 

1318 


T Tj t-» a a o a c\ a a 

IPB 002494 A 


Keratm, high sulfur B2 protein 


12.44 4.21e-16 120-153 [ 


1 1 1 o 

1318 


TTJT) A AO AC\ A A 

IPB 002494 A 


~rr j * 1*1 1 _gy t"* » * 

Keratin, high sulfur B2 protein 


12.44 4.48e-10 100-133 


1 **> 1 O 

1318 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 5.04e-13 91-124 


1318 


IPB002494A 


Keratm, high sulfur B2 protein 


12.44 5. 81e-10 30-63 


1110 

1318 


TTHO AAO A A A A 

IPB 002494 A 


Keratin, high sulfur B2 protein 


12.44 6.32e-09 106-139 


1110 

1318 


TTJTO A AO A A /i A 

IPB 002494 A 


XT' l ' 1*1 1f> T"»i<-» j • 

Keratm, high sulfur B2 protein 


12.44 6.67e-12 14-47 


1318 


IDDAAO Af \ A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 6.89e-12 20-53 


1110 
1318 


T ATO A AO /l A /I A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 7. 03 e- 10 96-129 


1 O 1 o 

1318 


TTJT3 AAO A C\A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 7.65e-10 121-154 


ioi o 

13 18 


T"D"DAAO/f A/1 A 

lrBU(J24y4A 


Keratin, high sulfur B2 protein 


\2A4 8.35e-09 21-54 


1010 
13 18 


TT»T> AAO A A A A 

IPB002494A 


T/ j • 1*1 1J? TOO j • 

Keratm, high sulfur B2 protein 


12.44 8.88e-15 111-144 


1-51 O 

1318 


TTVT"* A AO A A A A 

1PB002494A 


XT _ j." 1*1 1 r* t> r\ . ' 

Keratin, high sulfur B2 protein 


12.44 9.03e-09 26-59 


13 18 


TTVD AAO /l A /t A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 9.22e-12 86-119 


1 1 1 o 

1318 


T TITO AAO A C\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 9.25e-ll 24-57 


mo 
1318 


TnnAAi a r\ a a 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 9.57e-19 110-143 


1^1 o 

1318 


TTIDAAI A f\ A A 

IPB002494A 


Keratin, high sulfur B2 protein 


12.44 9.73e-18 125-158 


nio 

1318 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 1.51e-09 124-138 


1 1 1 o 

1318 


1PB002494B 


Keratm, high sulfur B2 protein 


10.58 2.27e-09 159-173 


"ma 

LJ I O 




Keratin, high sulfur B2 protein 


ia co o cc« io 

IU.Do 3.3De-lz 139-133 


1318 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 6.70e-12 129-143 


1318 


IPB002494B 


Keratin, high sulfur B2 protein 


10.58 8.45e-ll 149-163 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.27e-ll 114-157 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.27e-ll 99-142 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.41e-27 128-171 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 1.70e-10 104-147 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 2.08e-13 98-141 
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SEQ 

ID 

NO: 


Database 
entry ID 


Description 

. 


Results* 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


14.40 j.4oe-lo Hj-IjO 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A£L 1 CO„ 1 A 1 CiH 1 Cf\ 

14.46 3.Boe-10 1U7-150 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A£L A nn a 11 110 1<< 

14.46 4.09e-l 1 112-lDD 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a A£L A nc\~ nn i no , \az 
14.46 4. /Ue-Uy 1UZ-14D 


1318 


IPB002494C 


T J" j • 1*1 1 T"» O A. * ■ 

Keratin, high sulfur B2 protein 


1 a A£. a oa^ m 1 no 1 CI 
14.46 4.o4e-21 IUo-IjI 


1318 


1PB002494C 


Keratin, high sulfur B2 protein 


1 A A£. ^ 1 Ao 1Q 111 1 £A 

14.40 D.iue-iy iz^-ioo 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a /i^ccc« 11 1*7 ezr\ 
14.40 D.jDe-1 1 1 /-0U 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A A/Z C T7„ 1 H 1 n*2 1 A £Z 

14.46 D. / /e-1 / 1U3-146 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


i /i /i/;coo^. 11 1 no 1^0 
14.40 D.oze-1 1 iuy-ljz 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AC C 07^ 10 T3 ^T/C 

14.46 0.0 /e-1 3 23-06 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a a 'i 01 „ rvn n /i in 

14.46 /.ole-uy y4-l 3 / 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 yl /T T O /f ^ 10 1T> 1/TC 

14.46 7.o4e-12 lzz-165 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 a ac 0 Af\* nn a 0 m 
14.46 o.40e-0y 4o-yl 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i 0 c~i^ on 110 i/ri 

14.46 8.57e-2y Ho-161 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i 0 ^rn^. in 00 ni 

14.46 8.69e-lU 88-131 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1 A AC O CC\ n 10 1 n 1 /TA 

14.46 S.69e-12 117-160 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


i/i /i /T r\ iT- aa no 11c 
14.46 9.33e-0y 93-136 


1318 


IPB002494C 


Keratin, high sulfur B2 protein 


1/1 /l/T A 71« IO 1 T7 1 *7n 

14.46 9.71e-13 lzV-l/U 


1318 


IPB003571B 


Snake toxin 


lo.Uo 4. /ze-Uy 3Z-DD 


1318 


PR00874C 


Fungi-IV metallothionein signature III 


yi T7 r\ 1 a ~ 1 a ac inn 

4.37 9.74e-10 95-109 


1318 


PR00876B 


Nematode metallothionein signature II 


r-t 0 17-. 1 A 1 nT 1 OA 

7.66 2.27e-10 107-120 


1319 


yr\Tk A A AAA A 

IPB000822 


Zinc finger, C2H2 type 


1 A CH O AO„ 10 1 7C OAA 

14.67 3.08e-18 1 /3-zOO 


1319 


IPB000822 


Zmc finger, C2H2 type 


14.67 4.75e-19 147-172 


1319 


YT"*T"* A A A A A A 

IPB000822 


Zmc finger, C2H2 type 


1 /l £T*7 /I A/1~ 1/1 11A 1/1/1 

14.67 4.94e-14 119-144 


1319 


Tt\T^ A A A A A A 

IPB000822 


Zmc finger, C2H2 type 


1 A C~S O A1« 1 O CI OO 

14.67 7.92e-18 63-88 


1319 


THT* A A A A A A 

IPB000822 


Zmc finger, C2H2 type 


1 /I /CO O O 01 OA1 OOO 

14.6/ 8.36e-zl z03-zzb 


1319 


IPB000822 


Zmc finger, C2H2 type 


1 A C~l OO/C^OI m 1 K 

14.67 8.36e-21 91-116 


1319 


IPB000822 


Zmc finger, C2H2 type 


1 /I /CO r\ ~> 1 „ iCT A 

14.67 9.3 1 e-1 0 35-60 


1319 


TT*ri A A *t AAA 

IPB001222 


rpriTT c*t * * 1 1 1 _^ * 

TFIIS zmc ribbon domain 


O/l /CO /C OA^ AA IOC 011 

24.63 6.70e-09 175-211 


1319 


IPB001275 


DM DNA binding domain 


in 10 0. ai ^ 11 on 110 
19.1 / 3.43e-13 /9-118 


1319 


IPB001275 


DM DNA binding domain 


1 n 1 0 /i 77« 1 n 1 /Co ono 
19. 1 / 4. / /e-10 103-zOz 


1319 


Tr\r\ A A *1 A^ F 

IPB001275 


DM DNA binding domain 


in 1^7 /T r\ 1 „ 11 ci nn 

19.17 6.91e-ll 51-90 


1319 


IPB001275 


DM DNA binding domain 


1 A 1 O O 1A„ 1 A 1 AO 1 AC 

19.17 7.20e-10 107-146 


1319 


IPB001275 


DM DNA binding domain 


1 A 1 O A AA« AA OO /CO 

19.17 9.09e-09 z3-6z 


1319 


IPB001275 


DM DNA binding domain 


in ion 1 a„ i/i 10c 17/t 

19.17 9.10e-14 135-1 /4 


1319 


IPB001275 


DM DNA binding domain 


1A 10AOA« IO 1 A1 OOA 

19.17 9.29e-13 191-230 


1319 


IPB002867C 


Cysteme-nch domain (C6HC) 


1A AC O 1 A ~ AA OO A Ci 

19.46 8.34e-09 32-49 


1319 


PR00048A 


C2H2-type zinc finger signature I 


r\A/f 0 00 1AOAA 010 

9.94 2.23e-10 200-213 


1319 


T"fcY^ A A A A A A. 

PR00048A 


C2H2-type zmc finger signature I 


r\ C\A O O A ^ 11 Cf\ OO 

9.94 3.84e-ll 60-73 


1319 


T\T\ A A A j€ A K 

PR00048A 


C2H2-type zinc finger signature I 


9.94 4.38e-09 88-101 


1319 


TlT"* AAA A A A 

PR00048A 


C2H2-type zinc finger signature I 


a A c c /~\ ~ irk 00 /tc 

9.94 5.50e-10 32-45 


1319 


PR00048A 


C2H2-type zinc finger signature I 


A A/1 O 77« 1 n 1 00 1 oc 

9.94 8.77e-10 l/z-185 


1319 


PR00048A 


C2H2-type zinc finger signature I 


r\ Ci A C\ 1o„ 1 a 1 A A 1 C7 

9.94 9.18e-10 144-15/ 


1319 


PR00048B 


C2H2-type zinc finger signature II 


i"ZT01 y^/\_ Afl "lOO t A 1 

5.52 1.00e-08 132-141 


1319 


Tpfc AAA A AT'fc 

PR00048B 


C2H2-type zinc finger signature II 


5.52 2.50e-09 48-57 


1 3 iy 


T>"D AAA/1 OT2 


C2H2-type zinc finger signature II 


< ^9 ^. ^7^. i n 1 i /^q 
j.jz j.j /e-iu lou-ioy 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 4.00e-ll 104-113 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 5.50e-09 76-85 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.14e-10 188-197 


1319 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.75e-ll 216-225 


1320 


IPB002041A 


GTP-binding nuclear protein Ran family 


11.12 9.17e-10 4-48 


1320 


PR00449A 


Transforming protein P21 ras signature I 


12,48 1.29e-14 4-25 


1321 


IPB000006 


Vertebrate metallothionein, family 1 


13.41 6.00e-ll 71-116 
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Results* 


1 321 


1PB000006 


Vertebrate metallothionein, family 1 


13.41 9.72e-09 146-191 


1 1 1 1 

1321 


1PB000726B 


Glycoside hydrolase family 19 


5.39 2.54e-09 66-76 


1 O 1 1 

1321 


Tn \n a a a a £i it? 

1PB000967E 


Zinc finger NF-Xl type 


21.88 1.24e-09 95-135 


1321 


1PB001169K 


Integrin beta, C-terminus 


27.45 5.38e-09 56-98 


1321 


t"A"f»aai '-i f r\ a 

IPB001762A 


Disintegrin 


23.93 9.29e-09 69-109 


1321 


1PB002494A 


Keratin, high sulfur B2 protein 


12.44 1.56e-12 161-194 


1321 


1PB 002494 A 


Keratin, high sulfur B2 protein 


12.44 2.06e-09 146-179 


1321 


TT*T> f\ /"V» A f\ A A 

IPB002494A 


Keratm, high sulfur B2 protein 


12.44 2.94e-09 103-136 


1 O 1 1 

1321 


1PB002494A 


Keratm, high sulfur B2 protein 


12.44 4.00e-14 81-114 


1 1 

1321 


1PB 002494 A 


T7" j * 1* 1 1/» T~* "1 j. * 

Keratin, high sulfur B2 protein 


12.44 4.19e-09 72-105 


1321 


lPB0U24y4A 


Keratin, high sulfur B2 protein 


12 .44 5.26e-09 67-100 


1 10 1 

1321 


1PBUU2494A 


Keratin, high sulfur B2 protein 


12.44 5.38e-13 156-189 


1 'JO 1 

1321 


T"D"D AA1 A C\A A 

1PBU02494A 


Keratin, high sulfur B2 protein 


12.44 5.56e-14 71-104 


1321 


TT)~r) A A1 AC\ A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 6.32e-09 166-199 


1321 


1PB002494A 


Keratin, high sulfur B2 protein 


12.44 6.56e-12 123-156 


1321 


TfiT^AAl A A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 7.34e-10 124-157 


1321 


1PB 002494 A 


Keratin, high sulfur B2 protein 


12.44 7.39e-09 82-115 


1321 


TT1DAAO AC\A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 7.48e-09 151-184 


ion 

1321 


TTIDAAI A A A A 

IPB002494A 


T/" j • 1*1 IX* T"*'**, j ♦ 

Keratin, high sulfur B2 protein 


12.44 7.58e-09 57-90 


1321 


IPB002494A 


Keratm, high sulfur B2 protein 


12.44 7.60e-14 118-151 


1321 


TrinAA^ A t\ A A 

IPB002494A 


Keratm, high sulfur B2 protein 


12.44 7.97e-09 86-119 


1321 


TTIDAAI A A A A 

1PB002494A 


Keratm, high sulfur B2 protein 


12.44 8.33e-12 113-146 


ion 
1321 


TTJ'O A A1 A A A A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 8.35e-09 62-95 


1321 


TT>DAA1/IAyl A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 8.97e-16 76-109 


ion 
1321 


TnDAAOylA/l A 

1PB002494A 


Keratin, high sulfur B2 protein 


12.44 9.13e-09 147-180 


1321 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 9.14e-ll 66-99 


1321 


tnr* aai a r\ a r* 

1PB002494B 


fr j • 1*1 i r> r% r\ j • 

Keratm, high sulfur B2 protein 


10.58 1.13e-09 70-84 


1321 


IPB002494B 


Keratm, high sulfur B2 protein 


10.58 3.53e-ll 122-136 


1321 


TT»r»AA1 A f\ A T*\ 

IPB002494B 


Keratin, high sulfur B2 protein 


10.58 4.17e-09 142-156 


1321 


tnr>AA1 AC\ A T» 

1PB002494B 


-Tjr ■ • 1*1 IX" T-» i ' 

Keratm, high sulfur B2 protein 


10.58 4.17e-09 22-36 


1001 

1321 


TtiDAAO A A A T*» 

1PB002494B 


-fir . • 1*1 1 /*» "f»./"» j • 

Keratin, high sulfur B2 protein 


10.58 5.56e-09 137-151 


1321 


TTIDAAI A A /in 

1PB002494B 


j_*^1-i1 i/> -r\ <*> . • 

Keratm, high sulfur B2 protein 


10.58 6.58e-09 80-94 


ion 
1321 


TTJT5 A A1 A A A T» 

1PB002494B 


tr 1*1 i r* f\ , 

Keratin, high sulfur B2 protein 


10.58 6.78e-10 95-109 


1 n i 
1321 


TTJDAAO A A A /"* 

1PB002494C 


Keratin, high sulfur B2 protein 


-A A A a*~ "t t* tT\ *f A f\ *f s~ A 

14.46 1.50e-14 21-64 


1321 


TDDAAO AC\A i~i 

1PBU02494C 


Keratin, high sulfur B2 protein 


14.46 1.59e-09 70-113 


1 io 1 
1321 


lPr>UU24y4C 


Keratin, high sulfur B2 protein 


14.46 1.96e-10 6-49 


1 101 
132 1 


TT>TD AAi AC\Af* 

Lr 130024940 


Keratin, high sulfur B2 protein 


14.46 2.01e-09 36-79 


ion 
1321 


TTJTZ> AA1 A A A /~* 

1PB002494C 


Keratm, high sulfur B2 protein 


14.46 2.20e-21 74-117 


1321 


TT>"D A A1 A A A /— * 

1PB002494C 


Keratin, high sulfur B2 protein 


14.46 2.27e-13 164-207 


ion 
1321 


TT5T3 A A1 A A A /~1 

1PB002494C 


T/ • * 1*1 in f-» /-» . < 

Keratm, high sulfur B2 protein 


14.46 2.55e-ll 26-69 


1321 


T"0"D A A1 A A e~\ 

1PB002494C 


t/- a.' 1*1 i r~» i 

Keratm, high sulfur B2 protein 


14.46 2.70e-14 159-202 


1321 


TTJ"D AA1 A A A /~t 

1PB002494C 


tr a ' i*i 1 r* t~» f\ • • 

Keratm, high sulfur B2 protein 


14.46 2.86e-13 149-192 


1 Q 1 1 

1321 


1PB002494C 


ty j • i*i if* tta , • 

Keratm, high sulfur B2 protein 


14.46 3.10e-09 131-174 


1321 


1PB002494C 


Keratin, high sulfur B2 protein 


14.46 3.10e-09 75-118 


1321 


tnnAA'i a r\ a /-"i 

1PB002494C 


Keratin, high sulfur B2 protein 


14.46 3.30e-14 126-169 






Keratin, high sulfur B2 protein 


1 A AC A 1A n 11 OA 111 

14.4o 4.19e-12 54-12/ 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 4.36e-17 116-159 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.36e-ll 144-187 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.61e-16 79-122 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 5.79e-09 145-188 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.06e-12 111-154 1 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.53e-12 11-54 ! 


1321 


IPB002494C 


Keratin, high sulfur B2 protein 


14.46 6.91e-12 89-132 



WO 03/054152 PCT/US02/39555 



Table 3B 
413 



SEQ 
ID 
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Results* 


1 QO 1 


TDT3QOO/1 QA r~* 

!FoUUZ4y40 


: — rr-i : 

Keratin, high sulfur B2 protein 


14.46 7. lUe-lo 64-107 


1 n i 


lJrrJUUZ4y4C 


Keratin, high sulfur B2 protein 


14.46 7.31e-15 121-164 


1 1 

UZl 


lroUUZ4y4C 


Keratin, high sulfur B2 protein 


14.46 7.82e-ll 169-212 


1 JZ1 


T"DT3 A OO A O A f~* 

lroUUZ4y4C 


Keratin, high sulfur B2 protein 


t A A £Z T Of, 1 "> f A 1 AO 

14.46 7.85e-13 59-102 


ioZl 


lrJt>UUZ4y4U 


Keratin, high sulfur B2 protein 


1 A as* *~i r\r\ i/\-io/*v 

14.46 7.99e-10 139-182 


1 JZ1 


iJrJt>UUZ4y4C 


Keratin, high sulfur B2 protein 


1 A AS* 0 >~i £Z ^ 10 *\ c A 1 r^'~7 

14.46 8.76e-18 154-197 


1 SO 1 


lrl3UUZ4y4L, 


Keratin, high sulfur B2 protein 


14.46 9.07e-09 27-70 


1 lO 1 


1FBUU2494C 


Keratin, high sulfur B2 protein 


14.46 9.44e-12 31-74 


1 SO 1 


TTIDAm AC\A /~* 

li J r>UUZ494C 


Keratin, high sulfur B2 protein 


14.46 9.66e-18 69-112 


1 lO 1 

ioZi 


lrrJUUZ4y4C 


Keratin, high sulfur B2 protein 


1 A A f (~\ T~i 1 i 1/l/-\ 

14.46 9.73e-ll 106-149 


1 SO 1 


TTyr~> AAoni n A 

ir*r$uuzy iyA 


Trypsin Inhibitor-like cysteine rich domain 


15.56 6.57e-09 24-36 


1 soo 

1JZ / 


TDDAAA1 1 OD 

lrBOOOl loB 


Granulin 


7.94 l.30e-l0 29-67 


i ooo 


Tt>DAAA1 1 

IJrBUUUl IoCj 


Granulin 


12.18 8.85e-09 18-66 


1 soo 


1PB000203A 


GPS domain 


18.40 4.65e-09 44-74 


1 107 

1 jZ/ 


TDDAAAIAQ A 

IrBUUUZUiA 


uPb domain 


"t O A f~\ C CI I\f\ n f IIS* 

18.40 5.57e-09 86-116 


1 soo 

uzy 


TDDAAAAA^ 

lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 1.64e-09 58-103 


1 SOQ 


TT>T3nnnno/c 
IJrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 2.70e-ll 54-99 


1 soo 


TT>Dnonon/^ 
lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 3.39e-09 70-115 


1 QOQ 

uzy 


lrBUUUUUo 


Vertebrate metallothionein, family 1 


10 /l 1 O £l£*^ t\C\ *~I C 1 OA 

13.41 3.66e-09 75-120 


i no 


TDDAAAAA/; 

lrBUUUUUo 


Vertebrate metallothionein, family 1 


*l 0 ii 1 A cut *t r\ c 1 

13.41 4.87e-l0 51-96 


1 son 

uzy 


lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 5.59e-09 57-102 


1 ioo 


1DDAAAAA/C 

lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 6.00e-ll 50-95 


1 SOQ 

ij/y 


TDDAAAAA/J 

lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 6. 4 2 e- 09 65-110 


i q on 

i3zy 


TDDAAAAA/C 

lrBUUUUUo 


Vertebrate metallothionein, family 1 


10 /II n AO f\f\ A f\ CI c 

13.41 7.98e-09 40-85 


1 son 

uzy 


lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 8.07e-09 55-100 


1 ^ on 

lizy 


TT1DAAAAA/T 

lrBOOOOOo 


Vertebrate metallothionein, family 1 


13.41 8.48e-13 47-92 


1 SOQ 

i jzy 


lrBUUUUUo 


Vertebrate metallothionein, family 1 


13.41 8. 94e- 10 74-119 


1 no 

i3zy 


TDDAAAIC/t 

lrBUUUZ54 


Cellulose-binding domain, fungal type 


I8.l l 7.00e-09 43-73 


1 SOQ 

lozy 


lrBUUUDo4A 


Zre-Za rerreaoxin 


17.31 7.80e-09 1-19 


1 QOQ 

lozy 


TDDAAAC1 *7 A ' 

IrBUUUol/A | 


Prion protein 


8.34 1.85e-09 13-55 


1 SOQ 

lozy 


TDDAAAO 17A 

IrBUUUol /A 


Prion protein 


0 0 yl /i f-it^ r\c\ f~\ r* -1 

8.34 4.79e-09 9-51 


1 SOQ 

i3zy 


TDDAAA01 T A 

IrBUUUol /A 


Prion protein 


8.34 6.87e-09 12-54 


i 1 on 

ljzy 


TDDAAAO 1 T A 

IrBUUUol /A 


Prion protein 


Ct H A Ci S~ A t t f AO 

8.34 8.64e-ll 6-48 


1 SOQ 

ijzy 


TTJDAAAO HA 

IrBUUUo 1 /A 


Prion protein 


8.34 9.01e-10 10-52 


1 soo 

i^zy 


TDDAAAA^Tn 

lrBUUUyo /D 


Zinc linger JNr-Xl type 


10.42 7.77e-09 75-110 


1329 


IPB001169K 


Integrin beta, C-terminus 


27.45 3.19e-09 57-99 


i no 

1329 


1PB001169K 


Integrin beta, C-terminus 


27.45 8.05e-09 72-114 


1329 


IPB001169K 


Integrin beta, C-terminus 


27.45 9.15e-09 16-58 


1329 


TT\r» ftni o ^ 1 

IPB001271 


Mammalian defensin 


19.97 3.29e-10 66-94 


1329 


IPB001271 


Mammalian defensin 


19.97 4.44e-09 73-101 


1329 


1PBUU1271 


Mammalian defensin 


19.97 5.42e-09 82-110 


1329 


1PB001271 


Mammalian defensin 


19.97 7.35e-10 44-72 


1329 


irin AA 1 ^ ^ a a 

1PB0U1442A 


C-termmal tandem repeated domain m type 
4 procollagen 


26.12 2.10e-09 4-56 


1 oon 
1329 


TBDAA1 /I /I O A 

1PBUU1442A 


C-terminal tandem repeated domain in type 
*t procollagen 


26.12 2.80e-09 1-53 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 2.58e-09 51-82 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.38e-09 34-65 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.97e-ll 2-33 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 5.28e-09 6-37 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 6.35e-10 26-57 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 7.62e-ll 30-61 


1329 


IPB002174A 


Furin-like cysteine rich region 


30.51 9.27e-10 18-49 
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Results* 


i ion 

13zy 


TDonno 1 HA A 
lrrJUUZl /4A 


runn-uice cysteine ncn region 


^0 51 9 27e-10 22-53 


loon 

1329 


TT>tDnnooo 1 r> 
IroUUZZZl-b 


"\X7 A P limn fWTltair A DrAf/am 1 ~rV\l 1 T*_ 

WAr-type ^ wney aciqic rrotemj iour- 

UloUlllvie tUlC UAJIllalll 


17 19 1 OOp-OR 6^-84 


1 19Q 

13Zy 


TPT3fifi9AQ4 A 
irr>UUZ4y4A 


avntin litcrVi cnl'Piir T19 T*vrntpin 


12 44 1 39e-09 44-77 


1 lOQ 

13Zy 


TPPiflfiOAQjd A 
ir rSUUZ4y4A 


jveraiin, nign bunur jjz piuicni 


12 44 3 23e-09 34-67 


1 3Zy 


TT>nnno/iQA A 

I r L> KJ U Z4 y 4 -A. 


rvcraiin, mgn sunur piuLcni 


12 44 7 75e-10 85-118 


1329 


IPB002494A 


Keratin, high sulfur B2 protein 


12.44 8.67e-10 47-80 


13zy 


lrr>UUZ4y4A 


jveraLin, nign suuur r>z proLcin 


1 2 44 9 03e-09 40-73 


1 ion 
1329 


lrr>UUZ4y4C 


js^eraun, nign sunur i>z proLciii 


14 46 9 11p-10 60-10^ 


1 ion 
13Zy 


TPr*nno^ i p 
IrrJUUZD j Ir 


\^oronavirus oi giycoproLcm 


ii Qi 7 68e-09 79-120 


i3zy 


IrrJUUZoO /1J 


^w/ysieine-ricn cioriid.in ^oiiL'J 


24 88 6 49e-09 53-84 


uzy 


TPnnnooi Q A 


irypbin iriiiiDiLoi~iiivc cyatciiic 11^11 uumaiii 


15 56 3 57e-09 67-79 


1 190 

i3zy 


ppnne^cp> 


crustacean meianoimonern bigiidLmc 11 


5 93 1 48e-09 55-73 


i ion 

1329 


rKUUo/ZA 


uiptera ^urosopnuaj meiaiiOLmonein 
signature I 


5 17 7 41 p-09 99-1 1 1 


lozy 


T)T> ni i 1 n A 

rKUl 1 1 /A 


V^J_/V^-0 CniOriLIC UllalUlw SslgllclLUlc l 


7 79 9 47e-09 66-78 


i ion 


PP m 1 1 1 P2 

rKUl 13 IrJ 


Pr\««avi-n 1^ ^ PvQ^ PinriofllfA TT 

Lonnexiiijo ^xjoj bigndiurc 11 


3 45 5 1 2e-09 42-54 


1 ion 

lizy 


rKUlZZoC 


riggsneu proiein signaiure 1x1 


5 69 1 22e-10 17-32 


1 1 on 
1329 


rKUlZZoC 


-tiggsneii proiem signature 111 


5 69 1 ?.?p-1 0 21-36 


1 ion 
1329 


rKUlZZoc 


Eggshell protein signature III 


5 69 1 99p-1 0 25-40 


1 10Q 

lizy 


ppm 99QP 1 
rJvUlZZol^ 


xiggsneii protein Mgnaiure 111 


5 69 1 22e-10 29-44 


1 lOQ 

13Zy 


ppni ooze* 

rivUlZZoV^ 


jjrggoiieii pioiciii oigiidLuie xxx 


5 69 1 22e-10 33-48 


1 19Q 

13Zy 




T-h crrrc!Vi<a11 nTTi+f^in oicmQtlTr<=» TT '. 
JZ/ggblieil piULClIl MgllaLUlC li-X 


5 69 1 22e-10 37-52 




riwlZZov., 


Th rrrrcVi (=>1 1 •nrr»f<= k in ci oniltlirf* TT^ 
x_/ggoiicii piULOin bigiidiuie xxl 


5.69 2.34e-09 41-56 


1 19Q 

lozy 


JrivUlZZo^ 


nggbncii pioLcni oigiidLuio 111 


5 69 5 86e-09 18-33 


1 lOQ 

i3zy 


ppni 992/^* 
JrxCUlZZo^ 


117 /-r/TcVi ^1 1 *%r , /-v+/3-i -n oionQti irp Til 

riggoncii pioLCiii oigiiciiuLc 111 


5 69 5 86e-09 22-37 


1 lOQ 

i jzy 


PPH1 0950 
rKUlZZol^ 


rrrvoTi /ai 1 «rA+oin oifrna+nfP TTT 

rj>ggbncii pioLein oigiidLuic xn 


5 69 5 86e-09 26-41 


1 ion 
1 3Zy 


ppm t>cp 


riggsnen protein signdLuie 111 


5 69 5 86e-09 30-45 


lion 

i3zy 


PP fM 99 CP* 
rKUiZZoL, 


xiggsneii protein signaiure 111 


5 69 5 86e-09 34-49 


1 ion 
13Zy 


pp ai 09QP* 
rKUlZZoL, 


nggsnen protein signdture 111 


5 69 5 86e-09 38-53 


1 lOQ 

uzy 


ppni oofiP 
r IxUlZZov^ 


riggsneii proiem bigiiaiuic 111 


5 69 6 48e-09 13-28 


1 nn 

1 33U 


TPP.nnn99 1 

Ir TdUUUZZ 1 


rrotdmine r 1 


5 48 1 00e-08 76-102 


1 iin 
Lddkj 


TPRnnn.99 i 

Ir UUUUZZ 1 


rrotammc r 1 


5 48 1 00e-09 83-109 


1 iin 
133U 


TPRnnn99 1 
ir JtSUUUZZ 1 


rrotamme r 1 


5.48 1.46e-09 65-91 


i iin 
IjjU 


TPtinfin.99 1 


DrA+omiMP P1 

rroLdiiinic r i 


5.48 2.19e-ll 103-129 


1 nn 

13 3U 


TPT3n.fi.A9 9 1 
IrrJUUUZZ 1 


rrotdmme r i 


5 48 2 59e-ll 64-90 


i iin 
133U 


TPT3nnn99 i 
IrrJUUUZZ 1 


rrotdmine r i 


5 48 2 73e-10 118-144 

• 1^ iJ Zmt • / «/ V XX/ X XV/ X i l 


1 iin 
133U 


TP"Qnnn99 1 
IrriUUUZZl 


rrotamme r i 


5 48 2 97e-12 74-100 

*_/ ■ 1^ U • S /XV X Z-r / "T X x/x/ 


i iin 
133U 


TPunnnoo 1 
Ir riUUUZZ 1 


rrotamme r i 


5.48 3.31e-09 109-135 


1 iin 
133U 


tppaaaoo 1 
IrrJUUUZZ 1 


rrotamme r i 


5 48 3 31e-09 122-148 

w» «T^i/ _/ • w/ X w \J L iwiw X~xJ 


1 1 1 n 

1 33U 


rDDnnnoo i 
IrrJUUUZZ 1 


Protamine PI 


5 48 3 91e-ll 78-104 


1 1 1 n 
133U 


TT>"Dnnnoo i 
IrrJUUUZZ 1 


rrotamme r i 


5 48 4 70e-l 0 62-88 


1 1 1 n 
1330 


rmDnnnoo 1 
IrrJUUUZZ 1 


rrotamme ri 


5 48 4 70e-10 94-120 


1 1 1 n 
1330 


TTi"Dnnnoo 1 
IrrJUUUZZl 


Protamine PI 


5 48 5 15e-09 107-133 


1 iin 
I33U 


TPT5nnno9 1 
IrrJUUUZZ 1 


rroLdiiinic r i 


5 48 5 27e-09 52-78 


1330 


IPB000221 

XX X^X/ X/V/^.*-* X 


Protamine P 1 


5.48 6.16e-ll 92-118 


1330 


IPB000221 


Protamine PI 


5.48 6.19e-09 116-142 


1330 


IPB000221 


Protamine PI 


5.48 6.43e-ll 99-125 


1330 


IPB000221 


Protamine PI 


5.48 6.88e-09 111-137 


1330 


IPB000221 


Protamine PI 


5.48 6.88e-09 97-123 


1330 


IPB000221 


Protamine PI 


5.48 7.62e-ll 60-86 


1330 


IPB000221 


Protamine PI 


5.48 8.89e-10 101-127 


1330 


IPB000221 


Protamine PI 


5.48 9.30e-12 63-89 
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1330 


IPB000271 


rr 1 ; — . - — 

Ribosomal protein L34 


1J,0 / 7. /OC"U7 11 l"l*tO 


1330 


TT>T~» AAA yl AAr» 

IPB000492B 


Protamine Z (PK.MZ) 


^ Ifk ^ R4r-0Q 7^-100 


1330 


IPB000492B 


Protamine z (PKMz ; 


J.ZO J.OOC-l 1 70"lJi 


1330 


TDDAAA/imD 


Protamine z i^r Kiviz j 




1330 


1PB000492B 


Protamine z (jr kjviz j 


^ 9^ fi7f»-0Q 107-141 


1330 


TT)tlAnA/tAOt) 

1PB000492B 


Protamine z (^pkjyiz j 


^ 9^ 6 07*=-10 10^-1^7 


1 o i r\ 

1330 


lPBUU04yzB 


Protamine z ^rKiviz j 


^ 96 7 7Sp-0Q 77-1 1 1 


1330 


1PB000492B 


Protamine z (PKJVLZ j 


SOAR 1 9p-1 0 1 06- 1 40 


1330 


TTJTD f\f\f\A OTO 

lPBUUU4y/B 


Protamine z ^JrKiviz j 




1330 


TT1DAAA/1 ftlD 

lrBUUU4yzB 


Protamine z (jrxcivizj 


5 96 8 S^p-10 105-1^9 


1330 


IPB000492B 


Protamine 2 (PRM2) 


5.26 9.06e-10 78-112 


1330 


IPB000492B 


Protamine Z (PKMZ) 


5 96 0 1 109-1^6 


1330 


TlTDAAA/tAOD 

1PB000492B 


Protamine z (^pkjvizj 


S 96 Q ^5p-1 1 7Q-1 1 ^ 


1330 


IPB000492B 


Protamine 2 (PRM2) 


5.26 9.69e-10 100-134 


1330 


IPB000492B 


Protamine 2 (PRM2) 


^ 1& Q Qfip OQ 111 1 AS 

j.zo y.yue-uy 111-1^40 


1330 


IPB003134F 


Repeat in HS 1/Cortactin 


15.66 7.21e-09 183-231 


1330 


PR00055C 


HIV TAT domain signature III 


y. iz D.yze-uy io-oz 


1331 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 2.80e-10 54-91 


1332 


IPB002223 


Pancreatic trypsin inhibitor (Kunitz) family 


l /.oo z.oie-Zo yz-izo 


1332 | 


PR00759A 


Basic protease (Kunitz-type) inhibitor 
family signature I 


14.o / j. / le-IU iD-Qf 


1332 


PR00759B 


Basic protease (Kunitz-type) inhibitor 
family signature II 


n or i oo*a in mi 111 
lz.^j i.Zoe-iu lui-iii 


1332 


PR00759C 


Basic protease (Kunitz-type) inhibitor 
family signature III 


19 ATK 9 9 zip 1111 1-196 

lz.^fo z.z^+e- 1 1 i i i-izo 


1333 


TT1T» AAAOTO 

lPB00Uo22 


zinc linger, czriz type 


14 67 1 75p-1Q 901-996 


1333 


IPBUUUozz 


Zinc linger, v^ZriZ type 


14 67 1 75p-90 ^41-^66 


1333 ; 


lPBUUUoZZ 


r 7 r 4-M /-» -PlnfTor POLIO 

Ziinc linger, v^zriz type 


1 4 67 9 1 1e-1 S ^60-^94 


1333 


TDDAAAOT"> 

lrBUUUozZ 


Zvinc linger, t^zriz type 


14 67 5 50p-90 99Q-2S4 


1333 


TDDAAAOOI 

lrBUUUoZZ 


7i'ti fI«Aor /~*0 T_TO fima 

Z/inc linger, c-zriz type 


14 67 5 01p-91 285-^10 


ion 

1333 


TT1DAAAOT) 

1PB0U05ZZ 


Zinc linger, L^Zriz type 


14 67 7 00p-90 ^1 ^-^8 


1333 


TT1DAAAOT1 

IPBUOOozz 


Zinc linger, ozriz type 


14 67 0 1 Op-99 957-989 


1333 


1PB001275 


DM DNA binding domain 


1Q 17 1 9Qp_1 ^ 945-984 


1333 


lPBUUlz/D 


DM DNA binding domain 


10 17 9 ^4e-1 9 ^90-^68 


1333 


1PB001275 


DM DNA binding domain 


10 17 9 05p-1 0 9 1 7-256 


1333 


TT1DAA1 nr 

1PB001275 


DM DNA binding domain 


10 17^ 51 p-00 ^57-^06 


1333 


1PB001275 


DM DNA binding domain 


10 17 7 00p- 1 ^ 97^-^ 1 9 


1333 


1PB001275 


DM DNA binding domain 


10 17 7 1 5p-14 ^01 -^40 


1333 


TTJT> AA 1 AAA 

lrBuuiyuy 


J\JvAJL> DOX 


17 ^7 0 10e-^1 8-42 


1333 


T)T") AAA/1 O A 


i^zriz-type zinc linger signature i 


0 04 1 00e-1 1 1 98-21 1 


1333 


DT> AAA/1 O A 


VwZxiz-type zinc linger signature i 


0 04 1 64e-1^ 254-267 


1333 


T»T) AAA/1 O A 


\_/Zriz-type zinc linger signature i 


0 04 1 64e-1^ 282-295 


1333 


T»T»AAA/10 A 

PR00048A 


C2H2-type zinc finger signature I 


Q 04 6 9 1 p-1 1 996-9^0 

y.y*r O.ZrlC-1 1 £.L\J-£,Dy 


1333 


TI'nAAAytO A 

PR00048A 


C2H2-type zinc finger signature I 


0 04 6 7Qp-1 ^ ^66-^70 


1 


PT? 00048 A 


f^9 T-f9 -t vnp 7inp finp'Pr Qicrnntiirp T 


9.94 9.36e-13 310-323 


1333 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.53e-ll 338-351 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 1.00e-09 270-279 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.29e-10 326-335 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.50e-ll 242-251 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.00e-09 298-307 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.00e-12 354-363 


1333 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.50e-09 214-223 
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£VC3U113 


IjjO 


ppoorokr 

X xWJ \J O V UD 


\/inr»ii1in cionntiirp TT 


4.27 9.04e-09 333-346 




ppooro^r 


\/inr*n1in cionatnrp TT 


4.27 9.64e-09 332-345 


1 JjO 


PP01 91 7rj 
ri\U 1 Z 1 / O 


Prr\1 imp ri/^Ti p\ r tpn Qi'n QionfitiiT'P \/TT 

X I011I1C illsll CAlClluSllI OlgXlClLUl. \j V XX 


4.02 7.47e-09 399-424 


1337 


PR00806B 


Vinculin signature II 


4.27 9.04e-09 333-346 


1.3:5 / 


JrKUUoUOJo 


vincuiin signdiure 11 


4 27 9 64e-09 332-345 


1337 


PR01217G 


Proline rich extensin signature VII 


4.02 7.47e-09 399-424 


1338 


IJrBUUl jZuA 


Ly-o/u-rAK aomain 


1 3 24 8 33e-09 7-22 


1338 


t-»t> r\ 1 o 1 A 

rKU122oA 


Expansin signature I 


S 84 6 1 le-09 259-273 


1339 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 7.30e-10 182-209 


1340 


IPB000729D 


rMF-zz/JbMF/MrVU iamiiy 


iq q<7 0 1 73-200 

10.70 /.JVJC-iv/ 1/J jL,\J\J 


1341 


IPB003397B 


Mitochondrial import inner membrane 
translocase subunit Tim 17 


91 1 S Q 7Qe_l 0 60-1 OS 


1343 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 4.94e-16 187-236 


1343 


IPB001003B 


MHC Class II, alpha chain, alpha- 1 domain 


14.72 9.90e-10 195-238 


1343 


1PB003006A 


Immunoglobulin and major 
histocompatibility complex domain 


17 <i 5 ?0R-?^0 

l/.Jl O.JUC 1U iV/O ZtJU 


1344 


IPB002499N 


Major Vault Protein repeat 




1345 


PR00462B 


Fungal lignin peroxidase family signature II 


14.83 4.74e-09 34-52 


1347 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


90 9^ ^ 77p-0Q 909-9^0 


1347 


IPB003006B 


Immunoglobulin and major 
nisiocompaiiDiniy complex uuiiidxii 




1347 


PR01236A 


Tumour necrosis factor beta (lymphotoxin- 
alpha) signature I 


4.92 5.60e-09 3-19 


1348 


IrBUUoUUoB 


Immunoglobulin and major 

Vii erf r\r»/^»mnciTil*ii 1i+\/ c nwi n 1 Y 0 011*151111 
niaLOL/UIIipd.lIUlllLy L/UHipiCA. uvjliiciiii 


90 93 1 36e-10 403-440 


1350 


PR01218B 


Pistil-specific extensin-like signature II 


8.47 4.88e-09 115-138 


1353 


"015 A A A 1 A A 

rKUUUiyA 


Leucine-rich repeat signature I 


1 1 79 7 ^^e-10 193-206 

I 1. / A /.JJC XV X^J Z»V/W 


1356 


IPB002896F 


Herpesvirus glycoprotein D 


21.08 9.22e-09 327-362 


1356 


IPB003006B 


Immunoglobulin and major 
nistocompatiDiiity complex aomam 


90 9^ S 1 Sf>-0Q 976-^1 ^ 

Akj.jLD J . 1JC-U7 Zr/U-JXJ 


1357 


IPB002159A 


CD jo iamiiy 


JU.vJO J.ZUC"J7 Z.JO J X X 


1357 


tat* aao 1 cad 

1PB002159B 


CjJjo iamiiy 


9Q 9S S 1 1^4-^7Q 


1 1 CA 

1359 


IrBUluUUoA 


Immunoglobulin and major 

Vii of /~>r« /->mr"\of 1 V\l 1 if "\/" C r\V>1 nl P> Y H Am Clin 

nioivjcuiripdi i unity uumpicA. uuiiidiii 


17 51 1 00^-10 160-182 




lrBUUjUUOB 


lmmunogioouiin anu nidjui 

Viictr\r»omTmtiV»i1itv pom a! PY HAmain 

IllotUL'UllipclLlL/llll.jf V^VJlllL^lt^A. VlVJllldlll 


20 23 6 09e-ll 206-243 


1 jOj 


PR01 A'X'Jir 


Pal^cmfin ci on atilTf* ✓vT 


2.19 8.43e-09 873-895 


IjOO 


TPR000RR5 A 
1J: DUUuoo J/V 


T7iV\ri1lQr r»r\1 1 q ctfti (^-tPTTnitinl H Amain 

J/lUlllldl V^vJll<liit/ii ^> lv^i iiiii lai uwinciiii 


11.46 1.00e-15 200-237 


1300 


TPR000SR5 A 


PiT^fillot* oril 1 a crpn f^—f pt*TTl iTlfll Hornnin 
l^lDllllcll UUllct^Cll LCI liiiiia.1 vXWlllctlli 


11.46 1.20e-10 167-204 


IjOO 


tprooors^ a 


T?fl-vrillot* 1 ci (Tpn (^-l"f : »TTrii'n5il H rtmnin 


11.46 1.25e-16 191-228 


IJOO 


TPR000RR5 A 
IJrxjUUUOo j/y 


T7iV\ri11ar r»r\l 1 c> crpn l^-tprmiTial Hnmair 
riUllllclI Cvllctgcii Lwiniiiiai ui^iiicliii 


11.46 2.00e-13 203-240 


IjOO 




THiV\r*i11 a-r r»r\1 1 a apn C^—t^Trnirml Haiti e> in 
-TlUrilldX UUlictgCIl 11111 icll uVJlllctlll 


11.46 2.20e-20 179-216 


IjOO 


tproooqp.*; a 


PiKfillaT* r»rv1 1 (Tpri (~ 1 -'l"f i TTninn1 HAmsiin 
JTlDrillcll CUiid^cii v^-tci liiiiiai uvjiiictiii 


11.46 2.80e-18 176-213 


IjOO 


lJrJtSUUUoo j/\ 


T?iV\rill or rr\\ 1 Q crf^n l^-tf^rmillfll rl r»m am 

riDriiidi cuiidgcn Ldiimicti uuiiictiii 


11 46 4 29e-19 182-219 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.58e-10 209-246 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.15e-09 164-201 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.85e-16 173-210 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.20e-20 185-222 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.99&-11 206-243 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.03e-15 197-234 


1366 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.11e-13 170-207 


1366 | IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 9.37e-16 194-231 
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1366 


tod A AO Q C *I A 


T7iKrt11ar rnllflffpn C*- terminal domain 
JTlDIlllcU COllagCll LCI iiiuiai uvjuiaui 


11.46 9.71e-19 188-225 


1366 




rlDrillar COliagen v^-lCHIllIiai uuinani 


19.15 1.12e-19 174-227 


1366 


t"dt3Aaaog<"Q 


"P-iT-vnill or r»/VHarrf»i*> P-ff^tMYTinal H OTT1 PI 1T1 
JrlOrilidl COllagCll V_/~LClIlllliai UUlliani 


19.15 2.73e-20 186-239 


1 o cc 

1366 


TDDAfkAQ Q <T3 

lrr>UUUooDl5 


T7iVtT'i11ar' r*r»11cicrf»n f^-tfMTninfll dATTiain 
rlUIlllal OUliagCli v>— IClliiniai 


19.15 3.59e-17 171-224 


looo 


tdtjaaacr^r 
IJ^rsUUUooDrs 


TN'hrillar rAllacrpn C-tprminal domain 


19.15 4.58e-16 189-242 


1366 


Ir^otlUUooDiJ 


"RiKfillciT r-z-JI a crp>n (^-tprrnirial d Amain 

.FlDrlllal COllagCll ^-tClllllliai uvJiiictin 


19.15 5.11e-10 153-206 


1366 


IrrSUUUooDo 


TTiVirtllnr rAllacrpn O-terrninal domain 

J/lUllliai vvllAj^vll Lvl 1111 1 114-1 UUilluill 


19.15 5.31e-12 159-212 


1366 


lrr>UUUooDr> 


TTiKrillcir nrVHacwm f^-tprrninal domain 

j/lOrillaT CUliagCll Vy-iciliiiiiai dwiiicini 


19.15 5.77e-16 192-245 


1366 


IFJdUUUooDd 


"Cil-will o-r f*r%\1a of*n f^-t fnrmTial dAtnain 


19.15 5.98e-15 162-215 


1366 


IrhJUUUoojo 


TTiTwillo-r r>i^i11 q crr^n f^-tprrmnal dAtnain 
j7iDriii«r coiiagen ^-tcinimai vjvji iiciiii 


19.15 6.06e-14 165-218 


1366 


IrrJUUUoojo 


T?iVw tlla-r r«r\11 a cr/=»n f-tprmiTial d Amain 
flDilllclI UL/Xlct^Cll tClllllllcii V-iwiiidiii 


19.15 6.18e-20 177-230 


1366 


IFliUUUooDrJ 


T7iVkvil1oT- n/-\11c«crf=>n O-f f^rm i n a 1 dAmain 
riunilal t/Ulla-gCli v^-LC?iiiiiiiai vauiiidiii 


19.15 6.46e-14 195-248 


1366 


lrrJUUUooDrJ 


TT-Jl-wi 1 1 of r»r\11 a ov^n ^-tf^rminal dATTiain 
J/lDrillaL COlldgCll \^/-LCllllllla.l cnjiiiciiii 


19.15 6.66e-10 201-254 


1366 


IriDUUUoojo 


PlDrillar COllagen \^-LCIlIllllal uuiiiaiii 


19.15 6.87e-13 198-251 


1366 


Tt)DAAAOC\D 


T7iT-%-ril1of r>/-il la cmn r ( -tprm inn! dATTiain 
PlDrillar COllagCIl v^-LClliiinai uwiiiaiii 


19.15 7.36e-19 168-221 


1366 


TDDAAACC<T5 


T7-iVwill or r>/-»1 lo rrp»n r ( -t< 3 rtninn1 dATTiain 
PlDilllai COllagCll v^**icniiiiiai vawiiiaiii 


19.15 7.55e-13 156-209 


1366 


TDDAAAOC^D 

IFoUUUooDo 


l^'-^l^fi^1c»■r r»r\11aorp*n f^-tPTlTlinal dATTiain 

j/lOilllar COllagCll v^~iciiiniiai uv^xiiaiii 


19.15 8.78e-12 150-203 


1366 


ron aaaqq^"d 


tj'-IT-.viI 1 or* n <-\] 1 ci crr^n (~*—t prmitial dAmain 
j/lDrillal COllagCll \^»~lClllliiiai u.\ji.iiaiii 


19.15 9.16e-24 180-233 


1366 


IFrJUUUooDrs 


TH-iKrilloT* r»rt11c»o , #»fi (^-tprrnmal domain 

i71Drillal COllagCll \»^~Udiiiiiiai vav^iiiciiii 


19.15 9.60e-22 183-236 


1366 


TDT5AA 1 ATI A 

lrBUUlU/JA 


L>onipienienL v^iu t pioLcin 


22.14 1.00e-13 202-236 


1366 


TDDAA1 mi A 


^oiTipienieni 1 1| protein j 


22.14 1.46e-16 193-227 


1366 


TDDAA 1 A*71 A 


^-Ornp l em en l m piotcm 


22.14 2.76e-15 196-230 j 


1366 


TDDAA1 A7'3 A 

IroUUlU / JA 


Pnmnlpmpnt Pin nr ATPin 
l_xOITipiClIieilL 1 LJ piULClll 


22.14 3.77e-16 190-224 


1366 


TT3TDAA1 (\T1 A 

IroUUlU / 3A 


^ornpienienx ^114 piotcm 


22.14 3.90e-10 208-242 


1366 


TDDAA 1 ATI A 


^ompiemeni oiq protein 


22.14 4.69e-16 175-209 


1366 


TDDAA1 AT3 A 

lrr>UUlU /JA 


i^ompiement i^it| protein 


22.14 5.83e-15 178-212 


1366 


TDDAA 1 A^71 A 

IFrJUUlU / jA 


v^ornpieiiieiit v^itj jjiuiom 


22.14 6.18e-19 184-218 


1366 


IKtJUUlU/ jA 


Pr»t«n1 ptyi pn+ C~^1 ft TirATpin 

v^ompieiiient ^-itj jjujiciii 


22.14 7.51e-13 181-215 


1366 


IPB001073A 


Complement Clq protein 


22.14 8.02e-15 205-239 


1366 




complement d (4 pi oieni 


22.14 8.40e-14 199-233 


1366 


IPB001073A 


Complement Clq protein 


22.14 8.78e-10 172-206 


1366 


IPB001073A 


Complement Clq protein 


22 14 8 85e-13 187-221 


1366 


IPB001073B 


Complement Clq protein 


20.88 7.26e-29 281-315 


1366 


IPB001073C 


Complement Clq protein 


1^ 07 1 S7e-14 ^49-368 

1 J,U/ l.O /C ±*T JTJ' ~>\J<J 


1366 


IPB001073D 


Complement Clq protein 


7.60 8. 20 e- 13 383-392 


1366 


IPB001442A , 


C-terminal tandem repeated domain in type 
4 procollagen 


Of* 19 1 44^-1 8 

z.o.1^- l.ttC'io i \jy i 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


9£ 1? 1 S9p-?^ 181-73^ 


1366 


TT>r\ a a 1 A A r\ A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


96 19 9 OSp-14 160-912 


1366 


T--r»T% AA1 A A r\ A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


96 19 3 00e-18 166-218 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


96 19 1 90^-1 6 909-9S4 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 

4 rvrArAllacrpn 

"T |^1 UvUllUtV/ll 


10 Q 0^*=. 19 1 ^7„90Q 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.72e-13 196-248 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.93e-17 193-245 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.97e-12 199-251 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 


26.12 4.08e-21 175-227 
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4 procollagen 




1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Of 19 A A*\e* 1f 1 on 949 

zo.iz 4.43e-io lyu-ZHZ 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


ntz n /i 7/i Q in 1 qa oif 

Zo. iz 4. /4e-iy 154-zoo 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


1£. n < A1-, oo 177 99 A 

zo.iz o.uie-zz i /z-zz4 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


of n ^ OA** no 1 *\A onf 
zo.iz o.Z4e-uy id4-zuo 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


zo.iz o.4ue-zz 1 1 Q-ZD\j 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Of, 19 f *\Af* 1 1 148-900 
zo. iz o.-j^+c-i 1 iHo-zuu 


1366 \ 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Of 19 f Qfe* 17 1 9.1 00.Q 
ZO.IZ O.oDc-1 / 1 0 f-ZDy 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Of tO f Q7o 1^1 fX 91^ 

Zo. iz o.y /e-io loj-zlo 


1366 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


Of IT Q Ala io 1^1 90^ 
ZO.IZ o.4/e-lZ 131-ZUD 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


17 "XQ 1 1 1 900 99Q 
LZ.do 1.0/e-ll ZVy-ZZy 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


TO O 1At* HQ 1529 909 

iz.oo z.^4e-uy loz-zuz 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


1 O 19 O Oo no 1 sc 90s 1 
IZ.Jo Z.oZe-Uy loo-ZUo 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


10 1Q "2 no 1 70 1QQ 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


10 QQ /I flfto 1 1 9H£ 00 f 

iZ.3o 4.UUe-ll zuo-zzo 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


1 O a O /I A lex 1A 1Q1 111 

iz.3o 4.4/e-iu lyi-zii 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


11 1Q ^ OQ 901 99^ 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


T) 00 c OAr> IO 1 04 914 

iz.jo 3.z4e-iu Lyt-t-z in 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


10 10 7 mo in 9nn 99n 
iz.Jo /.jue-iu zuu-zzu 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


1 9 IS 7 ^7p 1 1 1 Q7 917 
IZ.jo /.j/c-ii iy / -zi / 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


IZ.JO /.iOc-U.? 1/0-1:70 


1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 




1366 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


19 R 66e-09 21 2-2^2 


1366 


rKUUUU /A 


l^OIIiplClllCIlt V^IV^ UOIIlctlll blgild.lU.iC 1 


20 64 6 54e-20 274-300 


1366 


PR00007B 


Complement C1Q domain signature II 


15.63 3.13e-14 301-320 


1366 


PR00007C 


Complement C1Q domain signature III 


16.13 2.62e-15 349-370 


1366 


PR00007D 


Complement C1Q domain signature IV 


9.66 1.95e-12 381-391 


1367 


IPB001427A 


Pancreatic ribonuclease family 


12.97 1.41e-10 31-40 


1367 


IPB001427B 


Pancreatic ribonuclease family 


3I.44 4.30e-29 45-95 


1367 


IPB001427C 


Pancreatic ribonuclease family 


24.33 8.07e-20 83-113 


1367 


PR00794A 


Pancreatic ribonuclease family signature I 


15.68 1.00e-18 46-65 


1367 


PR00794B 


Pancreatic ribonuclease family signature II 


17.91 7.55e-16 66-85 
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1367 


PR00794D 


pancreatic riDonuciedae lainiiy oig,iia.Lui^ ± v 


19.74 6.40e-20 94-116 


1368 


IPB000171A 


,t>acteriai-type pnyioene ueiiyuiu^Giiaow 


19.28 9.72e~ll 57-87 


1368 


IPB000927A 


D-amino acid oxidase 


8.56 7.91e-09 55-67 


1368 


IPB002135A 


j-nyoroxyacyi-L/'O.A. aenyui ugonaac 


20.39 2.50e-10 58-83 


1369 


tt*t> r\ f~\f\ 1 ^7 *l A 

IPB00017IA 


iJacteriai-iype pnyioene ueiiyuiijgciiaofc' 


19.28 9.72e-ll 57-87 


1369 


IPB000927A 


jj-amino aciu oxiaa&c 


8.56 7.91e-09 55-67 


1369 


tt\t* AAA *1 1 E A 

IPB002135A 


«5-nyuroxyacyi-v_,o/\ acnyuiugcnd&c 


20.39 2.50e-10 58-83 


1371 


PR00762A 


Chloride channel signaxure 1 


14.76 7.95e-19 108-125 


1371 


PR00762B 


Chloride cnannei signature 11 


11.84 2.97e-16 139-158 


1371 


PR00762C 


Chloride channel signature III 


9.42 3.57e-20 203-222 


1371 


PR00762D 


Chloride cnannei signaiure i v 


11.34 4.54e-19 440-460 


1371 


PR00762E 


Chloride cnannei signature v 


11.24 8.45e-16 475-491 


1371 


PR00762F 


Chloride cnannei signature vi j 


15.29 4.18e-15 493-512 


1371 


PR00762G 


Chloride cnannei signature v 11 


13.53 7.19e-12 531-545 


1371 


PR01117A 


CLC-d cnionue cndxiiiei bi^iidimc j. 


7.79 1.09e-19 2-14 


1371 


PR01117C 


CLC-6 chloride channel signature III 


9.38 1.19e-17 54-66 


1371 


PR01117D 


CLC-o chloride cnannei signature i v 


6.15 1.10e-13 159-168 


1371 


PR01 1 17E 


CLC-o cniorioe cnannei signaiiuc v 


7.29 1.10e-13 170-179 


1372 


IPB000276A 


Khodopsin-UKe Lj.rCiv superiamiiy 


11.56 1.69e-13 159-170 n 


1372 


PR00245A 


Olfactory receptor signature I 


10.98 2.50e-13 133-144 


1372 


PR00245B 


Olfactory receptor signature II 


13.73 5.70e-12 170-182 


1372 


PR00534A 


Melanocortin receptor family signature I 


12.77 8.04e-09 92-104 


1372 


PR00642C 


EDG-1 spningosme i-pnospnaxe receptor 
signature III 


R7 4 43e-09 89-101 


1374 


IPB000109A 


PTR peptide transporters (r 1 kz; 


10 85 3 79e-15 44-62 


1374 


IPB000109B 


PTR peptide transporters (PTR2) 


29.23 4.18e-23 67-119 


1374 


PR00308C 


Type I antifreeze protein signature III 




1374 


PR01471B 


Histamine H3 receptor signature II 


12.38 9.63e-09 24-42 


1375 


IPB000169A 


Eukaryotic thiol (cysteine) proteases active 
site 


10 17 1 00p~14 1^9-141 


1375 


IPB000169C 


Eukaryotic thiol (cysteine) proteases active 
site 


o.ll z.yoe-ii z/*+-z<o'+ 


1375 


IPB000169D 


Eukaryotic thiol (cysteine) proteases active 
site 




1375 


PR00704C 


Calpain cysteine protease (C2) family 
signature III 


11^^^ 01p-00 1^9-148 


1375 


PR00705A 


Papain cysteine protease (CI) family 
signature I 


1 1 99 9 R0p-91 132-147 


1375 


PR00705B 


Papain cysteine protease (CI) family 
signature II 


10 09 1 4^e-10 276-286 


1375 


PR00705C 


Papain cysteine protease (CI) family 
signature III 


1190 1 OOe-OQ 295-301 


1376 


IPB000276A 


Rhodopsm-hke GrCK superlamily 


11.56 9.05e-10 125-136 


1376 


IPB000276B 


Rhodopsin-like GPCR superfamily 


4.97 5.74e-09 201-212 


1376 


PR00237B 


Rhodopsm-like GPCR superfamily 
signature II 


19 7 S7p-0Q 66-87 


1376 


ris\J\)Zo 


T? Vi r»r?rvncin_1ilv~p rrPOT? ^nnprfamil V 
iVTLUQUUblll — llivc vjjrv^xv oU^Jt/ixciiiiii y 

signature III 


14.77 8.50e-12 111-133 


1376 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 4.21e-ll 193-216 


1376 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


14.34 4.55e-l 1 397-421 


1376 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


19.23 6.09e-09 436-462 
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1376 


PR01103B 


Adrenergic receptor signature II 


13.72 2.50e-10 139-147 


1378 


IPB000203A 


GPS domain 


18.40 9.74e-09 209-239 


1378 


IPB000203B 


GPS domain 


13.98 3.25e-15 325-346 


1378 


IPB000832C 


G-protein coupled receptors family 2 
(secretin-like) 


19.53 1.00e-10 325-354 


1378 


PR00249C 


Secretin-like GPCR superfamily signature 
III 


15.44 3.32e-09 3z 7-35U 


1378 


PR01380A 


Claudin-5 signature I 


5.16 8.79e-09 265-273 


1380 


IPB002190A 


MAGE family 


11 a a r\ ^ 1 o 1 n 

31.63 4.40e-18 1-37 


1380 


IPB002190B 


MAGE family 


27.39 1.61e-26 109-153 


1380 


IPB002190C 


MAGE family 


33.01 9.12e-31 179-233 


1380 


IPB002190D 


MAGE family 


18.35 1.00e-40 239-288 


1381 


PR00169G 


Potassium channel signature VII 


11.30 6.38e-09 265-287 


1381 


PR01320B 


Inward rectifier K+ channel superfamily 
signature II 


12.73 7.46e-ll 162-184 


1381 


PR01333A 


Two pore domain K+ channel signature I 


18.74 8.11e-28 168-196 


1381 


PR01333B 


Two pore domain K+ channel signature II 


10.39 2.58e-09 171-180 


1381 


PR01333B 


Two pore domain K+ channel signature II 


10.39 7.10e-10 280-289 


1381 


PR01463F 


EAG/ELK/ERG potassium channel family 
signature VI 


4.09 3.42e-09 268-285 


1381 


PR01499A 


Trek K+ channel signature I 


8.28 7.99e-12 21-34 


1381 


PR01499B 


Trek K+ channel signature II 


9.88 8.15e-21 oo-oj 


1381 


PR01499C 


Trek K+ channel signature III 


r\ 1 A ^7 A^7 — 1 A 1 A C 1 1 ^7 

9.14 7.07e-14 IUd- 11/ 


1381 j 


PR01499D 


Trek K+ channel signature IV 


o eft f\ r\c ^ o a 1 1 n i o ^ 

8.87 9.05e-2u 119-13/ 


1381 


PR01499E 


Trek K+ channel signature V 


1 A A"> A Tl„ 1/1 1 A /Z ~\ *Z Q 

14.43 2.73e-14 140-00 


1381 


PR01499F 


Trek K+ channel signature VI 


1 s\ c-t 1 r\r\ ~ o a 1 A/C TIC 

10.57 1.00e-24 190-/1 j 


1381 


PR01499G 


Trek K+ channel signature VII 


-i<*> oo >n OA. aa ai /; ao r 

12.38 7.80e-22 216-233 


1381 


PR01499H 


Trek K+ channel signature VIII 


9.00 6.71e-14 23 /-z4o 


1381 


PR01499I 


Trek K+ channel signature IX 


11 1 £* /l T) _ OA OAT 1 AO 

11.15 4.73e-20 2y3-3U8 


1381 


PR01499J 


Trek K+ channel signature X 


16.62 l.lle-16 315-326 


1381 


PR01499K 


Trek K+ channel signature XI 


11.81 2.96e-20 328-344 


1381 


PR01499L 


Trek K+ channel signature XII 


13.04 4.80e-17 364-378 


1382 


IPB000920C 


Myelin P0 protein 


15.78 6.50e-09 245-297 


1382 


IPB003784 


BioY 


27.20 1.22e-09 263-313 


1382 


PR00213E 


Myelin P0 protein signature V 


5.51 8.97e-09 263-287 


1382 


PR01228C 


Eggshell protein signature III 


5.69 5.14e-09 306-321 


1385 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17.17 8.89e-ll 19-72 


1388 


IPB002130A 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


13.87 1.53e-18 58-79 


1388 


IPB002130B 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


21.20 2.93e-37 88-126 


1388 


IPB002130C 


Cyclophilin-type peptidyl-prolyl cis-trans 
isomerase 


16.92 9.50e-34 138-175 


1388 


PR00153A 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature I 


14.60 1.00e-15 64-79 


1388 


PR00153B 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature II 




1388 


PR00153C 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature III 


10.79 1.00e-17 136-151 


1388 


PR00153D 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature IV 


10.73 7.75e-16 151-163 


1388 


PR00153E 


Cyclophilin peptidyl-prolyl cis-trans 
isomerase signature V 


~~8.39 5.15e-10 164-179 
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1390 


IPB000276B 


Rhodopsin-like GPCR superfamily 


A 07 6 06<=» 1 0 1 09-1 1 1 


1390 


IPB000276C 


Rhodopsin-like GPCR superfamily 


8.03 8.50e-ll 162-173 


1390 


PR00237D 


Rhodopsin-like GPCR superfamily 
signature IV 


O 06 A 1Q<» OQ HQ 

y. /o 4. ooe-uy JO" Jy 


1390 


PR00237E 


Rhodopsm-hke GPCR superfamily 
signature V 


lo.U3 (OUe-10 y4-l 1 / 


1390 


PR00237F 


Rhodopsin-like GPCR superfamily 
signature VI 


1/1 "3/1 Qlo K 1 KH 17^ 

14. .34 D.yie-1D IDZ-l/O 


1390 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


1 Q HI ^ HC\a. AO 1 QO H 1 8 
Ly.Zo 3. /Ue-Uy iyZ-Zlo 


1390 


PR00529D 


Gonadotropin releasing hormone receptor 
signature IV 


HO 6n H TCa 1^0 17 

ZZ. oU /.joe-ij z-i / 


1390 


PR00529F 


Gonadotropin releasing hormone receptor 
signature VI 


5 715 AQp- 1 0 75-80 


1390 


TJT> r» A C O r\TJT 

PR n 0529ri 


Oonaaotropnin releasing normone receptor 
signature VIII 


Q64 0 4^p-11 176-106 


1392 


IPB003662A 


General substrate transporters 


18.97 8.76e-13 196-228 


1393 


1PB0002/DA 


Rhodopsin-like GPCR superfamily 


1 1 56 1 6Qp-1^ 157-168 


1393 


PR00237C 


Rhodopsin-like GPCR superfamily 
signature III 


14.77 3.84e-10 143-165 


1393 


PR00245B 


Olfactory receptor signature II 


1 1 7^ 6 54^- 11 1 68- 1 80 


1393 


PR00245C 


Olfactory receptor signature III 


1 A 65 1 75<» 1191 5-9'*. 1 
14. OO j. /JC-11 ZlJ-Zjl 


1393 


PR00245E 


Olfactory receptor signature V 


5 Q6 7 Q1p in 199 m 

o.yo /.yic-iu jzz-jjj 


1397 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


on Ol 6 8R*» HQ 904 HI 


1397 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 8.92e- 10 487-524 


1397 


lrBUU3D2yr5 


Long hematopoietin receptor, gpl30 family 


90 65 9 11 p-OQ 646-668 


1397 


PR00014C 


Fibronectin type III repeat signature III 


14.47 8.71e-09 585-603 


1397 


PR00014D 


Fibronectin type III repeat signature IV 


15 19 9 Q6p-0Q 1 091-1 017 

LD.lZi Z.7UC-U7 1V/Z,J- lUJ / 


1397 


T1DAAA1 AT\ 

PR0U014D 


Fibronectin type III repeat signature IV 


15 19 0 99p-00 701-71 5 


1402 


TT1T1AAA1 C\ C "PI 

IPB000195B 


KauvjrAr/ 1 r>C domain 


1zl 15 5 1 5f* 00 18-50 


1402 


PR00250G 


Fungal pheromone mating factor STE2 
urLK signature v 11 


9.58 9.75 e-09 21-42 


1409 


IPB001359H 


Synapsin 


22.58 9.63e-10 53-103 


1409 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


On Ol 1 A'Xt* 1 1 007 11A 


1409 


IPB003531C 


Short hematopoietin receptor family 1 


1^ $27 Q 1S«= 1 1 5A1-560 


1413 


PR0H31B 


Connexin36 (Cx36) signature II 


3.45 2.41e-09 500-512 


1414 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


i/- ai *7 Af\ a 11 l/IO 1 <C'7 

lo.yi /.UUe-ll 14-5-10/ 


1414 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


1£ 01 O 1 Q /16 OA 

lo.yi /.DUe-lo 40- /u 


1414 


IPB001993B 


Mitochondrial energy transfer proteins 
(carrier protein) 


n nn ^ 1 c« 1/11 qo i q< 
9.UU D. lDe-14 loz-iyo 


1414 


PR00927E 


Adenine nucleotide translocator 1 signature 

V 


1 a on lid no HA 

14. yy o.jjQ-uy /4-yj 


1416 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 1.00e-09 6-43 


1416 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 7.92e-09 88-125 


1417 


IPB000315A 


B-box zinc finger superfamily 


24.51 3.25e-10 30-53 


1417 


IPB001841 


RING finger 


10.69 1.56e-10 42-51 


1418 


IPB001285A 


Synaptophysin/synaptoporin 


12.19 1.00e-28 41-87 
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1418 


IPB0012S5C 


Svnaptophysin/synaptoporin 


15.24 9.89e-23 125-165 


1418 


IPB001285D 


Synaptophysin/synaptoporin 


12. /4 4.U3e-13 loo-zuu 


1418 


IPB001285E 


Synaptophysin/synaptoporin 


9.55 2.98e-24 201-247 


1418 


PR00220A 


Synaptophysin/synaptoporin family 
signature I 


13.69 7.60e-26 38-60 


1418 


PR00220B 


Synaptophysin/synaptoporin family 
signature II 


17.03 5.35e-23 62-87 


1418 


PR00220C 


Synaptophysin/synaptoporin family 
signature III 


e~, y r*i a r 1 1 T 1/11 

11.86 2.85e-25 117-141 


1418 


PR00220D 


Synaptophysin/synaptoporin family 
signature IV 


i r\ /rn zr o a * o< i /ifl 170 
10.60 D.o4e-ZJ 14^-1 /Z 


1418 


PR00220E 


Synaptophysin/synaptoporin family 
signature V 


A Z£L A £f\o 11 1\f\ 11A 


1418 


PR00489C 


Frizzled protein signature III 


Q 1 Q 1 1^^ HQ 91 £-940 


1419 


IPB000729B 


PMP-22/EMP/MP20 family 


H KA A S£*» 1 1 AQ-^Q 


1419 


IPB000729C I 


PMP-22/EMP/MP20 family 


in qi o nip 15 80-1^9 


1419 


IPB000729C 


PMP-22/EMP/MP20 family 


in 9.1 Q 79 1 O R7-1 


1421 


IPB001905A 


Ammonium transporter family 


99 f\1 £ AHo 1A £Q_Q7 
ZZ.Uj o.t-r/G-iH oy-y / 


1421 


IPB001905B 


Ammonium transporter family 


1 Q 5Q f* OSp> 1 7 1 97-1 59 


1421 


IPB001905C 


Ammonium transporter family 


1 5 59 5 1^f> IS 178-90^ 


1421 


IPB001905D 


Ammonium transporter family 


11 SQ 1 61*=»-1Q 917-940 

1 i,07 J.OlC-17 £L f^'-rV 


1421 


IPB001905F 


Ammonium transporter family 


17Q1 1 nnp-1 s 9R7-^1 ^ 


1421 


PR00342B 


Rhesus blood group protein signature II 


\A OA 5 57r-1 ^ 61-7R 


1421 


PR00342C 


Rhesus blood group protein signature III 


1 1 59 5 OAf*-1 1 00-1 07 


1421 


PR00342D 


Rhesus blood group protein signature IV 


11100 fZQ~ 101 9QI AA 


1421 


PR00342E 


Rhesus blood group protein signature V 


1 7 OA 1 1A(*~ 10 151-1 74 


1421 


PR00342F 


Rhesus blood group protein signature VI 


0 ni 1 SSf* 10 185-900 


1421 


PR00342G 


Rhesus blood group protein signature VII 


in in O 70<=» 90 990-9^8 


1421 


PR00342H 


Rhesus blood group protein signature VEII 


8 8A 1 7^p 19 950-979 


1423 


IPB001427A 


Pancreatic ribonuclease family 


1 1 Q7 ^ no«=> 1 o ^A-,<d5 


1423 


IPB001427B 


Pancreatic ribonuclease family 


31.44 6.04e-34 50-100 


1423 


IPB001427C 


Pancreatic ribonuclease family 


ia q i i ~\n£* M if\A Mfi 
Z^.dd Z.i /e-ij luo-ioo 


1423 


PR00794A 


Pancreatic ribonuclease family signature I 


1^ A.Q 6. lla 1 1 ^1 70 


1423 


PR00794B 


Pancreatic ribonuclease family signature II 


17.91 6.34e-13 71-90 


1423 


PR00794C 


• *<« i r* »1 • a TTT 

Pancreatic ribonuclease family signature III 


i £: ot o nna. 1 /I OA 1 1 A 


1423 


PR00794D 


Pancreatic ribonuclease family signature IV 


1 c\ n a i in** no 117 1 1Q 
ly. /4 Z.o /G-Uy 1 1 /-ijy 


1426 


IPB000038A 


Cell division/GTP binding protein 


in nn c\ ofto OQ /t/17 /18Q 

16. / / y.oye-uy 44 /-400 


1426 


IPB001984C 


ATP-dependent serine proteases, Lon 
family 


12.76 3.32e-09 158-194 


1426 


PR00318A 


Alpha G protein (transducin) signature I 


s- OO C 1fln IO /KQ /I Q/1 

0.00 o.iye-iu 4oy-4o4 


1426 


PR00830A 


Endopeptidase La (Lon) serine protease 
(SI 6) signature I 


0 o 1 ,1 _ 11 1 ^rc 1 Q/1 

8.5zZ.14e-ll loj-lo4 


1426 


PR01100A 


Shikimate kinase family signature I 


0 in 1 one oc 470 A85 


1427 


IPB000483 


Leucine rich repeat C-terminal domain 


11 1 O A Q^ia 111 1 78 

11. lo O.oje-13 104-1/O 


1427 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


on 00 0 irifl in I^Q 9QA 


1 /tO'7 

142/ 


t>t?oooi q a 


T p»iirMnf=»-rir»h rf*npfit ^ipnatlire T 


11.72 3.18e-09 52-65 


1427 


PR00019A 


Leucine-rich repeat signature I 


11.72 5.64e-09 77-90 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 1.27e-09 25-38 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 4.82e-09 49-62 


1427 


PR00019B 


Leucine-rich repeat signature II 


11.42 6.32e-ll 74-87 


1430 


PR00462B 


Fungal lignin peroxidase family signature II 


14.83 4.74e-09 34-52 


1435 


IPB002610A 


Rhomboid family 


5.73 9.50e-10 185-194 


1437 


IPB001627A 


Sema domain 


16.97 9.34e-09 138-153 
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jVca tills 


1437 


IPB001627C 


_ ; " 

Sema domain 


21 13 7 lle-09 201-232 


1437 


TTITI AA1 /"ATT* 

IPB001627D 


Sema domain 


16.04 9.47e~17 249-264 


1437 


IPB001627F 


Sema domain 


22.05 8.50e-27 293-326 


1437 


IPB001627Cj 


Sema domain 


21.49 5.36e-22 346-379 


1437 


TT»T»AA1 /ATT 

IPB001627J 


Sema domain 


11.43 2.97e-14 462-478 


1437 


IPB001627K 


Sema domain 


13.76 4.00e-14 540-552 


1437 


IPB002165A 


Plexin repeat 


15.07 3.73e-12 138-153 


1437 


IPB002165B 


Plexin repeat 


13.59 3.43e-12 175-184 


1437 


IPB002165C 


Plexin repeat 


1 8 40 6 19p-34 293-325 


1437 


IPB002165D 


Plexin repeat 


14.72 2.85e-15 540-552 


1437 


IPB002861B 


Reeler domain 


1H ^0 ^ ^6p-00 890-91 8 
1U. ju j.juc oy\j 


1437 


IPB002861B 


Reeler domain 


10.50 9.76e-12 644-672 


1437 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


in ^7 7 ^f\o HQ 7flR-79^ i 


1438 


PR00170A 


Voltage-gated Na+ channel alpha subunit 
signature I 




1438 


PR01097A 


Transient receptor potential family 
signature I 


1 1 <a <: O^p-OQ 700-820 


1439 


PR00170A 


Voltage-gated Na+ channel alpha subunit 
signature I 


q ^ ^lA<*-^^ rri-qoo 

o.OD j.Ut'C-Iw OO i y\J\J 


1439 


PRO 1 097 A 


Transient receptor potential family 
signature I 




1442 


IPB000483 


Leucine rich repeat C-terminal domain 


ii iq 6 77p_l ? 1 R6-200 


1442 


PR00019A 


Leucine-rich repeat signature I 


11.72 1.33e-10 106-119 


1443 


IPB000135A 


High mobility group proteins HMGl and 
HMG2 


1 1 9 60p ^1 ^4-RR. 


1443 


IPB000135B 


High mobility group proteins HMGl and 
HMG2 


1 0 9 /I q 7C e _94 RR-1'39 


1443 


IPB000203A 


GPS domain 


1 0 AO 8 4^p-00 1 26-1 56 


1443 


IPB000203B 


GPS domain 


13.98 4.95e-12 286-307 


1443 


IPB000832C 


G-protein coupled receptors family 2 
(secretin-like) 




1443 


PR00249C 


Secretin-like GPCR superfamily signature 
III 




1443 


PR00249D 


Secretin-like GPCR superfamily signature 
IV 


u lf| Q AAp.C\Q ^97-^S9 


1443 


PR00886A 


High mobility group (HMG1/HMG2) 
protein signature I 


1 n ^7 R 90^-1 7 ^7-^9 


1443 


PR00886B 


High mobility group (HMG1/HMG2) 
protein signature II 


10.07 8.97e-13 60-80 


1443 


PR00886C 


High mobility group (HMG1/HMG2) 
protein signature III 


11.20 1.40e-ll 104-122 


1448 


IPB002000D 


Lysosome-associated membrane 
glycoprotein (Lamp) 


5.87 5.75e-09 419-432 


1448 


PR00308C 


Type I antifreeze protein signature III 


2.79 9.64e-09 758-767 


1449 


PR00180B 


Cellular retinaldehyde-binding protein 

ci cm *>\ lit*** TT 


if ^70 0 no 9^^ 9^n 


1452 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


16.91 8.65e-10 33-57 


1452 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


16.91 9.39e-09 130-154 


1452 


IPB001993B 


Mitochondrial energy transfer proteins 
(carrier protein) 


9.00 4.21e-09 78-91 


1452 


IPB001993B 


Mitochondrial energy transfer proteins 


9.00 7.14e-ll 173-186 
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(carrier protein) 




1452 


PR00926D 


Mitochondrial carrier protein signature IV 


10.86 7.92e-17 41-59 


1452 


PR00926E 


Mitochondrial carrier protein signature V 


11.83 5.74e-19 89-107 


1452 


PR00926F _J 


Mitochondrial carrier protein signature VI 


17.31 1.00e-09 35-57 


1452 


PR00926F 


Mitochondrial carrier protein signature VI 


17.31 5.76e-24 132-154 


1452 


PR00927A 


Adenine nucleotide translocator 1 signature 
I 


7.76 8.53e-09 28-40 


1452 


PR00927B 


Adenine nucleotide translocator 1 signature 
II 


15.15 1.44e-12 164-185 


1452 


PR00927D 


Adenine nucleotide translocator 1 signature 
IV 


10.82 3.79e-13 26-39 


1452 


PR00927E 


Adenine nucleotide translocator 1 signature 
V 


14.99 6.32e-ll 162-183 


1452 


PR00927E 


Adenine nucleotide translocator 1 signature 
V 


14.99 9.25e-22 67-88 


1452 


PR00927F 


Adenine nucleotide translocator 1 signature 
VI 


12.47 5.50e-14 128-144 


1452 


PR00927G 


Adenine nucleotide translocator 1 signature 
VII 


10.67 5.14e-15 178-193 


1454 


IPB001993A 


Mitochondrial energy transfer proteins 
(carrier protein) 


16.91 8.75e-ll 83-107 


1454 


IPB001993B 


Mitochondrial energy transfer proteins 
(carrier protein) 


9.00 1.72e- 10 242-255 


1455 


PR00320A 


G protein beta WD-40 repeat signature I 


13.15 1.96e-09 128-142 


1455 


PR00320C 


G protein beta WD-40 repeat signature III 


12.32 3.67e-09 128-142 


1456 


IPB000494A 


Epidermal growth-factor receptor (EGFR), 
L domain 


22.80 6.59e-25 122-154 


1456 


IPB000494B 


Epidermal growth- factor receptor (EGFR), 
L domain 


24.47 6.00e-18 228-248 


1456 


IPB000494C 


Epidermal growth-factor receptor (EGFR), 
L domain 


24.40 9.45e-33 1099-1145 


1456 


IPB000494D 


Epidermal growth-factor receptor (EGFR), 
L domain 


19.44 1.00e-40 1162-1211 


1456 


IPB000494E 


Epidermal growth-factor receptor (EGFR), 
L domain 


21.65 3.53e-27 1218-1253 


1456 


IPB001060C 


Cell division control protein 15 (CDC 15) 


26.16 1.00e-09 lloi-lzUz 


1456 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 4.27e-19 110j-ii4j 


1456 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 6.54e-29 1165-1203 


1456 


IPB001772C 


Kinase associated domain 1 


20.66 1.75e-10 1100-1130 


1456 


IPB002174A 


Furin-like cysteine rich region 


30.51 4.41e-13 126-157 


1456 


IPB002174B 


Furin-like cysteine rich region 


11.91 7.00e-10 1183-1193 


1456 


IPB003961B 


Fibronectin type III domain 


12.29 7.43e-ll 1184-1194 


1456 


PR00014D 


Fibronectin type III repeat signature IV 


15.12 6.09e-09 893-907 


1456 


PR00109A 


Tyrosine kinase catalytic domain signature I 


12.56 5.34e-ll 1066-1079 


1456 


PR00109B 


Tyrosine kinase catalytic domain signature 
II 


11.07 1.47e-20 1112-1130 


1456 


PR00109C 


Tyrosine kinase catalytic domain signature 
III 


1 1 Q/C Q 1^11 1 1 1A 

11. oo y.zr>e-ij iiOH-ii/ i + 


1456 


PR00109D 


Tyrosine kinase catalytic domain signature 
IV 


17.61 4.50e-24 1183-1205 


1456 


PR00109E 


Tyrosine kinase catalytic domain signature 
V 


12.99 4.48e-16 1227-1249 


1457 


IPB002889B 


WSC domain 


11.76 8.41e-09 1558-1604 


1462 


IPB000130 


Neutral zinc metallopeptidases, zinc- 


5.86 4.00e-12 238-248 
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— : — — : ■ 1 ~ 

binding region 




1462 


IPB001818A 


Matrixin 


14.60 2.13e-29 91-120 


1462 


IPBOOloloB 


Matrixin 


26.48 6.04e-31 137-178 


1462 


IPBOOloloC 


Matrixm 


24.38 4.09e-32 20-65 


1462 


IPBOOloloC 


Matrixin 


24.38 7.43e-35 182-227 


1462 


IPB001818D 


Matrixin 


14.91 3.08e-37 232-263 


1462 


IPB001818E 


Matrixin 


8.86 7.19e-15 273-286 


1462 


IPB001818F 


Matrixin 


11.19 6.36e-13 309-329 


1462 


PR00138A 


Matrixin signature I 


12.54 1.64e-16 111-124 


1462 


PR00138B 


Matrixin signature II 


14.84 5.21e-10 156-171 


1462 


PR00138C 


Matrixin signature III 


20.07 1.78e-16 180-208 


1462 


PR00138C 


Matrixin signature III 


20.07 8.38e-15 18-46 


1462 


PR00138D 


Matrixin signature IV 


14.57 5.50e-31 238-263 


1462 


ER00138E 


Matrixin signature V 


7 10 8 71e-15 273-286 


1462 


PR00480B 


Astacin family signature II 


14.35 1.16e-ll 233-251 


1462 


PR00997G 


Fragilysin metallopeptidase (M10C) 
enter otoxin signature VII 


1 1 64 8 01e-10 244-260 


1466 


IPB000008D 


C2 domain 


14.83 5.92e-09 288-306 


1466 


PR01471E 


Histamine H3 receptor signature V 


5.41 2.43e-10 610-625 


1468 


IPB000228D 


RNA 3 '-terminal phosphate cyclase 


^i«4 89e-19 30-67 


1468 


IPB000228E 


RNA 3 -terminal phosphate cyclase 


96 21 8 43e-09 105-146 


1468 


IPB002029A 


Aspartate and ornithine 
carbamoyltransferase family 


20.73 7.86e-09 78-105 


1470 


IPB003874C 


CDC45-like protein 


5.49 9.16e-10 315-326 


1473 


IPB002568A 


Carlavirus putative nucleic acid binding 
protein 




1473 


IPB002999B 


Tudor domain 




1473 


IPB003482B 


Transcription factor WhiB 


1 fl 0Q S 71p-0Q 10-22 


1474 


PR01547I 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature IX 


k Q7 7 Q4p-?8 69-99 


1474 


PR01547K 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XI 


"R 61 7 00e-15 95-114 1 

X^J-WX / ♦ vv/v X*/ *S +S A. A. i 


1474 


PR01547L 


Saccharomyces cerevisiae 175.8kDa 
hypothetical protein signature XII 


10.35 6.29e-19 119-135 


1477 


IPB000822 


Zinc linger, CzhLZ type 


14.67 2.88e-09 421-446 


1477 


PR01407E 


rJutvropnynn ^-terminal uur Mgndiuio v 


13.29 1.63e-09 56-80 


1478 


IPB001925C 


Eukaryotic porin 


23.55 1.84e-28 8-61 


1478 


PR00185D 


JiuKaryotic ponn signature i v 


14.33 9.50e-12 111-128 


1482 


PR00449b 


i ransrormmg proicin rz-i iaa oiguatuic v 


13.39 1.29e-ll 464-486 


1489 


IPB003859B 


jvietazoa galactosyl Lrdiis>ieidbc 


27.85 8.27e-18 205-257 


1489 


IPB003859C 


jVLetazoa gaiaciosyiirdnbicid&c 


19.53 3.74e-36 301-335 


1489 


IPB003859D 


Metazoa galactosyltransferase 


26.76 6.25e-27 349-382 


1491 


IPB003191M 


Guanylate-binding protein 


10.38 7.86e-28 28-58 


1491 


IPB003191N 


Guanylate-binding protein 


9.33 6.16e-21 61-91 


1491 


IPB003191O 


Guanylate-binding protein 


14.31 7.22e-28 92-132 


1491 


IPB003191O 


Guanylate-binding protein 


14.31 8.23e-09 81-121 




PR00502B 


NUDIX hydrolase family signature II 


14.82 4.15e-09 348-363 


1494 


IPB000117D 


Kappa casein 


10.18 8.71e-09 117-151 


1497 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 3.00e-15 88-125 


1498 


IPB001395E 


Aldo/keto reductase family 


20.87 7.39e-23 256-293 


1498 


IPB001395F 


Aldo/keto reductase family 


25.03 2.45e-23 317-362 


1498 


PR00069D 


Aldo-keto reductase signature IV 


19.00 3.92e-20 260-289 


1499 


IPB000237B 


GRIP domain 


30.66 8.24e-09 543-593 
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jvesuiis 


1499 


IPB000237B 


GRIP domain 


10 66 8 34e-09 547-597 


1499 


IPB000996B 


Clathrin light chain 


90 75 7 55e~09 541-593 


1499 


IPB001800D 


Lipoprotein, type 6 


99 7^ q 1 1 p-09 558-604 


1499 


IPB001800D 


Lipoprotein, type 6 


99 7-5 o 70e-09 563-609 


1499 


IPB002101B 


MARCKS family 


1A 58 2 ?8e-09 545-596 


1499 


IPB002906A 


Ribosomal protein S27a 


11.48 3.00e-09 561-593 


1499 


IPB002906A 


Ribosomal protein S27a 




1499 


IPB003121A 


BAF60b domain of the SW1B complex 


1^ 8Q 9 46e-09 552-577 


1499 


IPB003134F 


Repeat in HS l/Cortactin 


1 5 66 8 49e-09 552-600 


1503 


PR00049D 


Wilm's tumour protein signature IV 


0 00 0 57p-1 0 65-79 


1503 


PR01217B 


Proline rich extensin signature II 


4 C7 c 55e-09 71-87 


1504 


IPB000436B 


Sushi domain / SCR repeat / CCP module 


R 70 8 71 p-1 0 307-318 


1504 


PR01544D 


Arabidopsis thahana l30.7kDa hypothetical 
protein signature IV 


lA 5A 0 1^p-10 373-394 


1511 


IPB000822 


Zinc linger, C2H2 type 


14 67 3 77e-18 343-368 


1511 


IPB000822 


Zinc finger, C2H2 type 


14 67 9 ?5e-19 371-396 


1511 


IPB001275 


DM DNA binding domain 


1Q 17 3 51e-09 331-370 


1511 


IPB001275 


DM DNA binding domain 


1 Q 1 7 5 67e-09 359-398 


1511 


IPB001909 


KRAB box 


17 17 4. 00e-1 9 230-264 


1511 


PR00048A 


C2H2-type zinc finger signature I 


0 QA 5 76e-12 368-381 

^7,^7*T J, / Uv li< jvy»j «j a 


1511 


PR00048A 


C2H2-type zinc finger signature I 


0 OA 7 75e-1 6 340-353 

Zs m Z/ i / • / Jv/ i vy *y i vy — y »y -y 


1511 


PR00048B 


C2H2-type zinc finger signature II 


5 5? 7 41e-10 384-393 

m *J l ."J6 i- vy JU i _y^-y 


1517 


IPB000822 


Zinc finger, C2H2 type 


1A 67 1 00e-17 1066-1091 


1517 


IPB000822 


Zinc finger, C2H2 type 


1A 67 1 00e-19 281-306 


1517 


IPB000822 


Zinc finger, C2H2 type 


1A 67 1 60p-16 393-418 

X— r.U/ i..vyVy\-/ ivy s -j ~ a 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 1 60e-16 954-979 


1517 


IPB000822 


Zinc fmger, C2H2 type 


14 67 1 69e-18 477-502 


1517 


IPB000822 


Zinc finger, C2H2 type 


1A 67 ? 11e-14 1038-1063 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 4 79e-12 1094-1119 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 5 50e-1 9 421-446 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 5 5 Oe- 20 870-895 

X*T".U / J.Juw vy u / vy **y 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 6 73e-21 926-951 


1517 


IPB000822 


Zmc finger, C2H2 type 


14.67 7.00e-20 309-334 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 8 36e-21 337-362 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 8 36e-21 982-1007 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 8 50e-20 449-474 


1517 


IPB000822 


Zmc finger, C2H2 type 


14 67 9 25e-20 898-923 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 0 11 p-1 8 1010-1035 

Xt.vy/ y.Jlv lu i vy x vy j.vy-y«y 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 0 40p-16 365-390 


1517 


IPB000822 


Zinc finger, C2H2 type 


14 67 0 44e-15 505-530 


1517 


IPB001275 


DM DNA binding domain 


10 17 1 1?p-10 970-1009 

J.7, 1 / X . IZiv iw y / u x vy vy_y 


1517 


IPB001275 


DM DNA binding domain 


1 Q 1 7 1 55p-1 0 297-336 

1 jy , x / j.JJv —xv/ ' — ' -y vy 


1517 


IPB001275 


DM DNA binding domain 


10 17 1 60p-19 417-476 


1517 


IPB001275 


DM DNA binding domain 


1 Q 1 7 A 1 Op-0Q 886-925 


1517 


IPB001275 


DM DNA binding domain 


i q 1 7 a 1 0p-09 942-98 1 


1 < 1 *7 
ID 1 / 


TPT*nni 97<% 


DM DNA bindine domain 


19.17 6.71e-13 325-364 


1517 


IPB001275 


DM DNA binding domain 


19.17 6.72e-10 1082-1121 


1517 


IPB001275 


DM DNA binding domain 


19.17 6.72e-10 914-953 


1517 


IPB001275 


DM DNA binding domain 


19.17 8.06e-09 858-897 


1517 


IPB001275 


DM DNA binding domain 


19.17 8.18e-10 998-1037 


1517 


IPB001275 


DM DNA binding domain 


19.17 9.10e-ll 1026-1065 


1517 


IPB001275 


DM DNA binding domain 


19.17 9.88e-10 465-504 


1517 


IPB001909 


KRAB box 


17.37 8.03e-18 675-709 
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JvcMllli) 


ID J. / 




1SJev/\x3 DOX 


17 37 9 00e-32 76-110 


KIT 
1 D I 1 


rKUUU4oA 


(~^ r ) tJO lima r7-If-ir« fi n fT/^T CI Cm £1 til VF" T 

\^_,zriz - iypc zinc linger bigiiaLuic x 


9 94 1 69e-14 306-319 


1 < 1 7 
i D 1 / 


JrKUUU^foA 


f^") T-TO _t\ n^^» "vitin flnCTpr ci on atlTTF* T 
V_,Zil.Z-iy pc ZlllC llllgCi olgllaLUit/ J. 


9 94 1 90e-15 446-459 


ID 1 / 


JrKUUU'foA 


iw/Zxiz-iypc zinc imgci bigiidiuic x 


9 94 2 42e-ll 474-487 


K17 
1 J 1 / 


T>pAAA/lQ A 

rJxUUU4oA 


POTJO f\rr\<3 rzinf* "fi n crr^r" cionntnrp T 
^ZxxZ - iypc Z111C llllgCl olgllaLUlG J. 


9 94 2 93e-13 502-515 


K 17 

ID J. / | 


x^KUUU4oA 


i^Zxiz-iype zinc iingei oigiicuuic x 


9 94 3 08e-14 1035-1048 


ID 1 / 


x l\.UUU4o.f\ 


C^ r ) U7 > \rr\p» Titif "fin cr/^r cionntnrp T 
v^zxxz-typc zinc iiiigci oignaLuit? x 


9.94 4.38e-09 1007-1020 


ID 1 / 


rKUUU45A 


V^ZllZ-iy pC Z111C lUlgCl MgllcUUlt/ X 


9 94 5 50e-13 334-347 


K 17 

ID 1 / 


r\KUUU4oA 


f* , 7T49 -f\ rf-^^» 71'np "fin rrP'T' ci on Q+iiTf* T 

v^zinz-iypc zinc- ungci oigiiaiuio x 


9 94 5 50e-13 895-908 


K17 

ID 1 / 


~DT> f\f\f\AQ A 


r^7TJ7 +xrr»f» f7inr» ■Fincri^T' ci on Q^l lTf 3 T 
V^-Z inZ - Lypc Z1I1C llllgCI MgilaLUlC X 


9 94 6 14e-13 923-936 


K17 

ID 1 / 


JrJvUUUH-oA 


l" , 9'PT9 hmp 7in/» fincrpr ci on nil iff* T 
V^ZxlZ-LypC Z1I1C XlllgCi oigllaLUlC X 


9.94 7.00e-09 867-880 


K17 

ID 1 / 


JrJvUUU4oA 


09 U9 -Hrr*^ 7inp ■fincrpr ci on afl 1TP T 
V^ZixZ-Lypc ZlilC llllgOl olgllalUlO X 


9.94 7.35e-12 362-375 


K17 
ID 1 / 


JrK.UUU^foA 


P*9T-T9_-K/r\<= -^inr* -fincrpr cionatiirp T 

^ZXXZ _ iypC Z111C XlllgCl olgllalUlt/ J. 


9.94 7.55e-10 390-403 


K 17 

ID 1 / 


PP AAHAQ A 
rKUuU4o/Y 


(~*'~) XJO fTrrtp 7tnn ftlKTAr CI OTI CltllTf* T 

i^-zxiz-iypc zinL xinger oigiidiuic x 


9 94 8 lie- 11 979-992 


1 ^ 1 7 
ID 1 / 


r\KUUU4o/Y 


(~ ir yX^X0 t\rr\p> 7inp -fi n crf^v oicmfitnTf* T 
V^ZXlZ type Zlllv Xlllgd olglla-LU.lt; x 


9.94 8.58e-ll 418-431 


K17 

ID 1 / 


rK.UUUH-oA. 


/"'OXJO _j-Trr\n rzinr* "finCTf^r cicmntlTTf* T 
V^ZilZ-Ly pC ZUK/ XlllgCl alglldLUlC X 


9.94 9,05e-ll 951-964 


ID 1 / 


pp nnn4R a 

JrJ\.UUU £ fo/\ 


/~"7 T-IO _f\nif : » 71'nr 1 'fi n apr ci on atnTf* T 
v^ZXXZ~Lypc ZlUCr illlgCl olgliaiuiv x 


9.94 9.36e-13 1091-1104 


ID 1 / 


JrJvUUUn-oJj 


/^OXJO \n^<=> 7\t\o -fi 11 CTfT" cionatnTf* TT 
v_/^riZr lyjjc zinc xiiigci oigiiaxuiv/ 11 


5.52 1.00e-09 911-920 


K17 
ID 1 / 


JrKUUU4oJt> 


/^OT-TO t\rr^p> 71'rp "fincrpr cicmptllTf* TT 
V^ZliZ - LypC Z111C Xlll^Cl olglla.LU.lt/ XI 


5.52 2.29e-10 939-948 


1 r 17 

1517 


rKUUU4or> 


^Zxiz-Type zinc linger signaiurc xi 


S? ? 50e-09 995-1004 


1517 


PR00048B 


C2H2-type zinc finger signature II 


5.52 2.93e-10 350-359 


1517 


JrKUUU4oJt> 


L^Zxiz-Lype zinc linger signature ai 


5 5?^ ?5p-1 1 294-303 


1517 


PR00048B 


C2H2-type zinc finger signature II 


5.52 3.25e- 11 434-443 


1517 


tin r\rir\ /f on 

PR00U48B 


C2H2-type zinc finger signature II 




1517 


PR00048B 


C2H2-type zinc finger signature II 


^ ^9 ^ 57p-10 109^-103? 


1517 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.00e-09 462-471 


1517 


PR00048B 


C2H2-type zinc finger signature II 


d.dz /.uue-uy yo /-y /o 


1518 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


19.16 2.63e-13 21-70 


1518 


IPB000353C 


Class II histocompatibility antigen, beta 
chain, beta-1 domain 


20.11 1.41e-10 71-125 


1518 


IPB001003B 


MHC Class II, alpha chain, alpha- 1 domain 


1/1 79 /I -d.^^ HO 9Q 99 
14. /Z 4.4De-Ui? Ay- /Z 


1518 


IPB003006A 


Immunoglobulin and major 
histocompatibility complex domain 




1518 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 9,40e-19 78-115 


1518 


PR01436D 


NADH-ubiquinone oxidoreductase chain 2 
signature IV 


4.56 2.66e-09 6-18 


1521 


IPB001618 


Coenzyme A transferase 


37.23 1.86e-09 56-109 


1521 


THDAA1 /T 1 0 

IPBuOlolo 


Coenzyme A transferase 


^7 9^ 5 7^e-^9 ^5S-40S 

J / .Zj J. / JC-jZf JJJ-tUO 


1521 


IPB001618 


Coenzyme A transferase 


37.23 6.92e-31 196-249 


1521 


IPB001839A 


Transforming growth factor (TGF) beta 
family 


1 0 "37 1 Q1o 17 £^7 ^71 


1521 


IPB001839B 


Transtormmg growth tactor (1^) De f a 
family 


11 KO 1 7p» 1 9 7fiA 71 R 
11. ID Z. 1/e-lZ / U4- / 1 o 


1521 


IPB001839C 


Transforming growth factor (TGF) beta 

xcuixiijr 


17 ^O 9 SJHp 1 £ 79 A 741 


1521 


PR00438A 


Growth factor cystine knot domain 
signature I 


11.01 4.00e-09 665-674 


1521 


PR00669E 


Inhibin alpha chain signature V 


14.95 5.39e-12 640-657 


1523 


IPB000034A 


Laminin B 


22.21 8.56e-09 661-696 


1523 


IPB001762A 


Disintegrin 


23.93 4.21e-21 453-493 


1523 


IPB001762B 


Disintegrin 


10.06 7.30e-13 500-510 


1523 


IPB002870A 


Reprolysin family propeptide 


12.22 6.54e-09 109-125 
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1523 


IPB002870B 


Reprolysin family propeptide 


24.73 9.38e-26 154-192 


1523 


IPB002870D 


Reprolysin family propeptide 


16 11 6 11p-1 1 191-11R 


1523 


IPB002870E 


Reprolysin family propeptide 


11 00 7 57p-1 9 155-167 I 


1523 


IPB002870F 


Reprolysin family propeptide 


1RR1 6 R5p-1 0 10R-499 


1523 


IPB003854B 


Gibberellin regulated protein 


1 Q 96 A 6Qp-00 470-50R 


1523 


PR00289A 


Disintegrin signature I 


i a on 7 S5p-1 A 460-4RR 


1523 


PR00289B 


Disintegrin signature II 


ii 747 97p„ 1 0 408-5 1 0 


1524 


IPB000387 


Tyrosine specific protein phosphatase and 
dual specificity protein phosphatase family 


1 0 77 5 OOp-1 1 1 Sfi-1 Q6 

IV/.// J.WC-ll 1 OU 1 7U 


1524 


TTlTl AA1 A 

IPB001073A 


Complement Clq protein 


99 u S ^np-OQ 940-9S^ 


1524 


IPB001073A 


Complement Clq protein 


9? 14 5 "SOp-OQ 4^6-470 


1524 


IPB001695C 


Lysyl oxidase 


1 9 60 9 64p-0Q ^01 -41 5 


1524 


PR00700D 


Protein tyrosine phosphatase signature IV 


19 5 70p-10 18^-901 


1525 


IPB000276C 


Rhodopsin-like GPCR superfamily 


o A S9p-00 5-16 

O.UJ> UiiC Uy J 1U 


1525 


IPB000276D 


Rhodopsm-like GPCR superramily 


Q 40 5 94p 1 A 45-61 


1525 


PR00237G 


Rhodopsin-like GPCR superfamily 
signature VII 


in oa o 77p_1 7 ^5-61 


1525 


PR00424F 


Adenosine receptor signature VI 


R 75 5 ^4p-19 ^5-45 


1525 


PR00554F 


Adenosine A2B receptor signature VI 


R R6 1 75p-16 94-^6 


1525 


PR00554G 


Adenosine A2B receptor signature VII 


1 4 1 S 7 90p-1 9 60-RS 


1527 


IPB001045B 


Spermidine synthase 


IR 40 1 ^5p-19 ^14-^56 


1528 


T"nr» A AO /\ A o 

IPB002048 


El* -hand iamily 


7 014 ^9p-00 ^^S-^50 


1528 


PR01286E 


Orphan nuclear receptor NOR1 signature V 


5 97 6 65p-00 6-97 


1528 


PR01362H 


Flagellar calcium-binding protein 
(calflagin) signature VIII 


A S6 7 1 ^p-00 1 50-1 66 


1532 


IPB001602A 


Uncnaractenzeu protein iamiiy urruw/ 


1 5 OR 8 04e- 10 119-137 


1534 


IPB003861B 


E4 protein 


0 06 9 06e-09 1051-1065 


1534 


PR00021A 


Small proline-rich protein signature I 


^ 11 4 55p-00 1044-1056 


1535 


TTIT1AA1 1 CATT 

IPB001359H 


Synapsin 


99 5R 6 50e-09 19-89 


1535 


PR00671C 


Inhibin beta B chain signature III 


4 1 R 0 60e-00 44-63 


1538 


IPB000095r 


FAIv-dox /r z 1 -Kno-Dinaing 


16 47 1 05p-09 146-200 


1538 


IPBOOObolCj 


l^JsJN/rnopniiin/rnoteKin rno-Dinamg repeat 


13 73 9 15e-ll 144-193 

L Js . / ~J 7.1JC 11 111 1 — ' 


1538 


TT%T>AAAAC AT\ 

IPB000959D 


POLO box duplicated region 


97 01 6 79p-0Q 106-248 


1538 


Tr»r»AAAA/"1 T~"V 

IPB000961D 


Protein kinase C-terminal domain 


91 91 9 00p-14 13R-170 


1538 


IPB001245A 


Tyrosine kinase catalytic domain 


99 45 6 63e-16 82-122 


1538 


IPB001245B 


Tyrosine kinase catalytic domain 


91 6R 7 55p-17 142-180 


1538 


IPB001772C 


Kinase associated domain 1 


90 66 0 71e- 1 6 77- 1 07 


1538 


IPB001772D 


Kinase associated domain 1 


91 67 4 67p-00 140-188 

X . LJ / *-r.V//C \jy 177 lOO 


1538 


IPB001772E 


Kinase associated domain 1 


94 RR 9 67p-1 0 901-949 

ZH'.OO ^.U/C lu aUJ i«"/< 


1538 


IPB003527D 


MAP kinase 


91 51 9 04p-00 115-176 

1 . J J ^.V/T^t/ V/.7 ± w' 1 /V/ 


1538 


IPB003527G 


MAP kinase 


1 7 96 9 1Re-09 21 5-252 


1539 


PR00502B 


NUDIX hydrolase family signature II 


14 R9 7 51p-00 111-146 


1540 


IPB000777A 


Envelope glycoprotein GP120 


19 67 0 67e-09 659-665 


1540 


TUHAAAOI *> 

IPB000b22 


z,mc ringer, i^ztiz type 


14 67 9 57e- 10 729-754 


1540 


IPB003036C 


Gag P30 core shell protein 


11 53 4 86e-09 799-815 


1 540 


ooo4on 


Wilm's tumour protein signature IV 


0.00 1.79e-10 813-827 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 1.00e-12 537-574 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.52e-l 1534-571 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 2.56e-09 543-580 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 5.85e-ll 546-583 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.69e-09 555-592 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 7.27e-ll 552-589 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


1 1.46 7.36e-10 540-577 
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1541 


IPB000885A 


fibrillar collagen u-terminai uomam 




1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 46 Q 1 9*»-1 f) 


1541 


IPB000885A 


Fibrillar collagen C-terminal domain 


11 46 Q 41p~1 0 


1541 


IPB000885B [ 


Fibrillar collagen c-termmai aomam 


1Q 1S 1 77e-10 590-^73 


1541 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 Q 1 ^ 70p-0Q 547-600 


1541 


IPB000885B 


Fibrillar collagen C-terminal domain 


1 Q 1 5 4 1 1 e-OQ 5^8-591 

Yzs. 1 J *r. 1 1C"U7 JJO w>-7 X 


1541 


IPB000885B 


Fibrillar collagen C-terminal domain 


1Q 15 5 01p~0Q 544-507 


1541 


IPB000885B 


Fibrillar collagen C-terminal domain 


in K/C AQr_1 A 5 1 7-570 


1541 


IPB000885B 


Fibrillar collagen C-terminal domain 


1Q K Q 11 e _1 A 59^-576 


1541 


Ttit* A AAfi A C TP* 

IPB000885B 


Fibrillar collagen C-terminal domain 


1 Q 1 5 0 67p-0Q 541 -504 
u / C—VJI7 jti"J7i' 


1541 


IPB000885C 


Fibrillar collagen C-terminal domain 


1 6 70 1 00p-40 6^9-678 


1541 


IPB000885D 


Fibrillar collagen C-terminal domain 


18.84 4.75e-32 728-765 


1541 


IPB000885E 


Fibrillar collagen C-terminal domain 


91 i nn^ 4A Rnn_S4i 


1541 


IPB001073A 


Complement Clq protein 


22.14 1.00e-08 554-588 


1541 


IPB001073A 


Complement Clq protein 


zz. 14 o.yDe-uy jji-joj 


1541 


IPB001073A 


Complement Clq protein 


22.14 7.32e-13 548-582 


1541 


IPB001073A 


Complement Clq protein 


zz. 14 o.uje-uy d4zo / o 


1541 


IPB001073A 


Complement Clq protein 


22.14 9.50e-ll 545-579 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 2.53e-12 548-600 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.06e-10 551-603 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 5.54e-09 527-579 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26,12 6.12e-10 545-597 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.10e-10 542-594 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.59e-09 524-576 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 8.98e-09 536-588 


1541 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 9.34e-10 521-573 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.12e-09 546-566 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.76e-12 549-569 


1541 


IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


12.38 1.77e-10 55z-D /z 


1542 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 3.88e-ll 1025-1035 


1542 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


8.86 4.09e-10 927-942 


1542 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


o o /T o nn„ i o 1 AO A 1 AO C 

8.86 8.20e-lo 10ZU-103D 


1 c/to 
1D4Z 


T"DT2 AAA/1 QOD 


r roramine £ yrr^wLz.) 


5.26 9.27e-10 430-464 


1542 


IPB001862F 


Membrane attack complex 
components/perforin/complement C9 


29.39 5. 39e-13 901-948 


1542 


IPB001881A 


Calcium-binding EGF-like domain 


8.72 3.40e-09 916-925 


1542 


IPB001881B 


Calcium-binding EGF-like domain 


12.28 7.00e-16 1020-1031 


1542 


IPB003886D 


Extracellular domain in nidogen 


13.91 2.77e-10 1020-1039 


1542 


PR00010A 


Type II EGF-like signature I 


12.91 2.64e-09 912-923 


1542 


PR00010C 


Type II EGF-like signature III 


6.98 1.58e-10 1025-1035 
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1542 


PR00010C 


Type II EGF-like signature III 


6.98 4.07e-09 932-942 


1542 


PR00011D 


Type III EGF-like signature IV 


la. iz z.D^fe-uy y&y~y L x / 


1542 


PR00764F 


Complement C9 signature VI 


15.74 1.60e-10 918-938 I 


1542 


PR00907B 


Thrombomodulin signature II 


1 1 kci i <o 0 in 1 m £ 1 0^9 
1 1 ,du z.Ocse- tu iuio-iioz 


1543 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 1.00e-ll 88-125 


1544 


IPB001359H 


Svnapsin 


22.58 2.49e-09 244-294 


1544 


IPB001394A 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


11 £LA O r\A«- If! AHQ AC\K 

12. o4 o.uue-iy 4/o-4yD 


1544 


IPB001394B 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


1 n Q7 1 1 £o 1 Q £8A 71 ^ 


1544 


IPB001394C 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


oco i nn^-OQ 771-7R0 


1544 


IPB002000D 


Lysosome-associated membrane 
glycoprotein (Lamp) 


< 57 s ^Rp-OQ 941 -954 


1544 


TTVT> A A^nn AT) 

IPB002999B 


Tudor domain 


7 50 7 55e-09 244-252 


1544 


IPB002999B 


Tudor domain 


7 50 7 55e-09 245-253 


1544 


IPB002999B 


Tudor domain 


7 50 7 55e-09 246-254 


1544 


IPB003861B 


E4 protein 


q 06 1 00e-08 246-260 


1544 


TirnAA'5 o /Z 1 "rj 

IPB003861B 


E4 protein 


q 06 9 61 e-09 251-265 


1544 


PR0UU49D 


w urn s tumour protein signcuure i v 


0 00 1 29e-10 233-247 


1544 


PR00049D 


w um s tumour protein signature i v 


0 00 2 68 e-09 243-257 


1544 


PR00049D 


Wiim s tumour protein signature i v 


0 00 4 05 e-09 237-251 


1544 


PR00049D 


Wilm s tumour protein signature i v 


0 00 4 05 e-09 238-252 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0 00 4 05e-09 244-258 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0.00 5.92e-ll 242-256 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0 00 ^ 75^ 1 1 9^^-750 


1544 


PR00049D 


Wilm's tumour protein signature IV 


a nn 7 ^£f» 19 9^Q-?S^ 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0 OO SD1p 1 7 9^S_94Q 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0 00 S Q^p 10 94.1-755 


1544 


PR00049D 


Wilm's tumour protein signature IV 


0 00 Q ^Qp-1 1 940-954 


1544 


PR00211B 


Glutelin signature II 


0 Rfi 7 50^-00 9^R-95S 


1544 


PR0H02M 


5-hydroxytryptamine 6 receptor signature 

VTTT 
Alii 


11 1 ^ ^ ^9p-0Q 999-951 


1544 


t>t> Alii nr^ 
PR0121 /Cj 


Proline rich extensin signature VII 


4 09 5 58e-09 233-258 


1544 


PR01471E 


Histamine H3 receptor signature V 


5.41 3.35e-09 235-250 


1544 


PR01471E 


Histamine H3 receptor signature V 


^ 41 4 91e-09 9^6-251 


1545 


IPB001359H 


Synapsin 


99 5R 1 SOp-OQ 977-327 


1545 


IPB001359H 


Synapsin 


99 5S R R0p-1 0 4^7-487 


1545 


mnAA 1 O C C\t T 

IPB001359H 


Synapsin 


99 5S 0 S^p-0Q 458-508 


1545 


IPB002999B 


Tudor domain 


7.50 2.64e-09 1028-1036 


1546 


PR00019B 


Leucine-rich repeat signature II 


It AO A "^^p 10^1 7-^^0 


1546 


PR00416B 


Eukaryotic DNA topoisomerase I signature 
II 




1547 


TT*T^ f\ A A A A 1 A 

IPB000801A 


Putative esterase 


1^ 7 7^p-00 599-5^^ 


1547 


IPB003533C 


Doublecortin 


in 9<; ^ 1 /lp_ 1 0 697-670 




TPR001 698 A 


F-ftctin carnnin j? rvrotein beta subunit 

A ClV* till V/U UU111C UJlUlwlll W L vv iJLIU Ulllt 


11.94 2.89e-23 47-100 


1549 


PR00192A 


F-actin capping protein beta subunit 
signature I 


8.23 1.47e-27 47-67 


1551 


IPB002469G 


Dipeptidyl peptidase IV, N-terminus 


26.76 9.24e-ll 362-400 


1551 


IPB002469H 


Dipeptidyl peptidase IV, N-terminus 


21.17 6. 14e-16 407-442 


1551 


1PB002469I 


Dipeptidyl peptidase IV, N-terminus 


10.99 4.86e-16 452-470 


1551 


IPB002469J 


Dipeptidyl peptidase IV, N-terminus 


8.97 3.52e-12 534-550 


1551 


IPB002471B 


Prolyl endopeptidase family serine active 


24.90 3.66e-l 1439-470 
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site 




1552 


IPB000183D 


Orn/DAP/Arg decarboxylases family 2 


14 78 6 76e-12 38-57 


1552 


IPB000183E 


Orn/DAP/Arg decarboxylases family 2 


17 9S 9 67e-09 106-129 


1552 


IPB000183G 


Orn/DAP/Arg decarboxylases family 2 


21.16 1.00e-17 171-191 


1552 


IPB003873C 


Nonstructural protein NS3/small envelope 
protein E 


7 Q7 S CY\f> 10 1 *?Q-1 79 


1552 


PR00163B 


Rubredoxin signature II 


1 < 1 H A QOp» HQ 14^-1 ^0 
10.1/ 4.yUc-uy i'+j-iJ7 


1552 


PR01179E 


Ornithine/diaminopimelate/arginine (ODA) 
decarboxylase family signature V 


94 ^ 06^ 11 1RA-1Q^ 

10.Z4 o .uoe- 1 1 lou-ii/j 


1552 


PR01182G 


Ornithine decarboxylase signature VII 


11 1^ 7 00<=>-14 1 67-1 80 


1558 


IPB001331C 


Guanine-nucleotide dissociation stimulators 
CDC24 family 


1 6 HQ 9 40f*-1 9 1 1 04-1 1 29 


1558 


IPB001478B 


PDZ domain (also known as DHR or 
GLGF) 


fs\ll 1Qe-09 198-207 


1558 


IPB002532J 


Hantavirus glycoprotein G2 


16.97 8.37e-09 1129-1165 


1558 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.81e-09 1318-1332 


1558 


PR00049D 


Wilm's tumour protein signature IV 


0.00 5.73e-09 259-273 


1558 


PR00554B 


Adenosine A2B receptor signature II 


12 52 8 85e-09 917-925 


1558 


PR00834F 


HtrA/DegQ protease family signature VI 


11115 24e-09 194-206 


1559 


IPB000095F 


PAK-box /P21-Kno- binding 


1 6 47 4 5 1 e-17 374-428 


1559 


IPB000861D 


PKN/rnopnilm/rnoteKm rno-oinamg repeat 


1^ 61 6 28e-09 148-184 


1559 


IPB000961A 


Protein kinase C-terminal domain 


16.82 4.60e-13 150-184 


1559 


IPB000961D 


Protein kinase C-terminal domain 


91 9^ 5 64e-09 366-407 


1559 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 2.38e-13 370-408 


1559 


IPB001772A 


Kinase associated domain 1 


11 64 7 91 e-1 0 148-179 


1559 


IPB003527A 


MAP kinase 


17.00 3.49e-09 157-182 


1560 


IPB003104B 


Formin Homology 2 Domain 


18 81 6 87p 91 78^-814 


1560 


IPB003104C 


Formin Homology 2 Domain 


20.33 1.27e-14 957-984 


1565 


IPB000590A 


Hydroxymethylglutaryl-coenzyme A 
synthase 


i/.iz i.uue-zo ou-/^ 


1565 


IPB000590B 


Hydroxymethylglutaryl-coenzyme A 
synthase 


lo.ol l.o/e-o/ /D-iuy 


1565 


IPB000590C 


Hydroxymethylglutaryl-coenzyme A 
synthase 


1 7 m 1 AAo /in i in 1 ^6 

1 1 .yy l.UUe-4U 11U-1DO 


1565 


IPB000590D 


Hydroxymethylglutaryl-coenzyme A 
synthase 


13. dj o.oue-zi loD-iyo 


1565 


IPB000590E 


Hydroxymethylglutaryl-coenzyme A 
synthase 


20.64 3.Uoe-31 DVO-Djy 


1565 


IPB000590F 


Hydroxymethylglutaryl-coenzyme A 
synthase 


16.80 4.27e-32 384-412 


1565 


IPB000590G 


Hydroxymethylglutaryl-coenzyme A 
synthase 


in 71 1 OA^ 40 448 ^09 i 


1568 


IPB003016 


2-oxo acid dehydrogenases acyltr an sf erase 
component lipoyl binding site 


on .<Q 9 1 7f>„90 101-1 17 


1572 


IPB002043C 


Uracil-DNA glycosylase 


14 40 9 94p-00 666-676 


1574 


IPB000637A 


HMG-I and HMG-Y DNA-binding domain 


12.35 7.32e-10 275-295 


1574 


IPB000840C 


Matrix protein (MA) 


15.25 8.46e-09 245-290 


1574 


IPB000949H 


ELM2 domain 


21.43 9.41e-09 218-264 


1574 


PR00049D 


Wilm's tumour protein signature IV 


0.00 4.66e-09 254-268 


1574 


PR00456E 


Ribosomal protein P2 signature V 


3.08 5.60e-09 184-198 


1575 


IPB002567G 


Cell fusion glycoprotein K 


17.37 9.32e-09 231-270 


1577 


PR00756B 


Membrane alanyl dipeptidase (Ml) family 
signature II 


15.53 3.70e- 13 232-247 
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1577 


FK007D6C 


— — — . — rz 

Membrane alanyl dipeptidase (Ml) family 
signature III 


11 CO 1 QAa 11 Q1A ooa 
11. DZ l.o4e-ll 31U-3ZU 


1 C7A 

id /y 


td"daaaoo 1 
IroUUUZZl 


Protamine P 1 


^ ar s nip 11 a^a a<.a 

D.4o O.vie-11 OjU-OjO 


id /y 


tdtjaaaoo 1 
IrrJUUUZZl 


Protamine P 1 


^ AS S ^A** AO A9A A*\0 

d.4o o.Due-uy OZO-ODZ 


1 COO 

id /y 


IroUUUZz 1 


Protamine P 1 


AS Q A<C/=. HQ £AA f^fifk 


1 COA 

id /y 


T"DT3 a a a /t GO "D 


protamine z (rKJViz ) 


OA ^ 7Hp AQ ££0 AQA 

d.zo d. /ue-i/y ooz-oyo 


id /y 


tt>'daaa/igot3 


DfMntviiria O /"DO A /f 0\ 

Protamine z yr jkjviz ) 


^ OA 7 ^fip HO Aoo. AS7 


id /y 


tdtjaaaqoo 
IrxiUUUoZZ 


r 7\-tnr> ■Pi-n i-r POLIO 4"tm£± 

Zvinc ringer, L^zriz type 


1A A7 7 ^£j» HQ *RA A1 1 


1579 


IPB003868A 


Herpesvirus UL31-like protein 


9.31 6.11e-09 640-654 


1580 


TTITJAAAOI C A 

1PB000215A 


Serpins 


id\ ui D.yie-i/ oo-oo 


i con 
1580 


TT1DAAAO 1 C*D 

1PB000215B 


Serpins 


O CO O OQa HQ 1 AA 178 

y.o/ z.zye-uy loo-i/o 


If OA 

1580 


TT1T3AAA11 C/~* 

1PB000215C 


Serpins 


1 "3 on a QO/a 1 id i qa o 1 n 
13. yu 4.3Ze-14 iyo-ziu 


1580 


IPB000215D 


Serpins 


15.35 4.41e-14 407-433 


1581 


ttitn r\ f\r\ r f nn 

IPB000557B 


Calponin family repeat 


1*700 1 /CT „ 11 1 1 A 10A 

17.28 1.63e-ll 110-129 


1581 


IPB001715A 


Calponin homology (CH) domain 


1 A n A n O/C^ AA 11/1 10/1 

10.74 7.8oe-09 114-124 


1581 


PR00888C 


Smooth muscle protein/calponin family 
signature III 


1 1 CO A 0/C~ 1 A 1 1 A IOC 

11.52 9.8oe-10 110-lzD 


1581 


T\T% AAOOAtl 

PR00889B 


Calponin signature II 


n c/i n nfia ao 101 i oo 

y.D4 y.uue-uy izi-i^cs 


1 c oo 

1582 


TTIDAAA A OO 

IPB000483 


Leucine rich repeat C-terminal domain 


11 1Q/lC/l/a1A Q/^Q 'JQQ 

1 l.lo 4.D4e-lU Joy-ooJ 


1 coo 

1582 


TT>¥"» A AO AA/CD 

IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


OAOQ "7 Ho 11 /IOC CIO 

ZU.zo 3.3De-ll 4/D-D1Z 


1 c oo 
IDoz 


T5T> AAA1 A A 


Leucine-rich repeat signature I 


1 1 oo o riQo no oc i ns 
1 1. /z z.uye-uy yD-iuo 


1 coo 
IDoZ 


DD AA A 1 ATD 


Leucine-rich repeat signature II 


1 1 ai i nn<= 1 n iah 1 


1 coo 
1582 


T>T> AAA 1 A"D 

PR00019B 


Leucine-rich repeat signature II 


1 1 /tO C AOo 1 n QO 1 f\<\ 

1 1 ,4Z d.o /e-iu yz- iud 


1 c oo 

1582 


"D"D AAA1 AD 

FKUOOiyhJ 


Leucine-rich repeat signature II 


11 zio^/iCo no 11^ 1 oo 
1 1.4Z o.4De-uy i lo-izy 


1582 


PR00019B 


Leucine-rich repeat signature II 


11.42 9.18e-09 164-177 


1586 


PR01256B 


Otxl transcription factor signature II 


C AO *7 C7„ AA /I C 

5.92 7.57e-09 33-45 


1587 


IPB001841 


RING linger 


1 A /C A 1 AA^ AA 1 O^ 1 AC 

10. oy i.ooe-uy loo-iyD 


1587 


IPB002867B 


Cysteine-ncn domain (CoHC) 


OO CA A 1 C Q 1 C 1 OC OA/1 

zz.DU 4. IDe-l j 18D-ZU4 


1590 


IPB002889B 


WSC domain 


1 1 T/C O OOo AO QO 1 /t A 

11. /o o.ooe-uy yo-144 


1590 


TTHT5AAO O O AT> 

1PB002889B 


WSC domain 


1 1 0*C O 7A 0 1 A 1 AA 1 /I £ 

1 1. /o y. /ue-iu 1UU-14D 


1592 


TTiDAAO OOOT7 

IPB003888E 


FY-rich domain N-terminus 


1 O OA A AAq 1 A ^C<1 01 A 

ly.oU y.uue-iu 001-/1U 


1 COO 

iDyj 


TTJT3AAA1 C\1 A 


Pyridine nucleotide-disulphide 
oxidoreductase class-II 


11 ^ A HQp 1 A OAO OA7 

ii. dd o.uye-iu /4/-/0/ 


1 CQO. 

iDyj 


tdtjaaacc^ a 


Fibrillar collagen C-terminal domain 


11 AA 9 17p 10 4S1 

1 1 .h-o z. i /e- iu 'fo i j i o 


1593 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 6.14e-09 475-512 


1593 


rnn AAAOOCt* 

lPB000ob5B 


Fibrillar collagen C-terminal domain 


1Q 1C O Ifio AO AAQ AAO 

ly.iD /.3ue-uy ouy-ooz 


1593 


IPB000885B 


Fibrillar collagen C-terminal domain 


19.15 9.57e-10 320-373 


1593 


TTitlAAl AT> A 

IPB001073A 


Complement Clq protein 


OO 1 A A OA.-. 1A /lOO f 1T 

22. 14 4.20e- 10 483-D 1 / 


1593 


IPB001073A 


Complement Clq protein 


22.14 5.50e-09 486-520 


1593 


IPB001073A 


Complement Clq protein 


22.14 6.48e-09 342-376 


1593 


IPB001073A 


Complement Clq protein 


22.14 7.19e-09 726-760 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 1.00e-08 480-532 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 3.43e-09 468-520 


1593 


TT>r>AA1 A A O A 

IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


O/T 1 O A zrO« AA /C 1 O /"/CC 

26.12 4.68e-09 613-665 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 5.21e-10 462-514 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 6.17e-09 483-535 


1593 


IPB001442A 


C-terminal tandem repeated domain in type 
4 procollagen 


26.12 7.81e-09 459-511 
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1593 


IPB001442B 


_ 

C-terminal tandem repeated domain in type 

4 procollagen 


IZ.oo z.34e-uy 4o /-OU / 


1593 


T1YHAA1 A A^Tt 

IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


17 1Q £11^ CiQ AQ1 Zr\"\ 

lz.do o.iie-uy 4o1-dui 


1593 


TT1T5AA1 A A I'D 

IPB001442B 


C-terminal tandem repeated domain in type 
4 procollagen 


IO IQ 7 (?Oo f\Q AQA <f\A 

1J..D0 /.oye~uy 4o4-du4 


1 DyJ 


TDDflA 1/1^1 


Bacterial transferase hexapeptide repeat 


1 S D7 9 1 00 ^Ad-^lQ 
10. \J / J^v/y 


1 ZCll 


lrr>UU14D i 


Bacterial transferase hexapeptide repeat 


lo.U/ J.Hc-lU jjU-joj 


1 <Ci1 

L jy3 


iJrr5UU14M 


Bacterial transferase hexapeptide repeat 


1 Q. (Y7 8 1C\f> 1 H ^£ ^Q1 
lo.U/ O.ZUc-lU JJO-J7I 


1 


lrr>UU34 /4A 


GntP family permease 


9H HI 7 QRf> HQ 1 7Q 910 


IDyo 


T>T> nmAQ a 


Type I antifreeze protein signature I 


^ 79 1 nn<= ns ^97 ^41 

j./Z l.UUc-uo «)Z/-J i +1 


1 <Q1 


DPnmnfi a 
JrK.UU3UoA 


Type I antifreeze protein signature I 


^ 79 ^ 77f> 00 ^71 -^RS 


1593 


PR00308A 


Type I antifreeze protein signature I 


3.72 6.13e-10 372-386 


1 cno 
LDyD 


T}T> ftAOAO A 

rKUUoUoA 


Type I antifreeze protein signature I 


*X 79 Q *X 1 *=» HQ 7^£ 77H 

j./z yjie-uy /jo- / /u 




T>T> A AO A O A 


Type I antifreeze protein signature I 


^ 77 O AS*a 1 n 7^\7 7^^ 




T>T> AAQ AUP 


Type I antifreeze protein signature III 


9 7Q 9 ^n«=» HQ ^7^ 'XQ.A 


1 cm 
LDyD 


rKUUiUoU 


Type I antifreeze protein signature III 


9 7Q Q COq AQ QO/l 'IQ'i 

z. /y o.oze-uy jzh-ooj 


1593 


PR00308C 


Type I antifreeze protein signature III 


2.79 9.53e-09 372-381 


1593 


PR00308C 


Type I antifreeze protein signature III 


z. /y y.Dye-iu 5 /jooz 


1593 


PR00350E 


Vitamin D receptor (11 nuclear receptor) 
signature V 


11.57 4.21e-09 13-32 


1593 


PR01286E 


Orphan nuclear receptor NOR1 signature V 


5.27 9.02e-09 365-386 


1593 


PR01500B 


Tropoelastin signature II 


7.08 1.20e-26 229-249 


1593 


PR01500C 


Tropoelastin signature III 


6.02 1. 3 le-24 306-324 


1593 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3,91 3.47e-09 373-383 


1593 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3.91 5.85e-09 326-336 


1593 


PR01511D 


Kvl.4 voltage-gated K+ channel signature 
IV 


3.91 5.85e-09 757-767 


1595 


IPB000130 


Neutral zinc metallopeptidases, zinc- 
binding region 


5.86 5.71e-09 187-197 


1595 


PR00756C 


Membrane alanyl dipeptidase (Ml) family 
signature III 


11.52 4.81e-12 151-161 


1595 


PR00756D 


Membrane alanyl dipeptidase (Ml) family 
signature IV 


1 A TO O /Tr_ in 1 OT OA1 

10.78 8.65e-19 187-202 


1595 


TIT* AA1f^T7 

PR00756E 


Membrane alanyl dipeptidase (Ml) family 
signature V 


1U.3/ Z.4Ze-lD ZUo-Zlo 


loUU 


IrBUUU/ loU 


Thyroglobulin type- 1 repeat 


1 ^ AQ 7 ^7» 1 'X 110 

iD.4y /.o /e- 10 yo- 1 iu 


1601 


PR00342A 


Rhesus blood group protein signature I 


13.65 4.75e-21 12-30 


1606 


TTiDAAT) 1 AID 

1PB002219B 


Phorbol est ers/diacyl glycerol binding 
domain 


19 K'Z Q llo AO O^n O/l^ 

lz.dd y. 1 oe-uy ZJU-Z4D 


1606 


IrBOOzcD / A 


tAAt zinc rmger 


1A A CQa 01 OO 

14. oy 4.soe-zi oo-yy 


1606 


mn a AO A A A 

IPB002999C 


Tudor domain 


in 11 o 1 Co no <;ao. ^40 
lu.DJ y. ioe-uy j4U-04y 


1606 


TDDAAI iZA A"D 

1PB003649B 


B-Box C-terminal domain 


09 1 fk 7 *\7*» in ^8 77 
zz. 10 /.o/e-iu jo- / / 


1606 


TYT> A AO /T>! A 

PR00364D 


Disease resistance protein signature IV 


1/1 QQ 1 AA/> AQ T2(\ HA£. 

14. oy l.UUe-Uo /3U-/4D 


lulj 


TPRnn?no4.A 

IrDuuZUU'T/V 


ruxy-a.ucnyid.xe oinamg pruicin, unique 
domain 


IS S ^6p-10 1S0-1Q7 


1614 


PR00258B 


Speract receptor signature II 


7.94 7.75e-13 64-75 


1614 


PR00258C 


Speract receptor signature III 


9.05 2.72e-09 79-89 


1614 


PR00258D 


Speract receptor signature IV 


14.29 6.47e-10 110-124 


1614 


PR00258E 


Speract receptor signature V 


14.06 8.78e-13 133-145 


1615 


IPB003780E 


Cytochrome oxidase assembly 


15.53 7.40e-14 59-83 


1616 


IPB000135D 


High mobility group proteins HMG1 and 


2.13 1.00e-12 318-342 j 
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HMG2 




1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 1.10e-10 317-341 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 2.16e-15 322-346 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.35e-ll 327-351 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 3.82e-09 313-337 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4. 18e-09 316-340 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.47e-12 319-343 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.53e-14 320-344 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.58e-ll 328-352 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 4.58e-12 326-350 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 5.1 le-15 324-348 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.00e-09 330-354 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.69e-14 325-349 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 7.85e-10 329-353 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.12e-16 323-347 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 8.91e-09 315-339 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 9.80e-10 312-336 


1616 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


2.13 9.87e-15 321-345 


1616 


IPB000533E 


Tropomyosin 


11.32 J. loe-Uy 3oU-41 / 


1616 


IPB000637B 


HMG-I and HMG-Y DNA-bindmg domain 
(A+T-hook) 


14.21 3.45e-09 324-342 


1616 


IPB000637B 


HMG-I and HMG-Y DNA-binding domain 
(A+T-hook) 


1/1 01 n 1 On no no 

14.21 y.ioe-uy 3zo-34o 


1616 


IPB001422C 


Neuromodulm (LrAP-43) 


10. oz 1.34e-uy jzZOj / 


1616 


TT1T>AA1 /1<T>/~t 

IPB001422C 


Neuromodulm ((jrAr-43) 


lo.oZ z.uie-uy d i o-jjj 


1 £ 1 £1 

1616 


TTJt)AA1 /(OOP 

1PB001422C 


JNeuromodulin (LrAr-43 j 


10.0Z z.ooe-uy jzu-jjj 


1 £L 1 C 

lolo 


TDDAA1 /toon 

1FBU0142ZC 


JNeuromoduiin ^LrAr-43) 


10. oZ j.jOc-Uy jZI-jjD 


1 C 1 £Z 

lolo 


TDDAA1 /nor* 
lPJtSUU14ZZC 


JNeuromoauiin ^OAr-43} 


10. oZ 4.0/e-lU jZj-jjo 


lolo 


TT1DAA 1 /loir 1 

1PB001422C 


JNeuromoauiin ^LjAJt-4J j 


10. oZ D.oOe-lu 


1 61 6 

1 0 1 \J 


lrDUU IH-ZZV^ 


>Jf»iirrimnHii1in ffrAP-4^ 

INClil UI11UUU1111 ^V-I-TYl '^J 


16 82 7 33e-13 319-354 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 7.63e-10 317-352 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 7.64e-09 315-350 


1616 


IPB001422C 


Neuromodulin (GAP-43) 


16.82 9.29e-10 314-349 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e- 10 325-334 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 326-335 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 327-336 


1616 


IPB001580F 


Calreticulin family 


2.93 4.94e-10 328-337 
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1616 


IPB001580F 


Calreticulin family 


oqi/j R^r_OQ ^^1-^40 


1616 


IPB001580F 


Calreticulin family 


. y D / . / J C L \J JJU J*TJ 


1616 


IPB001990C 


Granins (chromogranin or secretogranin) 


33.59 8.59e-09 309-356 


1616 


IPB002172 


Low density lipoprotein (LDL)-receptor 
class A (LDLRA) domain 


i.oi z.yoe-ii jz-o4 


1616 


IPB002172 


Low density lipoprotein (LDL)-receptor 
class A (LDLRA) domain 


1 11 Q IHp 17 0^ 1 nr . 


1616 


IPB002360C 


Involucrin 


1 ^ Q Qfi*» HQ ^77 "X&R 

id. jo y.oue-uy jz/-joo 


1616 


PR00194D 


Tropomyosin signature IV 


9.54 8.25e-09 323-346 


1616 


PR00261A 


Low density lipoprotein (LDL) receptor 
signature I 


1 C Aft A 1 Qa 1 f\ A'X (\A 

ij.4y 4. iye-iu 40-04 


1616 


PR00261A 


Low density lipoprotein (LDL) receptor 
signature I 


i ^ aq a /^r>o no 9. a 1 
iD.4y H-.oue-uy oh-iuj 


1616 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 


K n n qa 0 1 a OA 1 
1D.1Z Z.oUe-lU 04-1UD 


1616 


PR00261B 


Low density lipoprotein (LDL) receptor 
signature II 


if T) ^ AOp AQ A'X fZA 

ID. 1Z 0.\JDQ-\Jy 4J-04 


1616 


PR00261C 


Low density lipoprotein (LDL) receptor 
signature III 


1 O T*i 1 nia AO A'X £.A 

lo. 11 5.\)jQ-\jy 43-04 


1616 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


1 r on A oc„ 1A 0/1 1 rt< 

16.87 4.2De-iU o4- 1 U j 


1616 


PR00261D 


Low density lipoprotein (LDL) receptor 
signature IV 


1 r on o /1C„ i 1 A1 £.A 
16. o/ 0.43e-l 1 43-04 


1616 


PR00261E 


Low density lipoprotein (LDL) receptor 
signature V 


1 O £Ln *7 Clo AO QA 1 f\*Z 

lo.OZ /.32e-Uy 54-1UD 


1616 


PR00261E 


Low density lipoprotein (LDL) receptor 
signature V 


1 O /CO "7 OIq AO A'X fc.A 


1616 


PR00261F 


Low density lipoprotein (LDL) receptor 
signature VI 


1* AA O fc^t* 1 1 A'X f\A 
ID. 40 Z.ODe-1 1 4J-04 


1616 


PR00261F 


Low density lipoprotein (LDL) receptor 
signature VI 


1 ^ Af* fx Q1 p_1 0 £4-1 05 
1D.40 O.Z7lG-iv OH-iUJ 


1617 


IPB002554A 


Protein phosphatase 2 A regulatory B 
subunit (B56 family) 


Ly.yj z.zye-z/ loj- iyn 


1618 


IPB003029B 


SI RNA binding domain 


9.42 8.80e-09 407-418 


1619 


IPB000001D 


Kringle 


I L.J L Z.OOC L£ JJ~ / L 


1619 


IPB000126A 


Serine proteases, V8 family 


11.75 6.60e-09 55-70 


1619 


IPB001254A 


Serine proteases, trypsin family 


y.yo 1 ,03e- 14 jj- / 1 


1619 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9.36e-14 56-71 


1619 


PR00839B 


V8 serine protease family signature II 


11.20 4.95e-09 55-72 


1620 


IPB000934C 


Serine/threonine specific protein 
phosphatase 


OA OK 1 AAq O/J '71 1 1 A 
ZU.ZD J.UUe-j4 / I- 114 


1620 


IPB000934D 


Serine/threonine specific protein 
phosphatase 


1 a otz ft kk.** i *7 i i /c ! 
14.00 y.DDe- 1 / IIO-UD 


1620 


IPB000934E 


Serine/threonine specific protein 
phosphatase 


«f 1 /" -> oO« T7 1 OA T)1 

25.16 2.8 be-z/ loU-zzj 


1620 


TIT> A A 11/1 T\ 

PROOl 14JD 


oerine/inreonine piio&piiaiaac idiiiiiy 
signature IV 


11.97 4.60e-24 77-103 


1620 


PR00114E 


Serine/threonine phosphatase family 
signature V 


17.50 7.63e-23 106-133 


1620 


PROOl 14F 


Serine/threonine phosphatase family 
signature VI 


16.38 2.93e-15 170-190 


1620 


PROOl 14G 


Serine/threonine phosphatase family 
signature VII 


15.51 9.57e-17 198-214 
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1623 


PRO 1008D 


Flagellar L-ring protein signature IV 


9.99 8.13e-17 35-49 


1623 


PR01008E 


Flagellar L-ring protein signature V 


13.32 6.48e-17 51-65 


1623 


PR01010G 


Flagellar P-ring protein signature VII 


12 AO 8. 1 le-20 88-106 


1623 


PR01010H 


Flagellar P-ring protein signature VIII 


•i -to O O O _ 1 O 11/1 1 O O 

12.18 8.88e-18 114-132 


1623 


PR01010I 


Flagellar P-ring protein signature IX 


*i A n 1 t 11 inn i f\f\ 

7.74 8.1 le-11 178-190 


1624 


IPB002195C 


Dihydroorotase 


11.49 8.26e-l2 98-109 


1624 


IPB002604F 


Chlorohydrolase 


ll. 98 3.86e-09 1 30-139 


1624 


IPB003764D 


N-acetylglucosamine-6-phosphate 
deacetylase 


33.54 8.05e-09 114-156 


1625 


IPB000319C 


Aspartate-semialdehyde dehydrogenase 


12.28 6.63e-10 61-77 


1625 


IPB000319D 


Aspartate-semialdehyde dehydrogenase 


1 o r\r\ o r\r\ ~ 1 /■ m 110 

13.92 8.90e-16 91-1 12 


1625 


IPB000319E 


Aspartate-semialdehyde dehydrogenase 


5.68 3.68e-09 155-165 


1625 


IPB000319F 


Aspartate-semialdehyde dehydrogenase 


12.86 4.96e-10 238-251 


1625 


IPB000319G 


Aspartate-semialdehyde dehydrogenase 


14.74 9.79e-16 258-278 


1625 


IPB000534A 


Semialdehyde dehydrogenase 


15.79 7.88e-ll 64-81 


1628 


IPB000847 


Bacterial regulatory protein, LysR family 


15.44 8.67e-15 21-54 


1628 


IPB001804D 


Isocitrate and isopropylmalate 
dehydrogenases 


24.39 9.18e-09 208-235 


1631 


IPB000121C 


PEP-utilizing enzyme 


12.96 5. 15e-15 96-108 


1631 


IPB002192G 


Pyruvate phosphate dikinase, PEP/pyruvate 
binding domain 


23.23 3.77e-16 78-109 ' 


1631 


PR00811C 


Bacterial general secretion pathway protein 
D signature III 


10.47 6.20e-09 9-19 


1632 


IPB002029C 


Aspartate and ornithine 
carbamoyltransferase family 


27.21 8.14e-21 401-439 


1632 


IPB003462C 


Ornithine cyclodeaminase/mu-crystallin 
family 


8.98 7.00e-09 500-513 


1632 


PR00101C 


Aspartate carbamoyltransferase signature III 


15.49 8.93e-10 410-427 


1632 


PR01375B 


Salmonella/Shigella invasin protein B 
signature II 


7.54 9.67e-09 201-217 


1632 


PR01507H 


Melanin-concentrating hormone 1 receptor 
signature VIII 


11.93 6.53e-09 354-373 


1634 


IPB001568B 


Ribonuclease T2 family 


11.65 1.00e-17 101-112 


1643 


IPB001014 


Ribosomal L23 protein 


22.21 6.03e-20 217-249 


1645 


PR01303D 


Plasmodium circumsporozoite protein 
signature IV 


10.57 6.88e-09 486-503 


1647 


IPB001518B 


Argininosuccinate synthase 


12.16 6.87e-12 58-70 


1647 


IPB001518C 


Argininosuccinate synthase 


27.71 3.57e-27 72-114 


1647 


IPB001518E 


Argininosuccinate synthase 


20.16 5.24e-14 172-188 


1651 


PR00453A 


Von Willebrand factor type A domain 
signature I 


11.78 3.89e-13 88-105 


1651 


PR00453B 


Von Willebrand factor type A domain 
signature II 


13.84 6.10e-13 125-139 


1651 


PR00453C 


Von Willebrand factor type A domain 
signature III 


11.84 3.57e-10 246-254 


1654 


IPB000222B 


Protein phosphatase 2C subfamily 


15.80 o.ooe-09 94-104 


1654 


IPB000222C 


Protein phosphatase 2C subfamily 




1657 


IPB000917A 


Sulfatase 


9.52 4.79e-10 47-58 


1658 


IPB000063 


Thioredoxin 


9.89 1.69e-16 346-359 


1658 


IPB000063 


Thioredoxin 


9.89 2.38e-16 481-494 


1658 


PR00421A 


Thioredoxin family signature I 


10.35 l.OOe- 10 481-489 


1658 


PR00421A 


Thioredoxin family signature I 


10.35 2.13e-ll 346-354 


1658 


PR00421B 


Thioredoxin family signature II 


10.23 1.00e-ll 354-363 


1658 


PR00421B 


Thioredoxin family signature II 


10.23 7.75e-12 489-498 
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1658 


PR00421C 


Thioredoxin family signature III 


11.02 1.00e-09 394-405 


1658 


PR00421C 


Thioredoxin family signature III 


1 1 07 3 84e-10 533-544 


1661 


IPB003134F 


Repeat in HS 1/Cortactin 


15.66 9.68e-09 101-149 


1662 


IPB002360C 


Involucrin 


1 5 16 0 00p~09 56-97 


1662 


PRO 1270 A 


Hi stone deacetylase superfamily signature I 


14 77 6 R5p-10 1715-1258 

1*+. /Z O.OJC-IV/ IZ-JJ-liJO 


1662 


PR01270B 


Hi stone deacetylase superfamily signature 

TT 
11 


0 70 S 76p-1 8 1 769-1 784 


1662 


PR01270C 


xiistone aeacetyiase superiamny signaxure 

TTT 
111 


10 80 1 57p-09 1160-1370 


1663 


PR01 104 A 


Anaphylatoxin chemotactic receptor 
signature i 


11 81 8 50e-09 83-94 


1666 


IPB000737 


Serine protease inhibitor, squash family 


13.14 8.31e-14 85-104 


1 £ eZ /Z 

1666 


lrBOuZUyo 


Seminal vesicle protein I repeats 


q 00 1 00e-40 31-67 


1666 


lPB00209o 


Seminal vesicle protein I repeats 


9 00 1 43e-16 43-79 


1666 


IPB00209o 


Seminal vesicle protein I repeats 


9 00 2 54e-17 19-55 


1666 


TT1DAA1AAC 

lPBOOzuyo 


Seminal vesicle protein I repeats 


9 00 3 79e-15 37-73 


1666 


IPB00209b 


Seminal vesicle protein I repeats 


9 00 7 1 5p-1 S 25-61 


1666 


TtYDAAOAAO 

lPBOOzUyo 


Seminal vesicle protein I repeats 


9 00 9 S3e-1 0 49-85 


1666 


lPBUuzzzlr> 


W Ax -type ^wney aciuic Jrroieinj iour- 
aisuiiiue core aomam 


17 17 1 14e-?0 92-1 13 


lOOO 


rKUUUloA 


^f-uisuipniae core Mgrid.iu.ic i 


14 31 7 00e-09 69-78 


looo 


nr) AAAAIP 


4-Qisuipniue core bigiiaiuic in 


7.81 7.63e-12 98-107 


1 & A 

looo 


JtivUUUUjjU 


A /Ti enlt-tVii f\t* c* r\VF* ci on Clti it"P \\f 
T—ClloUipillUC L/UiC olgllctlUlC J. V 


9.63 8.50e-09 108-116 


1667 


IPB001909 


KRAB box 


17.37 1.00e-32 108-142 


1673 




ivTyv ursA-Dinuing aomain 


75 34 7 08e-09 248-294 


1674 


IPB001322A 


Intermediate filament tail domain 


30.52 7.88e-ll 63-116 


1674 


PR01002A 


Flagellar protein FlgJ signature I 


10 07 9 65p-09 761-281 


1676 


IPB001865A 


Ribosomal protein S2 


35.08 8.94e-ll 53-106 


1678 


PR00795C 


Ryanodine receptor signature III 


(L 1 0 7 Q7p 00 1 07- 1 1 1 


1684 


IPB001998A 


Xylose isomerase 


15.52 3.05e-09 214-227 


1684 


IPB002198 


Short-chain dehydrogenase/reductase 
(SDR) superfamily 


Ly./Z o.oUe-ZU ijj-IOo 


1684 


PR00080C 


Short-chain dehydrogenase/reductase 
(SDR) superfamily signature III 


1 A AQ Q C^/a 1^ 177 
14. 4y 0.03e-lj lDJ-l/Z 


1684 


PR00081A 


Glucose/ribitol dehydrogenase family 
signature I 


lu.u/ o.zue-iu d-zz 


1684 


PR00081C 


Glucose/ribitol dehydrogenase family 
signature III 


12.71 7.97e-09 127-143 


1684 


PR00081D 


Glucose/ribitol dehydrogenase family 
signature IV 


1D.ZZ H. 1 ie-lU lJj-1 /Z 


1684 


PR01167F 


Insect alcohol dehydrogenase family 
signature VI 


1 1 77 7 1/1p-09 1 57-1 70 
x i .zz z.jtc yjy i jz, i / \j 


1685 


TITDAA1 C^i £li~^ 

1PB001526C 


Ly-6/u-PAR domain 


11 04 7 01p-09 197-212 


1685 


PR01514F 


Kv2. 1 voltage-gated K+ channel signature 

VI 


2.00 5.29e-09 220-231 


1686 


IPB000095E 


PAK-box /P21-Rho-binding 


17.62 7.35e-13 679-724 


1 £££ 
lOoO 


TPROO0Q61 P 


Prntpi-n Vinncp ("VtPTTnitial dnmnin 


15.48 5.50e-12 678-712 


1686 


IPB001245A 


Tyrosine kinase catalytic domain 


22.45 8.88e-16 671-711 


1686 


IPB001245B 


Tyrosine kinase catalytic domain 


21.68 8.64e-09 736-774 


1686 


IPB003527C 


MAP kinase 


14.70 5.30e-09 663-711 


1686 


IPB003605C 


GS motif preceding kinase domain in TGF 
beta receptor 


14.92 2.91e-15 570-617 


1686 


IPB003605D 


GS motif preceding kinase domain in TGF 
beta receptor 


12.41 4.39e-16 667-708 



WO 03/054152 PCT/US02/39555 



Table 3B 
438 



SEQ 

ID 

NO: 


Database ! 
entry ID 


Description 




1686 


IPBOOioUDb 


kjo motit preceding Kinase aomain iii i 
beta receptor 


21 14 ^ 61e-15 710-749 

Z^l.l^T J.U1C U /If / " 


1686 


IPB003605X 4 


GS motif preceding kinase domain in TGF 
beta receptor 




1686 


PR00653D 


Activin type II receptor signature IV 


19 01 Q 7t^p- 19 607-71 R 


1686 


PR00653E 


Activin type II receptor signature V 


14.33 2.93e-16 871-890 


1687 


IPB000832A 


G-protein coupled receptors family 2 
(secretin-like) 


1^ £1 ^ 9^p-1^ Q4-10Q 


1687 


PRU0491A 


Vasoactive intestinal peptide receptor 
signature I 


19 69 9 OSp-OQ 1 09-1 20 


1687 


rinni i c A A 

rKUi 1D4A 


vasoactive mxesiinai pepuae reccpLui i 

oi oncitiirp T 
m mi UILUC 1 


11 05 3 84e-29 58-79 


loo / 


PP01 1 
r\K.Ul l^^frJ 


\lncr\ r 3r*tl\m infpctinal ■nP'ntiHf* rpppntnr 1 
V aoUaLLl Vv HUCoLlilal pc^JLlUC ic^C|;iui a 

ciondfitrp TT 


12.24 1.39e-22 95-111 


lOoo 


PP00J.01 A 

JrJtvUU z +U IJrV 


Q T-T9 rlrMnaiti Qionnturp T 

k31 1Z< UVJlildlll OlgllCXIUl t/ J. 


12.88 6.29e-15 214-228 


1 £GQ 
luoo 


PPOOA*\9 A 


QT-T^ r1/~\m oin ci on citurp T 
OOj UUiIid.111 sijk^iiaauic i 


9.57 2.35e-10 150-160 


IDoo 


PP OOA^9R 
r^lvUU4DZX5 


C Q Hnm oin cionatiirp TT 
uiiJ UUHldlll ijlg,lictlUl c ll 


11.47 4.91e-12 164-179 


looo 




QT— T'? Homuin «;i onntnrp TTI 

\J±J.J LlWlllCllll olgllClLLIl C 111 


10.63 4.60e-09 181-190 


IDoo 


PP OO400D 


"KTpiitrrvnTiil r*\/f ncnl fnptnr 9 ^ionatiirp, TV 

1 N UU Ll f^llll V^j lUoUJ lavlUl L^lgliailti w x r 


11.47 8.29e-10 152-172 


IDOO 


PP01 960 A 


xj5i4_//^r,9_i_ pYpViano-pr ienform 1 siffnature I 


9.46 2.75e-09 5-17 


1 £Q7 


PP0001 Q A 


T piminp-ripn rpnpnt Qionntiirp T 


11.72 1.27e-09 660-673 


1 


PP0001 Q A 
Jr JvUUU i yj\ 


T piminp-ri^li rpnpnt Qianntnrp T 


11.72 1.55e-09 614-627 


1 £07 

loyj 


PPOOOI QR 

±^j\.uuu lyo 


T piifinp- fipVi TPtiPdi" ciondtiiTP TT 
1_#CU.L'1I1C~1 lV/Il ICpCdL olgllCtLUl^/ 11 


11.42 7.55e-ll 527-540 


loyo 


PPOOOI OR 
r ivUUUiyiD 


T ipi"np_ri pVi rpnpnt cionsitnTP TT 
1_»CUU111C-1 lwll icpcaL olgllatuiv/ 11 


11.42 8.91e-09 611-624 




PPOOAQQR 
_rK.UU4yyr5 


M<=»i i+-r/^r~»li i 1 r»\/tr\or\1 ■fbptnT* 9 ci on ntl ll*P TT 

rNcuuupiiii L-yiubui id^Lui z, di^iichuiv^ 11 


9.48 4.73e-09 34-53 


Yoyo 


TPROOI l^QH 


oyndpoiii 


22.58 4.58e-09 150-200 


1 90A 
1 /U4 


TPR000979 


ATP1 Ol /PT M/MATS familv 
t\. Ill VJ 1/ 1 J_>ivi/ ivixA. 1 0 icuim y 


14.36 8.53e-09 28-63 


1 7fK 
1 /Uj 


TPROOORQ9R 


Pinrtcrtmsil nrotpin S196F. 

1V1 UUoVJlllCll \Jl WLwlll OZiUJj 


13.49 3.50e-27 160-196 


1 /UO 


TPROOOQO^r* 
lr^xjUUUyuOU 


ZjLJJ UUllldlll 


23.89 2.54e-09 368-422 


1 90£ 

1 /uo 


TPROOOQ06n 


*7T Hnmm'n 
ZjVJJ LlVJllla.il 1 


23.89 5.71e-09 437-491 


1 /UO 


TPROOOQOnD 


7T Hrvmnin 


23.89 8.54e-09 401-455 


1 / UO 


TPROOOQOfiF 


7T TS Hnmain 

I—i -J UVJllld-lil 


22.11 1.38e-09 448-488 


1 706 
1 / UO 


TPR000Q06O 


7T Hrnrifiin 

/—1 \-J *J VlWillcllll 


25.85 3.64e-10 382-430 


1706 
1 / UO 


TPR000Q06O 


7\ Homnin 
z_iWw' Vavjiiitim 


25.85 5.87e-09 451-499 


1 706 
1 /UO 


TPROOI ^ROF 


C^5i1rptipn1in fnmilv 
\_-cin CLIL/U.1111 j.ciiini_y 


2.93 8.20e-09 317-326 


1 706 
1 / UO 


TPR00^71 6P 


P7SJA nr\1 "\/~m pracp ntnpoc* QllT>linit 
JVLN^A. pUiyillCla&C \JlllCgcl oUULllllL 


13.72 9.39e-09 314-334 


1 706 
1 / UO 


PP 0001 QR 
r^ivuuu i ysy 


T piipinp-ripTi i*pr»p5it ^ionfitnrp TT 
JUwUvlllC 1 Iwll lCLJ<h/&l> olgliaiui w 11 


11.42 1.55e-09 742-755 


1 706 
1 /UO 


PP 0001 OR 
i xvuuu \.y±y 


T piipinp-fipVi rpnpnt Qionntnrp TT 

JLjC UCll 1C~1 1L/11 IC^Jv/dL olgllclLUlv/ 11 


11.42 5.67e-10 773-786 


1 706 
1 /UO 


PP0141 S A 


AnVvrin rpnpnt Qionaturp T 
^viiiv_yiiii ickJwcii oigiiciiuit/ 1 


12.73 1.45e-13 415-427 


1 706 
1 /UO 


PP0141 5A 


Anlrvrin rpnpnt dionfitnrp T 

XAlllvyilll ICjJCClL OlglltlL Hi O 1 


12.73 4. 19e- 10 451-463 


1 7H£ 
1 /UO 


PPOl 41 ^R 


Atil/*\7t*in rpnpQt oitmcitiirp TT 
.rvllls.yi 111 !C[JCaL &lg,llcli.UlC 11 


10.23 1.90e-10 463-475 


1 706 
1 /UO 


PP0141 5R 


AnVvrin v&rx&at qi on nturp TT 
.rYllJYjfl 111 IvjJCal olgilciiu.it; 11 


10.23 2.93e-12 427-439 


1 7HC 

1 /Uo 


TPR000Q.99 
IJtdUUUOZZ 


7\r\c fin opr (^9FT9 tvnp 
£-ivW\j inigcij v^z<i iz< lyLJc 


14.67 1.00e-18 235-260 


1 70Q 
1 /Uo 


TPR000R99 


7inr finopr P9TT9 tvnp 

^-(11 It llllgCl, V^i^lli, IjrjJt/ 


14.67 2.93e-17 207-232 


1 70R 
1 /Uo 


TPR000R99 


Zinc finopr P9H9 tvne 

/—/lilt iiiigjV^i ; v. - i.y^.'v> 


14.67 6.57e-10 263-288 


1 7HQ 
1 /Uo 


TPR000&99 

IJtxjUUUO^^ 


7inp finopr P9H9 tvnp 
z-iiiii/ migci j ty^jM 


14.67 8.71e-10 53-78 


1708 


IPB001275 


DM DNA binding domain 


19.17 7.04e-09 223-262 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 1.00e-08 204-217 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 4.38e-09 232-245 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 7.00e-09 260-273 


1708 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.63e-09 50-63 


1708 


PR00048B 


C2H2-type zinc finger signature II 


5.52 7.43e-10 220-229 


1708 


PR00498A 


Neutrophil cytosol factor 1 signature I 


12.92 6.74e-09 353-361 


1709 


IPB002870A 


Reprolysin family propeptide 


12.22 4.09e-10 108-124 
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1710 


IPB000222B 


Protein phosphatase ZL/ suDiamiiy 


1 5 80 9 49e-09 1539-1549 


1710 


IPB001389D 


Flocculin repeat 


9.66 5.93e-09 1712-1739 


1710 


PR01391C 


Binary toxin B family signature III 


14 87 5 44p-00 950-977 


1711 


PR01099B 


Hydroxyethylthiazole kinase family 
signature II 


1 A 49 5 44p~0Q 6-97 


1712 


IPB000716C 


Thyroglobulin type-1 repeat 


1 7 69 5 A 1 p- 1 9 587-305 


1712 


IPB000867A 


Insulin-like growth factor-binding protein 


7 10 7 1 0p-0Q 48-55 


1712 


IPB000867B 


Insulin-like growth factor-binding protein 


11 44 4 1 1p-1 8 66-82 


1712 


PR01169I 


Ceratitis capitata alcohol dehydrogenase 
signature IX 


1 0 85 0 41 p-00 495-445 


1715 


IPB003308C 


Integrase zinc-binding domain 


10 01 1 42e-33 22-75 


1715 


IPB003308D 


Integrase zinc-binding domain 


8 ^1 7.43 e-09 94-106 


1715 


IPB003308E 


Integrase zinc-binding domain 


9.08 1.20e- 11 166-179 


1717 


IPB001134C 


Netrin, C-terminus 


17.82 7.82e-10 1317-1331 


1717 


TTkTi aa 1 r c\r\ a 

IPB001599A 


Alpna-z-macrogioDuun iamny 


10 97 1 45e-21 175-193 


1717 


TTYTIAA1 C C\C\Y} 

IPB001599B 


Alpna-z-macrogiODUiin iamny 


7.45 3.35e-14 264-276 


1717 


rnn e\r\i c r\f\ f\ 

IPB001599C 


Alpna-z-macrogioDunn iamny 


14.40 4.60e-15 291-307 


1717 


IPB001599D 


Alpha-2-macroglobulin family 


11.61 4.94e-15 783-793 


1717 


IPB001599E 


A 1«U n O <-» y» /-v #t1 /~V l-> till r\ TOUllll/ 

Alpna-z-macrogioDunn iamny 


11.06 1.60e-10 810-819 


1717 


IPB001599F 


Alpna-2-macrogioDunn iamny 


18.95 5.30e-27 840-869 


1717 


IPB001599G 


Alpha-2-macroglobulin family 


13.87 1.53e-13 988-997 


1717 


ttm^ AA 1 rAATT 

IPB001599H 


Alpha-2-macroglobulin family 


18.42 1.00e-23 1020-1047 


1717 


IPB001599I 


Alpha-2-macroglobulin family 


10.83 4.33e-12 1057-1066 


1717 


IPB001599J 


Alpha-2-macr ogl obulin family 


90 00 1 1 0e-1 6 1088-1113 


1717 


IPB001599K 


Alpha-2-macroglobulin family 


8.15 7.60e-13 1273-1284 


1717 


IPB001599L 


Alpha-2-macroglobulin family 


1 8 66 4 OOp-95 1 104-1 13 1 


1717 


IPB001599M 


Alpha-2-macroglobulin family 


13.29 1.75e-12 1445-1456 


1717 


IPB001599N 


Alpha-2-macroglobulin family 


OA 525 Q 99p 96 1408-1 510 
Z4.5D y.zze-zu i^+^o- ijju 


1717 


IPB001983D 


Translationally controlled tumor protein 


22.98 6.63e-13 1213-1253 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


O 11 1 AHa 1 1 1 1 1 1/11 

2.13 l.UUe-lZ 21 /-Z41 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


i 1*3 o /I 1 ^ if 01Q O/l 

2.13 2.4le-iD ziy-z43 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


1 1 O 1 Q 1 *7 OOA 1/1/1 

2.13 z.ooe-i/ Z2U-Z44 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 3.43e-14 Z15-24Z 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


O 1 1 A <Q*> 11 r )'\A 9^5 

2. 13 4. Doe- 11 Z14-2oo 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


O 1 1 < 0"5«a 10 09^ 94'7 
2.13 0.23e-12 ZZO-Z4/ 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


Z. 13 D.oUe-iU zio-Zo / 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


O 1 1 £L 1 So HQ 019 0^/^ 

2.13 o.ioe-uy ziz-zoo 


1718 


IPB000135D 


High mobility group proteins HMGl and 

rllVlvJZ 


'till 1 ^ OOO OA(^ 
Z.13 /.Doe- ID ZZZ-Z40 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 8.06e-ll 224-248 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 8.30e-14 216-240 


1718 


IPB000135D 


High mobility group proteins HMGl and 
HMG2 


2.13 9.13e-12 215-239 


1718 


IPB000135D 


High mobility group proteins HMGl and 


2.13 9.14e-17 221-245 
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JtvcallllS 






HMG2 




1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


9 io q i o^i i 

Z.1J i/.loc-ll LLj-z* l -xy 


1718 


IPB000135D 


High mobility group proteins HMG1 and 
HMG2 


Z, 1 J i?.U'TC~Ui? ZrZ.U""Z,JU 


1718 


IPB000637B 


HMCj-1 ana HMu- Y JJINA-Dinaing aomam 
(A+T-hook) 


1/191 1 00^-08 994-949 


1718 


IPB000637B 


HMCj-I ana rlMLr-Y JJJNA-Dmaing aomam 

f A 1 T t, ^ -.1 _\ 

(A-rl-noOKJ 


14 91 1 OOp-OS 99S-?4^ 


1718 


IPBOUOoi/B 


riMLr-l ana HJYHjr- Y jJiNA-Dinamg uomain 
^a i i -nooK ) 


14 91? OQp-00 99^-941 


l/lo 


TT>T5 A OH £ ^ 7T3 


xiivio-i ana riivivj- i i^i>/\-uiriuiii5 u.\jixicihi 
\/\ i i -nooK. ) 


14 21 4 82e-09 221-239 


171 Q 
1 / 15 


IFrJvJUUOj /o 


1-fAAOr T onH T-IA/IO-V FiTsJ A -hinHina domain 
rrivi^j-i duu riivivjr- i J-^iN-ri-u inning u.uiiia.111 

^A+T-nonlr"! 

^^A. 1 1 11VJVJ1V J 


14.21 8.36e-09 222-240 


171 Q 
1 / 1 O 


TPRAA1 AllO 


1NCUXUI11UU.U1111 ^vJ^A.1 ■tJ ) 


16.82 2.07e-10 218-253 


171 O 
1 / 1 O 




XT(^nrr\i-nr»rlii1in ffrAP-d'^ 
INCUrUIIlUQUllli ^Un.fTJ ^ 


16.82 2.13e-09 222-257 


1 11 ft 
l/lo 


TPR0A1 AllO 


irr»m r»Hn1 in ^(^AP-4'S^ 
1>I CUHJ111UL1U1111 ^ vJ/t-1 tJ^ 


16.82 3.70e-09 220-255 


1 / 1 O 


TPR001 AllO 


TvTfaiirrimorliilin ^rrAP-4^^ 

INCUl \JIllULlUllll ^V_I^VJT ""*T J 


16.82 3.84e-10 217-252 


1 71 8 




1NCUHJ111LIUU1111 ^VJ^A.i tJ^ 


16.82 6.68e-10 223-258 


1/15 




1NCUI UI1HJU.U1111 ^ ' / 


16.82 6.74e-09 221-256 


171 o 
1/15 


TPROA1 499P 


XTmirrvmoHnlin /TlAP-4^ 
INCUIUIIIUCIUIIII ^vj-rvjr tj^ 


16.82 8.93e-10 225-260 


1 71 S 
l/lo 


TPR001 ^ROF 
IrDUU 1 d our 


V-zclllCtll^Ullll lalliliy 


2.93 4.94e-10 224-233 


1 71 8 
1/15 


lJrJjUU l j our 


^c»1'i*< :3 >t , tr*iilin ■pmnilv 
v^ctiiCLiVvUiiii idiiiny 


2.93 4.94e-10 225-234 


1 71 C 
1/15 


TPRAA1 ^RAT7 


r^Qlff^'flr^iiliTl "Fja m 1 1 \/ 
v_>a.iiCLiutiiiii ldiiniy 


2.93 4.94e-10 226-235 


171 Q 
1/15 


TPRAA1 ^RAT? 
IJrJDUU 1 J5Ur 


I t ci1'f*#»'j"ir , iili'ri ~Tci tn 1 1 \ ^ 

v^tiiit/Licuini iciiiiny 


2.93 4.94e-10 227-236 


1 71 8 
1/15 


TPRA01 ^ROF 


V^ctllCLlivUllll idllllljr 


2.93 4.94e-10 228-237 


171 Q 
1/15 


TPRAA1 ^ROF 

ijtijUU ijour 


V^ali CL1CU1111 Idlllliy 


2.93 4.94e-10 229-238 j 


1 71 Q 
1/15 


TPRAA1 ^RAF 
IJr UUUl JOUr 


V_^dliCtlL/Ullll Xdlllliy 


2.93 4.94e-10 230-239 


1718 
l/lo 




r^nlrf^HonliTi familv 

\_sdli Ctlv< 141111 Xdllllljr 


2.93 4.94e-10 231-240 


1 71 R 
l/lo 


IT DUvlJOUr 


r^plrftifiiliTi fainilv 

VsdllC/lU'Ullll idllllljr 


2.93 4.94e- 10 232-241 


1 71 R 
1/15 


ljt Dvjyj i jour 


V^dllC/LlwLlllll Xdllllljr 


2.93 4.94e- 10 233-242 


1 71 R 
1/10 




r^alrRtipnlin "Fflmilv 

V^dll CLlvltllll Xdllllljr 


2.93 4.94e-10 234-243 


171 Q 

1/15 


TPRAA1 SROF 
Ir^JjUU 1 JoUr 


V«><11ICL1L/U11I1 ldlllliy 


2.93 7.75e-09 235-244 


171 Q 

1/15 


TPRAA71 1& A 
lr JjUUZ 


L^d-UIlCl 111 UUlildlll 


14.68 7.00e-09 119-135 


1 71 R 
1/15 


TPRA091 9^R 
IJr X5UUZ 1 ZUX5 


V^dUIlCllU UUlildlll 


12.04 5.20e-16 422-439 


1 71 R 
1/15 


TPR0091 96R 


V^dUllCl 111 UUlildlll 


12.04 7.86e-13 156-173 


171 Q 
1/15 


TPRAA9 1 l&O 


V^aUllCllIl UUlildlll 


12.35 8.41e-09 398-408 


171 Q 

1/15 


PP AA1 QAFi 
r JvUU iy^L^ 


iropomyubiii bigiidLuic i v 


9.54 1.94e-09 222-245 


171 o 

1/15 


PP AA1 Q^lTi 


iropomyosm signdLurc i v 


9 54 2 05e-09 220-243 


171 O 

1/15 


PP OA90^ A 


V>dUIlCllIl olglldLUlC 1 


17.38 3.50e-13 108-127 


1 71 R 
1/15 


pPAAOA^R 


r^dHVipfin ci onfiti iff* TT 

V_/dUlldlll olglldLUiCi xi 


20.09 3.30e-ll 551-580 


171 Q 

1/15 


PP 009OSR 


v^dUllCl ill olglldLUlC 11 


20.09 4.66e-14 434-463 


171 Q 

1/15 


PP AH90^R 


x^danenn signdiurc 11 


20.09 9.50e-21 168-197 


1 71 R 
1 / 15 




On H Ti pfin ci on nil lfp TTT 
V_/dUllCl 111 blgllaLUit ill 


13.59 3.25e-ll 397-409 


1 71 R 
1/15 


pp on90^n 


wdUlldlll DlglldLUtC 1 V 


12.22 4.46e-19 413-432 


1718 


PR00205E 


Cadherin signature V 


10.82 4.32e- 11 432-445 


1718 


PR00205F 


Cadherin signature VI 


19.57 4.77e-10 380-406 


1718 


PR00205F 


Cadherin signature VI 


19.57 5. 20e- 13 493-519 


1718 


PR00205F 


Cadherin signature VI 


19.57 7.30e-13 115-141 


1718 


PR00205G 


Cadherin signature VII 


13.05 8.71e-14 416-433 


1720 


IPB003360A 


US22-like viral protein 


16.29 4.27e-09 133-146 


1722 


IPB001128 


none 


11.42 5.15e-13 469-481 


1722 


PR00385A 


P450 superfamily signature I 


13.94 4. 19e-10 341-358 
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1 700 
1 / zz 


PP00QS5.P 


P450 qi lnprfomiK/ oi ond+i it'P TT1 
JTHJV isUpoildillliy blglldlUiC 111 


17 08 6 29e-10 395-406 


1 TOO 

1 /zz 


ppooi25r> 


P450 oi ir\/*vfinmi 1 t/ d'onafiirp "Wf 
rHJ\J oupcilainiiy blJ^llalLUC i v 


11 11 9 47e- 10 470-479 


1 / zz 


t>j> oo^R5F 


P450 ci vr\f*rfc\m i 1 \r cioncifiiTV 3 \f 
JT^+JL/ oUpcildlHliy olglldllliG v 


13 50 5 50e-09 479-490 


1 700 
1 / zz 


PI? 0046^14 


T3_r»1acc P450 trrmm T gi onsitnT'f* \/TTT 
£Z>~L'ld.DO JT*+JW glULip 1 olglldlUlC Y 111 


11 44 5 68e-10 469-479 

xx* i i */ • v/ w x w i \j s r / 


1722 


PR00465B 


E-class P450 group IV signature II 


17.22 7.92e-10 113-136 


1 TOO 
1 /ZZ 


rKUU4uD<^ 


n-ciass rn-jU group iv signature 111 




1722 


PR00465D 


E-class P450 group IV signature IV 


15.21 8.62e-16 390-406 


1722 


PR0Q465b 


E-class P450 group IV signature V 


1 *i A A1 ^ 40^-4^7 


1722 


T~1T» AA /I 

PR00465F 


E-class P450 group IV signature VI 


Ij.J / I.UUc-Ij 4^7"^ / 


1722 


TiTJ AA/1 £ZtZf~* 


E-class P450 group IV signature VII 


IS 71 Q 07p-1 & Afi^-AIQ 
lj./d o.\j / C - I U tUJ t / ^7 


1 TOO 

1722 


T>rj AA/1/rCTT 

rKUU4ojxl 


xi-ciass rQjv group iv signature vni 


10 0 05p-01 470-407 


1726 


lrr$UU3UUoA 


Immunoglobulin and major 
nisiocoiripaiiDiiiLy compieA uuiiidiii 


17 51 7 01-1 1 ^ 


1 TO»< 
1 /ZO 


lrrJUU^UUorJ 


immunogioDuun ana major 

rii o+ •~ii'»i'vrviT'\ci , f'i r\i li+x/ pr\HTnlPY AATYlilin 
nioLOUiJIIipclLlUlllLy L/tJilipiCA. UAJlllcllll 


00 0^ 0 80e-1 0 1 ^0-1 67 


1 707 

1 /z/ 


TPR001 0RS A 


Qtm QTYfr AlVfl A/Cltl /c\/tl 51TYf" AT1AT1T1 

o yiictp lupiiyoiij/ oyi idpLupui in 


12.19 1.00e-40 26-72 


1 797 
1 /Z / 


tpt*ooi oa5n 


oyiiapLUpilyo 1X1/ oyilaptup villi 


21 42 2 00e-35 73-104 


1 7")7 
1 /Z / 


tpt^ooi 0C5P2 


oyndpLopnybin/ byiiapLupui m 


21 42 7 86e- 12 72-103 


1 707 
1 /Z / 


rptinoi os^f* 

JLrr>UUlZo 


oynaptopnysm/ synapiopurin 


15 24 1 00e-40 108-148 


1727 


IPB001285D 


Synaptophysin/synaptoporin 


12.74 1.00e-40 149-183 


1 70*7 


TDDAA 1 OC^C 

IrrJUUlZoDri 


S 3/naptophys in/ synapt op orin 


0 55 1 00p-40 1S4-?^0 


1727 


IPB001285F 


Synaptoph3^sin/synaptoporin 


6.39 1.00e-40 231-275 


1727 


T"T»"T> A A 1 O 

IPB001285F 


S ynaptophys in/ synapt op orin 


t< 1 <Ac* OQ 04R 0Q0 


1727 


IPB001285F 


Synaptophysin/synaptoporin 


6.39 3.46e-09 242-286 


Mil 


IPB001285F 


Synaptophysin/synaptoporin 


'JO Q 1 1 0<5 ^AO 


1727 


IPB001285F 


Synaptophysin/synaptoporin 


6.39 9.83e-ll 263-307 


1727 


PR00220A 


Synaptophysin/synaptoporin family 
signature I 


1 a /en 1 /i Q 07 oq /i^ 

13. oy i.4ie-z / zj-4j 


1727 


PR00220B 


Synaptophysin/synaptoporin family 
signature II 


1 T AO O 11/, OO /IT TO 

1 i Sjd o.o le-Zo 4 /- /Z 


1727 


TVO AAOO A.O 

PR00220C 


Synaptophysin/synaptoporin family 
signature III 


1 1 R£ 7 OO^ 0^2 1 00 1 04 

ii. oo /.uue-zo iuu-iz*+ 


1 TOT 

1727 


T»T5 AAOO AT\ 

PR0l)2zUL> 


Synaptophysin/synaptoporin family 
signature i v 


10 60 R 1 ^p OR 1 ^0-1 55 


1 TOO 

1 /Z/ 


pponooor? 
rKUUZZUJi 


oynapiopnysin/synapLoporiii laiiniy 

olglldlulC V 


4 56 5 68e-23 199-217 


1 707 
1 /Z / 


PP 00761 A 


OlllUlll piCCUi oVJi olglldLUi C 1 


6.20 7.00e-10 287-303 


1 70S 
1 / Zo 


TPROOO^fiOr 1 


lIlViJlUL'Illl 


15.36 1.40e-09 114-155 


1 70a 
i /zo 


TPROOO^^Or 1 


1I1VU1UL/1111 


15.36 7.80e-09 105-146 


1 70S 
1 /Zo 


TPR00055RF 
Ir IjUVJZj J or 


T/T WT<0 H Amain 

1/ J_< VV 1_>V^ LlUllidlll 


14.64 7.33e-09 155-185 


1 TOO 

1 /zy 


TDRAH^ 1 17 
lroUUo lo / 


Drr\f an c/n QCCAoiotpH / D A 1 /I f\m <a 1 fl 

i rOlCdoC abbULldLLU /\^J UUllldlll 


22 40 2 86e-15 551-581 


1729 


PR00747C 


Glycosyl hydrolase family 47 signature III 


12.34 5.14e-ll 145-163 


1 Ton 

1729 


T>T3 A A T A T"E? 

PR00747ii 


Glycosyl hydrolase family 47 signature V 


14 R1 0 ORp-IR 005-040 \ 


1729 


PR00747H 


Glycosyl hydrolase family 47 signature VIII 


12.76 5.50e-17 326-346 


1730 


TJT3 A AO AA/""" 1 

PRU0209C 


Alpha/beta gliadin family signature III 


^^7 5 ^Qp-0Q 77-00 


1731 


IPB000276B 


Rhodopsin-like GPCR superfamily 


4.97 9.05e-09 192-203 


1 71 1 


TPT50^oo76^ , 


ixnoaopsin-iiKe urui\ supciidrniiy 


8 03 3 50e-ll 244-255 


1731 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 9.47e-14 285-301 


1731 


PR00237A 


Rhodopsin-like GPCR superfamily 
signature I 


9.81 2.42e-13 35-59 


1731 


PR00237B 


Rhodopsin-like GPCR superfamily 
signature II 


12.45 5.24e-16 68-89 


1731 


PR00237D 


Rhodopsin-like GPCR superfamily 
signature IV 


9.76 1.47e-ll 133-154 
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1 nn 1 
1731 


rKUUZJ /Jd 


Knoaopsin-uice urLK superiamiiy 
signature V 


1 1 Ol & H7p 111 24 907 


1 15 1 


T)T> AAO'2'7T7 


Rhodopsin-like GPCR superfamily 
signature VI 


\A. 14 9 RQp 1 7 914 9^R 


1 71 1 
1 Id 1 


PP00917f^ 


ixHOQopsin-iiKe ururs. superidmiiy 

cifmQtui'ja \/ 1 T 
blglldlUIC Vll 


1 0 91 8 1 1 p-1 0 97^-101 


1711 


px> 00944 a 


INCUIOiSJIlHl iCL/CpLUI olglldlUlC 1 


1 0 06 5 00e-1 S 9Q-40 


1 71 1 


ppoo944P* 


~\T/c» t i P"i nm rPAPti+Ar cirniotiirp TT 
INCUlUKuIllil ICwCpLUI M glial Ul C 11 


19 1 R 1 00p-11 S5-64 


1 711 


dd fiH7 AAO 


iNeuroKinin recepior Mgnaiure 111 


1 1 4R 9 S7p-1 9 1 0S-1 1 S 

1 J,tO Z/.O/C 1Z> 1 WO 1 1 o 


1 71 1 
1 / j 1 


pt?oo944ti 


1NCUI UKlIllIl ICOCpUJl blgllaLUlC 1 V 


7 47 0 1fip-14 19S-11^ 


1711 
i / D L 


PP00944F 


INCUlUKlIllIl ICL/CpiUl MglldlUiC V 


S 67 1 44p-1 1 1 R1-1 Q9 

. U / 1 .TtC 111 O 1 ^?z» 


1711 


PR 00944F 


1NCU1 UivLIllil ICL'CpLUl OlgllclLUlC V 1 


1 9 01 ft 9Qe-1 6 204-21 7 


1 711 

L / D L 


PR00944O 


"Wf»iTr*Vk"irn"n r & F*Y\i~ r\v ci onQtiifp \ATT 
INCUI Urvlllill ICl/CpiAJl olgllalUlc; Vll 


0 65 1 4Se-1 5 268-279 


1 71 1 


PP00944T-J 


TVTf^i lTr^L-i' tii n fpppntAt* ci cmntnre* \/TTT 
1NCU1 Ulvllilll ICv/CpUUl olglldtlllC V 111 


1118 5 50e-?6 994-1 1 2 


1 711 


PR01 095 A 


INCUIUlvlIllIl l>Jv^ ICUCpiUl blglldLUlC 1 


8 61 5 50p-1 6 9-15 


1 711 


PP01 095R 


\Tpi i r nl/'i n i n 'MT^'9 i*<ar'/a-i-\+ /***!• oioncitiirp TT 
lNCUrUJvllllll INJS^X ICL/CpLUl olgllalLU C 11 


19 90 7 1 1p-17 1 5-9R 


1711 


PP.01 095P 


IN GUI UJVlillll INJVZf ICV/CpiUl olgliCtLUlC 111 


1 R 51 9 91p-1 6 1 54-1 66 


1711 


PR01 095D 


"Wpi lrolH rri n ^TT<^9 i*pr*<= k 'ntri'r cionnturf* TA/ 

liCUl UJvtlllll lMVXf lCV>CpLUl OlglldtUJ. C 1 Y 


12 91 5 60e-18 168-182 


1 711 

L / d l 


PP01 095F 


INCUlUlVlIlUl 1NXVX lCOCpLUI blgllcttul C/ V 


S 95 7 91p-95 919-910 


1 711 


PP01 095F 


Mp>iii*r\1rini-n TvTTT9 rF^npntrxV cionntnrf* \/T 

1NCU1\JIV11H11 1N1VX ICLCpiUI OlglldlUl VI 


14 52 1 00e-17 257-269 


1711 


PR01095fr 


l^CUlUiVllllll INXS-il. lCL/CpL\Jl blgllClLUl C Vll 


9 85 6 65e-21 312-331 


1711 


TPR000Q1 7 A 


OUllctLaoC 


9 52 5 26e-10 44-55 


1734 


PR00464A 


Group II E-class P450 signature I 


19.27 7.26e-14 167-187 




TPT3 00001*7*2 


ojxiJr aomam 


10 66 52 14p 0Q 971-191 


1735 


IPB002097C 


Profilin 


22.96 4.04e-10 986-1040 


1 70 C 

1735 


IrrJUUZUy /U 


rroiilin 


ii oa ^ nso no qqi 1 04^ 


1735 


IPB003753G 


Exonuclease VII, large subunit 


31.72 8.85e-09 787-831 


1735 


PR00392C 


Profilin signature III 


14. 4Z z.zye-uy yoo-iuuo 


1735 


PR00392D 


Profilin signature IV 


10.64 4.35e-09 1008-1022 


1738 


TT"»T"» A A 1 1 o o 

IPB001138 


Fungal transcriptional regulatory protein, N- 
terminus 


11 n A ha. rvrv /TO 70 

12.17 9.79e-09 62-78 


1739 


IPB001509A 


NAD dependent epimerase/dehydratase 
family 


14.43 9.65e-09 379-400 


1741 


IPB003644J 


Domain found in Na-Ca exchanger and 
integrin-beta4 


11 £Z A C £>~1 ~ (\C\ CI 111 

13.64 5.67e-09 57-111 


1741 


PR00915G 


Luteovirus group 1 coat protein signature 
VII 


15.74 o.58e-10 303-324 


1745 


IPB001283A 


Extracellular proteins bCr/ Ipx-1/Ag5/PK- 

1/oC / 


1 iC O /C /C tin n 1 A OA /I o 

lo.Zo o.o/e-lU Zy-4o 


1 /4D 


TPT3001 OQ1T5 


extracellular proteins oLxjr/ipx-l/AgD/lrK- 

1 /Q/>7 
1/oC / 


u.uy z.^oe-iz d i- /z 


1 745 


TPR0019R1F 


C/Xtraceiiuiar proicino ov^r/ 1 px- 1 / j / 1 iv- 

If IJV / 


11 14 5 50p-90 197-147 


1745 


TPR001106F 


T-Trvrnp>rVhrvv ncQOPifitpri 1 Pi 7P1*ti <=» f 7^v\r\p'T 
xivjlliCL>U\JA cioo v^icii a^vi iwuv^iiic Ziippci 


11 71 6 00e-09 25-55 


1745 


PR00S17A 


All prnpn V^/TnY-1 -porn l"1 \r OKmotiirp T 

/Yiiergen v j/ ipA-i ictiiiiiy bigrid-iure i 


14 6Q1 56e-1457-75 

It.Uy l.JUC 11" D 1 1 D 


1745 


PR00S17P 


/viicrgcn vj/ ipA-i idiimy signature 111 


16 11 5 50p-90 196-149 


1745 


PR00838D 


Venom allereen 5 signature TV 


8.59 3.57e-09 57-75 


1745 


PR00838G 


Venom allergen 5 signature VII 


15.39 8.60e-18 125-144 


1749 


PR00672D 


Inhibin beta C chain signature IV 


10.52 6.40e-09 96-112 


1749 


PR01035I 


Tetracycline resistance protein signature IX 


13.24 3.86e-09 568-589 


1749 


PR01221G 


Rodent urinary protein signature VII 


12.11 8.68e-10 61-78 


1750 


IPB000885A 


Fibrillar collagen C-terminal domain 


11.46 8.16e-09 1852-1889 


1751 


IPB001991A 


Sodium :dicarboxylate symporter family 


10.78 2.53e-20 9-40 


1751 


IPB001991B 


Sodium:dicarboxylate symporter family 


14.40 2.29e-33 324-355 
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1 IK 1 
1 /D 1 




oouium.aicaruoxyiate symponer iamny 


Q Q1 1 /I Ho OA IT^ 1.QA 

0.4ve-Z*+ D / D-D^Hr 


1751 


PR00173B 


Glutamate-aspartate symporter signature II 


7.82 7.87e-12 19-39 


1751 


PR00173C 


Glutamate-aspartate symporter signature III 


1A /CT A A A a. T/C 107 I^T 

iu.o/ y.44e-zo oZ/"DDZ 


1753 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 6.54e-09 286-323 


1755 


IPB002617C 


Vesicular monoamine transporter 


16.23 5.03e-10 91-143 


1755 


PR01035E 


Tetracycline resistance protein signature V 


1 A O r ~! O An ~ AA 1 A 1 1 AO 

10.87 2.08e-09 101-123 


1755 


PR01035F 


Tetracycline resistance protein signature VI 


12.88 7.52e-09 332-352 


1757 


IPB000300B 


Inositol polyphosphate related phosphatase 
family 


15.64 1.93e-16 468-491 


1757 


IPB000300C 


Inositol polyphosphate related phosphatase 
family 


1 y| ^ r\ o tA„ 10 COO C /I O 

14.50 8.50e-12 538-548 


1757 


IPB000300D 


Inositol polyphosphate related phosphatase 
family 


29.87 4.5 le-1 8 589-623 


1757 


IPB000300E 


Inositol polyphosphate related phosphatase 
family 


15.50 1.43e-14 774-790 


1757 


IPB000492B 


Protamine 2 (PRM2) 


5.26 4.23e-09 83-117 


1758 


PR00169B 


Potassium channel signature II 


16.49 8.80e-28 239-267 


1758 


PR00169C 


Potassium channel signature III 


17.53 9,83e-ll 307-330 


1758 


PR01496C 


Shaker voltage-gated K+ channel family 
signature III 


9.37 6.92e- 12 202-213 


1758 


PR01496D 


Shaker voltage-gated K+ channel family 
signature IV 


9.74 3.25e-21 236-252 


1758 


PR01496B 


Shaker voltage-gated K+ channel family 
signature V 


ia q^t o i 1 „ i>t o <rn t?a 
lU.oo 2.1 le-14 Z39-Z/U 


1 /Do 


TlD A1 CAO A 


Kvl.2 voltage-gated K4- channel signature I 


<Q 1 «[7fi 1 T OTP. TOO 

o.Dy d.j /e- 1 / z /o-zyz 


1 T-TCQ 

1 /Do 


rKU10U9r> 


Kvl.2 voltage-gated K+ channel signature 

TT 

11 


o iq i 1*7 oq/1 m£ 


1759 


IPB000631B 


Uncharacterized protein family UPF003 1 


10.96 5.00e-12 100-112 


1 Ten 

i /39 




T T^nliiMin ntaMnaJ „ ^ „ ' 1 _ - T "TOT? A AO 1 

Uncnaractenzed protein tamily UPrUUol 


n 0-7 *7 7C„ i o oq'7 
lZ.o/ /./je-lZ j/0"jo/ 


1759 


Tr»T"> a a a/ti i *d 

lPB0UU631b 


Uncnaractenzeu protein tamily Urr UUo 1 


/.oz y.ZDe-11 4 J o-4uo 


1762 


IrBOOl /34A 


Sodium: solute symporter family 


1/1 Q Qlo 11 /in *\A 

14. o9 o.oje-lo 4U-D0 


1762 


lrB0U1734B 


Sodium: solute symporter family 


1 Q OX. K <f\a 1£ 1 • c \ c . 1 

lo.ZO D.jUe-Zo 


1762 


1PB001734C 


Sodium: solute symporter family 


1Q A1 O 1 /l-. 01 /1/tQ /ITT 

lo.Zl Z.loe-Zl 440-4/ / 


1 T/^"2 

1 /o3 


TDTJAAA/I Q1 

lJrr>UUU4o3 


Leucine rich repeat C-terminal domain 


1 1. 1 o z.zoe-iz jjo-j 1 is 


1763 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


20.23 5.15e-09 462-499 


1 T/£1 

1 /o3 


T>T> AAA1 Q A 


Leucine-rich repeat signature I 


1 1 TT *\ -^H^ 10 C£ QQ 

ii. /z D.jue-iz oo-yy 


1763 


PR00019A 


Leucine-rich repeat signature I 


11.72 8.91e-09 278-291 


1763 


T>T"> AAA 1 fit) 

PR00019B 


Leucine-rich repeat signature II 


1 1 /IO 1 A A/-k AO 1 AT 1 HA 

11.42 l.(JUe-Uo 1U/-1ZU 


1763 


T1TJ AAA 1 ATI 

PR00019B 


Leucine-rich repeat signature II 


1 1 /IO O 77„ 1 1 OO OK 

11.4z o.//e-ll oj-9o 


1763 


PR00019B 


Leucine-rich repeat signature II 


11.42 9.67e-10 275-288 


1764 


TT>T3 AAA 1 T A 

1PB000130 


Neutral zinc metallopeptidases, zinc- 
binding region 


loo 6.57e-09 294-304 


1764 


IPB001762A 


Disintegrin 


23.93 5.21e-10 241-281 


1764 


TT1DAA1 o /TO A 

1PB001862A 


Membrane attack complex 

r* /"^wi r'v/'^n ante fvyf^VifWl VI /r* rf"\TY"> t-\l r^-rvi £^r-^ "f- /""'Q 

L-uinpui lciiLo/ pci iui iil/ ^uiripieiiicrrc 


12.54 1.76e-10 416-431 


1764 


IPB002861B 


Reeler domain 


10.50 1.41e-09 422-450 


1764 


IPB002870D 


Reprolysin family propeptide 


16.31 6.25e-12 265-280 


1764 


IPB002870E 


Reprolysin family propeptide 


11.90 4.41e-12 296-308 


1764 


IPB002870F 


Reprolysin family propeptide 


18.81 3.73e-13 335-359 


1764 


PR00977A 


Scytalidopepsin B aspartic protease (A4) 
signature I 


7.30 9.11e-09 233-255 


1765 


PR00824A 


Hepatic lipase signature I 


8.114.44e-22 6-24 
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1768 


IPB000337A 


G-protein coupled receptors family 3 
(Metabotropic glutamate receptor-like) 


9.17 4.32e~14 4/oo 


1768 


IPB000337B 


G-protein coupled receptors family 3 
(Metabotropic glutamate receptor-liKe) 


24.51 9.90e-09 150-198 


1768 


IPB001828A 


Receptor family ligand binding region 


13.21 9.05e-09 74-86 


1768 


PR00248A 


Metabotropic glutamate GPCR signature I 


7.46 9.10e-lo 4ooo 


1768 


PR00248B 


Metabotropic glutamate GPCR signature II 


12.69 2.89e-17 75-90 


1768 


PR00248C 


Metabotropic glutamate GPCR signature III 


11.96 8.20e-24 yu-iuy 


1768 


PR00248D 


Metabotropic glutamate GPCR signature IV 


■t a ^ o r\ r\ r\ 1 c 1 CI 1 OO 

14.23 9.00e-15 153-1 fy 


1768 


PR01054A 


Metabotropic glutamate receptor 4 signature 
I 


7.72 1.22e-25 y-zo 


1768 


PR01054B 


Metabotropic glutamate receptor 4 signature 
II 


c\ n A i O/C O/C A A 

9.34 1.4/e-zo Zo-44 


1768 


PR01054C 


Metabotropic glutamate receptor 4 signature 
III 


ia 10 i 1 oo 111 1 47 
1U.J / l.lDe-ZZ lol-l-H- / 


1769 


IPB000008B 


C2 domain 




1769 


IPB000008C 


C2 domain 


91 17 9 SAp ^1 400-4^9 


1769 


IPB000008C 


C2 domain 


11 17 i c\Af± 1 n ?^^-^04 


1769 


IPB000008D 


C2 domain 


1/1 Ql 5 ^Q*=» 14 4 c , i S-47'} 


1769 


PR00360A 


C2 domain signature I 


k ifii nAo 1 n ^qr_41 A 


1769 


PR00360B 


C2 domain signature II 


1 1 1 11^ 1 9 497-440 
1 1 .04 Z.JJC" 1Z H-z. /"ttu 


1769 


PR00360B 


C2 domain signature II 


11. 04 o.ooc-1j z,yjL~j\jj 


1769 


PR00399A 


Synaptotagmin signature I 


i c a^ c 1 1 91^-9^1 
lO.LD j.DUe-11 ZjO-zj i 


1769 


PR00399B 


Synaptotagmin signature II 


1 /i in K ^ A*» in 9^1 -9^4 


1769 


PR00399B 


Svnaptotagmin signature II 


14. oU /.yze-iz joj-oi'o 


1769 


PR00399C 


Synaptotagmin signature III 


1 c oo 1 ^io 1 n ins 19^ 


1769 


PR00399D 


Synaptotagmin signature IV 


10 11 & Hp 1 A 19R-11R 
1Z. /Z O.JJC-1U jZO-JjO 


1770 


IPB000534A 


Semialdehyde dehydrogenase 


1 c. OO /I 91 «=> AQ 987-^04 

id. /y 4.zie-u:7 zo /-juf 1 


1771 


IPB001195 


Glycophorin A 


*7 CO £ D7o 1 A 71 7 74^ 


1771 


IPB001359H 


Synapsin 


11 CQ 7 QQo HQ 871 -Q9 1 


1771 


IPB001514D 


RNA polymerases D/30 to 40 Kd subunits 


1 A no Q A1a» AQ 1 19^-1 ^S9 

ly.yy y.4je-uy ldao-idj^ 


1771 


IPB002126A 


Cadherin domain 


1 /l £8 A 1 11 41 9-4.9 8 


1771 


IPB002126B 


Cadherin domain 


17 AA 1 *\A*3 AQ 919-940 
1Z.U4 l.DUc-U;? ADZs-A'-vy 


1771 


IPB002126B 


Cadherin domain 


1 9 AA 1 9^r_1 9 44R-46S 


1771 


IPB002126B 


Cadherin domain 


10 AA 7 O^o AQ 191-140 
1Z.U4 /.Zje-U^ IZj-IH-vj 


1771 


PR00205A 


Cadherin signature I 


17 00 c f.f. P in 7Q1-1 1 9 


1771 


PR00205A 


Cadherin signature I 


17 00 7 o/ip 1 1 7^_Q4 


1771 


PR00205B 


Cadherin signature II 


OA AQ 9 71*» HQ 1 ^^-1 f\A 


1771 


PR00205B 


Cadherin signature II 


oa aq a a^p» 1 n 460-4SQ 


1771 


PR00205B 


Cadherin signature II 


9A AQ S SAp 1 ^ 944-97^ 


1771 


PR00205C 


Cadherin signature III 


11 ^Q 7 7^#=» AQ 117-^9Q 


1771 


PR00205D 


Cadherin signature IV 


1 9 99 1 49r 1 4 41Q-ASS 
1Z.ZZ j.^Zc-ih hj^-hjo 


1771 


PR00205D 


Cadherin signature IV 


10 OO 1 Rio 19 99^-949 
1Z.ZZ J.oje-lZ ZZj-ztz, 


1771 


PR00205D 


Cadherin signature IV 


1 O OO £ A flo 1 9 ^44-^6^ 

iz.zz o.4ue-iz j^m-jdj 


1771 


PR00205D 


Cadherin signature IV 


10 00 /^71o 11 111 l^A 

iz.zz o./ie-ii 


1771 


PR00205D 


Cadherin signature IV 


1 0 00 o^1o 1A114 111 
IZ.ZZ /.Die-lU 114-Ijo 


1771 


PRQ0205.b 


v^aonenn signaiuic v 


10.82 4.86e-09 458-471 


1771 


PR00205F 


Cadherin signature VI 


19.57 1.95e-ll 408-434 


1771 


PR00205F 


Cadherin signature VI 


19.57 2.50e-15 513-539 


1771 


PR00205F 


Cadherin signature VI 


19.57 3.65e-12 300-326 


1771 


PR00205G 


Cadherin signature VII 


13.05 2.65e-09 334-351 


1771 


PR00205G 


Cadherin signature VII 


13.05 4.65e-ll 226-243 


1771 


PR00205G 


Cadherin signature VII 


13.05 7.91e-10 442-459 


1771 


PR00205G 


Cadherin signature VII 


13.05 9.57e- 14 547-564 
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1771 


t»t"» r\ 1 -1 c 1 T""\ 

PR01157D 


rz punnocepior signaiurg i v . 


16.03 5.42e-09 815-827 


1771 


PR01383B 


Claudin-10 signature II 


5.36 5.05e-09 724-736 


1774 


IPB000301B 


Transmembrane 4 family 


18.74 4.68e-26 492-527 


1774 


IPB000301D 


Transmembrane 4 family 


26.02 7.30e-23 662-702 


1774 


PR00259B 


Transmembrane four family signature II 


13.09 3.03e-17 486-512 


1774 


PR00259C 


Transmembrane four family signature III 


16 39 5 76e-19 513-541 


1774 


PR00259D 


Transmembrane tour tamily signature iv 


14.90 8.54e-18 676-702 


1775 


IPB000276A 


Rhodopsm-like GPCR superiamiiy 


11 56 1 95e-10 111-122 


1775 


IPB000276A 


Rhodopsin-like GPCR superfamily 


1 1 56 5 50e-09 199-210 


1775 


PR00237E 


Rhodopsin-like GPCR superfamily 
signature V 


13.03 3.90e-10 280-303 


1775 


PR00245A 


(Jliactory receptor signature i 


10.98 1.90e-10 85-96 


1775 


PR00245A 


Olfactory receptor signature I 


10.98 2.50e-10 173-184 


1775 


PR00245B 


Olfactory receptor signature II 


13.73 l.OOe- 10 210-222 


1775 


PR00245B 


(Jliactory receptor signature 11 


13.73 8.71e-09 122-134 


1775 


PR00245C 


(Jliactory receptor signature m 


14.65 4.91e-15 257-273 


1775 


PR00245D 


Olfactory receptor signature IV 


9.34 8.41e-13 317-326 


1775 


PR00534A 


Melanocortm receptor tamiiy signature i 


12.77 9.56e-ll 44-56 


1778 


IPB000920C 


Myelin P0 protein 


15.78 2.33e-09 160-212 


1778 


IPB001388 


Synaptobrevin 


76 07 7 fiOe-30 115-169 


1778 


PR00219A 


Synaptobrevin signature I 


10.78 1.35e-ll 118-137 


1778 


PR00219B 


Synaptobrevin signature II 


Q 01 5 ^(V-OQ 1 ^8-1 57 


1778 


PR00219C 


Synaptobrevin signature III 


6.91 2.20e-13 20-39 


1780 


IPB000353B 


Class II histocompatibility antigen, beta 
chain, beta-l domain 


in i/c A Q^p_1 7 1 77-726 


1780 


IPB000353C 


Class II histocompatibility antigen, beta 
chain, beta-l domain 


OH 1 1 0 1 8p-1 0 778-287 


1780 


IPB001039A 


Major histocompatibility complex protein, 
Class I 


17 17 1 OOp AO 1 5-68 


1780 


IPB001039C 


Major histocompatibility complex protein, 
Class I 


1Q 87 8 H8p-17 1^1-704 


1780 


IPB001039D 


Major histocompatibility complex protein, 
Class I 




1780 


IPB003006A 


Immunoglobulin and major 
histocompatibilitv complex domain 


17 q q 75p_1 8 108-720 

1 / ,Jl J.ZJC'IO 1-70 JU£*\J 


1780 


IPB003006B 


Immunoglobulin and major 
histocompatibility complex domain 


70 71 5 SOp-78 915-272 


1780 


IPB003363E 


Glycoprotein GCj/CjX 


n 75? 1 Sp-1 0 282-3 14 


1782 


IPB000084A 


PE N-terminus 


14 90 3 44e-09 644-691 


1782 


IPB000258G 


Bacterial ice-nucleation proteins octamer 
repeat 


8 61 4 96e-09 681-734 


1782 


IPB000258H 


Bacterial ice-nucleation proteins octamer 
repeat 


10.13 6.03e-09 719-773 


1782 


IPB000765 


GTP1/OBG tamily 


76 91 9 84e-10 449-492 


1782 


IPB000790B 


Air synthase alpna suounit, ^-lermmai 


10.45 9.64e-10 424-467 


1782 


IPB001140A 


ABC transporter transmembrane region 


21.73 2,42e-18 437-483 


1787 


TPB001140B 


ABC transporter transmembrane region 


15.62 7.70e-12 555-593 


1782 


IPB001482B 


Bacterial tvoe II secretion system protein E 


12.05 3.70e-09 447-469 


1782 


PR00300A 


ATP-dependent Clp protease ATP-bindmg 
subunit signature I 


7.82 8.27e-09 451-469 


1782 


PR00449A 


Transforming protein P21 ras signature I 


12.48 1.67e-09 449-470 


1782 


PR00671B 


Inhibin beta B chain signature II 


4.29 9.41e-09 396-415 


1783 


IPB000175A 


Sodiummeurotransmitter symporter family 


16.29 1.00e-40 211-260 


1783 


IPB000175B 


Sodium:neurotransmitter symporter family 


19.12 3.37e-32 298-332 
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1783 


IPB000175C 


Sodium:neurotransmitter symporter family 


15.09 3.75e-13 385-436 


1783 


IPB000175D 


ooaium. neurotransmitter syinpuiLci laumy 


23.45 1.00e-40 451-503 


1783 


IPB000175E 


Sodiumtneurotransmitter symporter family 


21.88 1.67e-30 544-583 


1783 


IPB000175F 


Soaium:neurotransmiuer symporter iamiiy 


25 63 1 00e-30 639-678 


1783 


IPB000175G 


Sodium neurotransmitter symporter family 


16.18 6.32e-20 700-722 


1783 


PR00176A 


Sodium/chloride neurotransmitter 
symporter signature I 


16 07 S 50e-24 21 1-232 


1783 


PR00176B 


Sodium/chloride neurotransmitter 
symporter signature II 


7 07 1 1 240-259 


1783 


PR00176C 


Sodium/chloride neurotransmitter 
symporter signature III 


10 57 ^ 16p-?^ 283-309 


1783 


PR00176D 


Sodium/chloride neurotransmitter 
symporter signature IV 


9. 06 7 0?p-?1 41 2-429 


1783 


PR00176E 


Sodium/chloride neurotransmitter 
symporter signature V 


11 14 ^ S4p-1 0 404-514 


1783 


PR00176F 


Sodium/chloride neurotransmitter 
symporter signature VI 


11110 57p-90 54R-567 


1783 


PR00176G 


Sodium/chloride neurotransmitter 
symporter signature VII 




1783 


PR00176H 


Sodium/chloride neurotransmitter 
symporter signature VIII 


1 5 04 4 5?p-1 6 670-690 


1783 


PR01204A 


Glycine neurotransmitter transporter type 1 
(GLY 1-1) signature 1 


15 111 ?5p-1 S 1 R7-200 


1783 


PR01204B 


Glycine neurotransmitter transporter type 1 
fGLYT-n signature II 


1? 80 1 17e-15 201-211 


1783 


PR01204C 


Glycine neurotransmitter transporter type 1 
(vjrLr Y i - 1 ) signature in 


1? 14 1 60e-16 335-347 


1783 


PR01204D 


Lriycine neurotransmitter transporter iypc i 

fdX VT 1 > \ aicmniirrf* TV 


10.51 6.86e-18 349-364 


1784 


lrr>UUloz/is. 


oema uonidiii 


13.76 3.02e-09 662-674 


1784 




x^iexin repeal 


14.72 1.15e-10 662-674 


1 no a 

17o4 


T>r> A1 O 1 Q A 


filial r»p»ll linf^-Hi^ri \/f s H npnrA+rAT>Viic factor 

vf*r>r*r\tr\r alrVha 1 <?i tmatllfft T 


3.85 7.08e-09 27-39 


l/oD 


1F15UUU04 / A 


rnpP/MF-T fnrniW 


11.22 1.00e-40 644-693 


1 /OJ 






10.06 2.14e-30 694-718 


1 70C 

1 /oD 


lrrSUUUo4 


PTF/MF-T familv 

\_/ 1 V I IN JT -1 Idlllllj 


9.27 1.00e-40 724-765 


1785 


lJrr>UUUo4 /JJ 


rTp/vru T fbmilv 

^ir/rsr-i iaiiiiiy 


12.61 1.20e-32 766-790 


■nor 

1785 




OlrVlN.r-1 ldmny 


12.21 9.18e-38 791-821 


1 /CO 


lr*13UUU4o3 


T <=»n<-»ini=» nVVi renpat C^-fprminal domain 

X-fCLlL'lIlC I1L>11 IGJJt/dL i will ii 11 ell uuiuciiii 


11.18 4.54e-10 675-689 


1786 


PR00019A 


Leucine-rich repeat signature I 


11.72 4.67e-10 243-256 


1786 


T>T> AAA1 A A 


Leucme-ricn repedi bigiidiuic i 


11.72 6.45e-09 434-447 


1786 


PR00019B 


Leucine-rich repeat signature II 


11.42 4.27e- 11 431-444 1 


1786 


PR00019B 


Leucine-rich repeat signature II 


11.42 5.09e-09 479-492 


1786 


PR00019B 


Leucine-rich repeat signature II 


11.42 7.55e-09 597-610 


1786 


PR00019B 


Leucine-rich repeat signature II 


11 49 Q 64e-09 240-253 


1786 


PR00364D 


Disease resistance protein signature IV 


14.89 4.86e-09 188-204 




r>r> AA7AOT7 


A fri-flavin rpcH^tanop nrotein familv 

./YCO 11 drill i CSlOLdllV/V/ LAlV/l^lll -LCllllllJ' 

signature VI 


14.03 1.46e-17 49-72 


1789 


PR00702G 


Acriflavin resistance protein family 
signature VII 


10.67 6.00e-21 74-97 


1789 


PR00702H 


Acriflavin resistance protein family 
signature VIII 


14.64 3.67e-14 153-170 


1789 


PR00702I 


Acriflavin resistance protein family 
signature IX 


16.75 1.39e-15 216-230 



WO 03/054152 PCT/US02/39555 



Table 3B 
447 



SEQ 

ID 

NO: 
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entry ID 


Description 


Results* 


1 7nn 


rKUU /9 /r 


otreptopam i uj cysteine protease lamuy 
signature VI 


1 1 . oy o.o iG-\jy *+i-oz 


1 ^701 

1 fy 1 


lr d U U U o4 y U 


vjip i iamiiy 01 transporters 


99 1Q Q 9Sp-1 1 ^S_01 


1791 


IPB000849E 


GlpT family of transporters 


19.05 2.88e~36 108-145 


1 nc\ 1 

1791 


TT3TD A A A O A C\X7 

lr\BUUUo4yr 


GlpT family of transporters 


1 ^ o£ s nop 90 1 69_i 70 
id.uo o.uue-zw loz-i /y 


1792 


lrrJUUUyi /C 


bultatase 


n 19 zi /^7f» 1 1 *\Qi aoi 


1792 


1PB003661A 


His Kinase A domain 


G 1*; 1 £A*» AO OAA 0^1 


1796 


1FBU002U3A 


Lrrb domain 


1 s zin a fs'Xo 1 9 on^ q^^ 
lo.^u o.Ojc-iz yuj-7Jj 


1796 


lrB(J002U3r> 


Oro domain 


OS Q 10p 19 1 AAQ 

iD.yo y.ize-iz i7oo-iuuy 


1796 


IPB000483 


Leucine rich repeat C-terminal domain 


11.18 6.04e-ll 371-385 


1 HC\£L 

1796 


TDDAAnO'3 I/" 4 

IPtsOUOoizC 


G-protein coupled receptors family 2 
(secretin-like) 


1Q ^"5 7 HQ oss inn 
ly.Dj? / . jDQ-\jy yoo- iu i / 


1796 


PR00019A 


Leucine-rich repeat signature I 


1 1 TO 1 AA#a AO "31 ^ ^9R 

ii. /z i.uue-uy j i jozo 


1796 


PR00019B 


Leucine-rich repeat signature II 


11.42 8.00e-10 312-325 


1798 


IPB002208C 


SecY protein 


1^ C1 /! 1 £T« 1 "7 Q A ^A 

Id.jI 4. lje-1/ ov-j\) 


1798 


IPB002208D 


SecY protein 


22.30 2.29e-36 82-118 


1798 


PR00303C 


Preprotein translocase SecY subunit 
signature III 


15.05 7.84e-19 34-57 


1798 


PR00303D 


Preprotein translocase SecY subumt 
signature IV 


15.48 1.95e-22 72-97 


1798 


PR00303E 


Preprotein translocase SecY subunit 
signature V 


14.43 7.19e-23 98-lzl 


1798 


PR00825B 


Vespid venom allergen phospholipase Al 
signature II 


13.74 9.40e-09 234-254 


1800 


PR00081A 


Glucose/ribitol dehydrogenase family 
signature I 


10.07 9.70e-12 179-196 


1800 


PR00669B 


Inhibin alpha chain signature II 




1801 


IPB000109C 


PTR peptide transporters (PTR2) 


o.zl J./4e-13 1Z-Z4 


1801 


TflT^ A A A 1 AAT\ 

IPB000109D 


PTR peptide transporters (PTR2) 


25.09 l.OUe-zy 464-511 


1802 


IPB000033B 


Low-density lipoprotein (ldl) receptor, 
YWTD repeat 


7.05 4.96e-ll 494-504 


1802 


IPB000152 


Aspartic acid and asparagine hydroxylation 
site 


o o^r i a n ^ 1/1 /ion ca/1 
o.oo 1.4 /e-14 4oy-DU4 


1802 


1PB001169K 


Integrin beta, C-terminus 


0*7 /t ^ O OAo 1 A Q'TA 1 A 1 0 

Z/.4D z.yue-iu y /u-iu iz 


1802 


IPB001774C 


Delta serrate ligand 


18.25 2.19e-09 973-1015 


1802 


1PB001881B 


Calcium-binding EGF-like domain 


lO OC ^ ACq 1 1 /ICO <AA 

iz.Zo o.ooe-ii 4oy-DUU 


1802 


1PB001969A 


Eukaryotic and viral aspartic protease active 
site 


lo.o/ /.uue-iu 3 /o-jy4 


1802 


t>t> nn a 1 
rKUUUlUC 


lype 11 jiOr-lilce signature ill 


£ QC ^ 1 A/a 1 A AQ/1 <IAA 

o.ys /. iue-iu 4y4-DU4 


1802 


rKUU /92B 


Pepsin (Al) aspartic protease family 
signature II 


10 i^o. ^^i 

1Z.OD D.ZDQ-LD OJ1-J44 


loUZ 


rKUU /yZC 


Pepsin (Al) aspartic protease family 
signature III 


R Q QIp 11 ISA 1Q1 


loUo 


tpraaa79q a 
ir .t>uuu /z^a 


Jr JMr -ZZ/ liJVlr / JVLr ZU iami ly 


10 70 1 OSp 00 1-99 


loUo 


TPT3AAA77Q A 
IrJDUUU / ZyJ\ 


T>A/TD OOARA/rDAMT^O family 

rMr-ZZ/JiMr/JVlrzu iamiiy 


1 Q 70 9 90<= 1 0 1 S 10 


1 QA1 

loUo 


TDT4AAA'7'1QT3 
lr JdUUU /ZylS 


DA/TD OO AC A /f PAN /TP 9 A -fb-miKr 

r JVlr-ZZ/liJVlr /IVLr ZU iamiiy 


11^6^ ^Rp 1 0 AQ 
lO. 'O J.OOo-lU i f!?-Ji' 


1 801 


TPR000729C 


PMP-99/FMP/MP20 familv 


37 83 3 47e-23 87-139 


1803 


IPB000729C 


PMP-22/EMP/MP20 family 


37.83 7.83e-22 80-132 


1803 


IPB000729D 


PMP-22/EMP/MP20 family 


18.96 4.94e-18 160-187 


1803 


PR01077B 


Claudin signature II 


14.12 8.80e-10 49-55 


1803 


PR01077C 


Claudin signature III 


13.60 9.43e-12 63-73 


1804 


IPB001717I 


Anion exchanger family 


7.53 2.07e-ll 18-58 


1804 


IPB001717L 


Anion exchanger family 


14.31 9.28e-14 84-130 


1804 


IPB001717M 


Anion exchanger family 


19.47 6.23e-32 133-185 



wo 
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SEQ 

ID 

JNO: 


Database 
entry ID 


Description 


IvtJSUHS 


1 QCiA 

loU4 


TDTJOftl 91 7\T 
IrxJUUI / 1 /IN 


/\mon excndnger idiiniy 


14 40 1 00e-30 183-208 


1 Qf\A 

1SU4 


T"D"QAA1 11 1C\ 


A m An o.'vr^n o-f^ rr*at* *ro Willi/ 

/vnion excnanger iamuy 


10 14 4 76e-l 6 209-236 


1804 


IPB001717Q 


Anion exchanger family 


17.70 5.77e-09 232-273 


1 OA/1 

loU4 


JrKUU iojr 


Anion excnanger iamuy signature v i 


14 7^ 1 50p-1 1 42-49 


1804 


PR00165J 


Anion exchanger family signature X 


6.53 4.18e-09 127-135 


1804 


T»T> c\f\ 1 ^rv 

PR00165K 


Anion exchanger family signature XI 


19 74640p~11 1^6-164 


1804 


PR00165M 


Anion exchanger family signature XIII 


9.29 7.26e-12 255-265 


1804 


PR01120D 


Plant CLC chloride channel signature IV 


in 6 A6,t* OQ '['XA 141 

iu.33 o.4oe-uy 


1804 


PR01231E 


HC03- transporter superfamily signature V 


8.87 4.38e-15 40-52 


1804 


PR01231H 


HC03- transporter superfamily signature 
VIII 


1 O 1 £L C\ 1 1^111 1 O/l 

lz. lo y. loe-1 J lll-lz4 


1805 


IPB001394A 


Ubiquitin carboxyl-termmal hydrolase 
family 2 


lz.o4 3. loe-lo /oo-oU3 


1805 


IPB001394B 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


17.87 3.05e-21 1215-1244 


1805 


IPB001394C 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


8.52 3.50e-10 1290-1299 


1805 


IPB001394D 


Ubiquitin carboxyl-terminal hydrolase 
family 2 


y.iy4.oue-iu iJUo-ui/ 


1806 


PR01130C 


Delayed-early response 
protein/equilibrative nucleoside transporter 
signature III 


7.16 6.00e-09 187-210 


1806 


PR01130E 


Delayed-early response 
protein/equilibrative nucleoside transporter 
signature V 


14.23 o.u/e-iu 3uy-33U 


1806 


PR01130F 


Delayed-early response 
protein/equilibrative nucleoside transporter 
signature VI 


1A "2 </1fl 1 1 I'll "3^/1 

j. 34 3.D4e-l 1 33 /-3D4 


1807 


1PB001919B 


Cellulose-binding domain, bacterial type 


14 99 9 Q7p~0Q 104-195? 


1810 


IrBOOioozA 


General substrate transporters 




1811 


PR01228C 


Eggshell protein signature III 


5.69 3.90e-09 51-66 


1813 


IPB000260A 


NADH-ubiquinone oxidoreductase chain 4, 
amino terminus 


19 ^9 9 SO<=. OQ 1A 40 


1813 


1PB 000260 A 


NADH-ubiquinone oxidoreductase chain 4, 
amino terminus 


n o 4 i a 1 Q_S9 

IZ.^Z *+.OOC- 1U 17-JZ 


ion 

1813 


TDDflAA/1 A A "D 


Xanthine/uracil permeases family 


99 44 4 9 1 p_0Q S 1 -04 


1813 


TVIT~> riAA/l /t /I'D 

1PB000444B 


Xanthine/uracil permeases family 


99 44 4 ^9^-00 4^-86 


1813 


tddaaa/co/it 
lPB000oo4L 


Eukaryotic RNA polymerase II 
heptapeptide repeat 


^ 40 7 9Sp-0Q S41-RS^ 


ion 

1813 


IrBOOOozz 


Ainc linger^ K^ztiz type 


14 67 1 00p-1 S 979-907 


1 Ol'l 

lo i 3 


IrtsVvvoZZ 


Z/inc linger, y^ztiz type 


14 67 9 71p-10 41 6-441 


lol3 


T"DD AAAQOO 


Zjinc linger, L^zriz type 


1 4 67 7 1 Qp-1 4 4S7--S 1 9 


1011 
1813 


TTIDAAAOOT 


Z/inc linger, ozriz type 


1 4 67 S 00p- 1^51 *>-S40 


1813 


TnDAAAOTl 

lPBUUUozz 


z-inc linger, i^ztiz type 


14 67 S 1 1p-19 ^00-^9^ 


1813 


lPBOUOozz 


/-» An/ran rt^TT 1 ! i. tma 

>6inc linger, UzriZ type 


14 67 0 0^p-19 77^-70S 


1813 


IPB00145 / 


NADH-ubiquinone/ pi asto quinone 
oxidorediipta^p pliain 6 


1Q 96 ^ R1 p_00 S-46 


1813 


PR00048A 


C2H2-type zinc finger signature I 


9.94 6.25e-09 770-783 


1813 


PR00048A 


C2H2-type zinc finger signature I 


9.94 9.53e-ll 269-282 


1813 


PR00048B 


C2H2-type zinc finger signature II 


5.52 6.79e-10 285-294 


1813 


PR00219C 


Synaptobrevin signature III 


6.91 6.34e-09 33-52 


1815 


IPB000001D 


Kringle 


11.31 2.02e-13 53-69 


1815 


IPB000001H 


Kringle 


12.24 5.29e-ll 318-328 


1815 


IPB000126A 


Serine proteases, V8 family 


11.75 5.07e-l 153-68 
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1815 


TTlT^AAA 1 ^^~\ At 

IPB000177M 


Apple domain 


y. lo o.jje-io ZZo-ZOU 


1815 


IPB000177N 


Apple domain 


1 A 1*7 C 01 „ 1 C *i AO O/IO 

10. 1 / 5.zle-lZ> J(Jo-34Z 


1815 


IPB001254A 


Serine proteases, trypsin family 


9.98 1.00e-15 53-69 


1815 


IPB001254B 


Serine proteases, trypsin family 


15.01 6.29e-18 316-339 


1815 


PR00722A 


Chymotrypsin serine protease family (SI) 
signature I 


12.06 9.28e-15 54-69 


1815 


PR00722C 


Chymotrypsin serine protease family (SI) 
signature III 


10.74 8.88e-13 315-327 


1815 


PR00839B 


V8 serine protease family signature II 


11.20 6.42e-09 53-70 


1817 


IPB003819A 


Taurine catabolism dioxygenase 
TauD/TfdA 


10.95 3.03e-09 200-207 


1817 


IPB003819B 


Taurine catabolism dioxygenase 
TauD/TfdA 


21.71 5.64e-26 222-255 


1817 


IPB003819C 


Taurine catabolism dioxygenase 
TauD/TfdA 


11.87 2.74e-17 288-304 


1817 


IPB003819D 


Taurine catabolism dioxygenase 
TauD/TfdA 


27.59 4.68e-38 319-359 


1819 


IPB000387 


Tyrosine specific protein phosphatase and 
dual specificity protein phosphatase family 


1 A 11 1 IOq H 1/fO/I 1/1/1/1 

1U. //Z.3oe-13 1434-1444 


1819 


PR00014C 


Fibronectin type III repeat signature III 


1/1 /f7 /l AAd AO ^GQ ^17 


1819 


PR00014C 


Fibronectin type III repeat signature III 


1/1 /f7 /I Q£« AO /^SS ^A/^ 

14.4/ 4.ooe-uy ooo-/uo 


1819 


PR00014C 


Fibronectin type III repeat signature III 


14.47 5.00e-10 1056-1074 


1819 


PR00014C 


Fibronectin type III repeat signature III 


1 A AH H AA n AO QtZ£l QC/1 

14.4/ /.UUe-l>y oOD-oo4 


1819 


PR00213E 


Myelin P0 protein signature V 


^ ^ 1 1 AO 1 1 Q£. 1 11 A 

.01 i.ooe-uy iioo-iziu 


1819 


PR00700A 


Protein tyrosine phosphatase signature I 


£1 f\C C OAa IO 1*2 A/C 1HO 


1819 


PR00700C 


Protein tyrosine phosphatase signature III 


10 on ^ oOfl 1*7 1 ion 1/i a*7 

13. oy /.ooe-i/ I3yu-i4u/ 


1819 


PR00700D 


Protein tyrosine phosphatase signature IV 


11 on 1 C/Co OA 1/f"3.1 1 /l/LQ 
IZ.oJ 3.o0e-ZU 1431-144y 


1819 


PR00700F 


Protein tyrosine phosphatase signature VI 


ia 11 /i *71 « 10 1/1*7© 1/tQQ 

1U.33 4. /le-lZ 14/G-14oo 


1819 


PR01371E 


Salmonella/Yersinia modular tyrosine 
phosphatase signature V 


11.71 5.98e-12 1432-1443 


loZU 


TT>T* A A"i C/1 0 A 


vacuolating cyioioxm 


7 11 0 ?Se-O n 100-146 


1821 


IPB000276D 


Rhodopsin-like GPCR superfamily 


9.40 9.65e-09 405-421 


1821 


PR00245D 


Olfactory receptor signature IV 


9.34 1.53e-13 359-368 


1821 


PR00245E 


Olfactory receptor signature V 


8.96 6.88e-13 406-417 


1822 


IPB003861B 


E4 protein 


9.06 1.00e-08 34-48 


1822 


PR00494E 


Fanconi anaemia group C protein signature 
V 


10.42 8.26e-09 621-640 


1822 


PR01039A 


Tryptophanyl-tRNA synthetase signature I 


14.70 9.59e-09 200-216 



* Results include in order: accession number subtype; raw score; p-value; position of signature in 
amino acid sequence 
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Repeats 


Position 


912 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


1 A ~ C A 

z.4e~54 


1 CiA f\ 

194.0 


l 


1Z4-ZZ / 


912 


GST_C 


Glutathione S-transferase, 
C-terminal domain 


6,2e-25 


92.6 


2 


1-45:410-525 


912 


GSTJNT 


Glutathione S -transferase, 
N-terminal domain 


l>4e-16 


62.5 


1 


<-> r\ A a r\r\ 

324-400 


913 


tsp_l 


Thrombospondin type 1 
domain 


2.9e-23 


90.7 


2 


124-174:181-231 


913 


EGF 


EGF-like domain 


1.6e-20 


81.6 


7 

— 


474-508:514- 
553:559-591:59/- 
ojj.oyz- /zu.ouz- 


914 


MIF 


: : : 

Macrophage migration 

inhibitory factor (MIF) 


4.De-DU 


L /y.o 






915 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


6.1e-68 


239.1 




2-129 


916 


trypsin 


Trypsin 


3.8e-99 


314.8 




445-675 


916 


kringle 


Kringle domain 


4.2e-45 


163.3 


■ 


286-367 


916 


fh2 


Fibronectin type II domain 


4.5e-35 


94.9 




108-148 


916 


EGF 


EGF-like domain 


6.2e-15 


63.0 




164-197:245-278 


916 


fhl 


Fibronectin type I domain 


5.2e-ll 


45.0 




202-237 


917 


laminin_EGF 


Laminin EGF-like 
(Domains III and V) 


2.3e-158 


539.5 


12 


282-346:349- 

409:412-469:472- 

520:523-565:786- 

831:834-877:880- 

925:928-984:987- 

1036:1039- 

t r\r\ a 1 r\.r\c 1 1 

1094:1095-1139 


917 ; 


laminin_Nterm 


Laminin N-terminal 
(Domain VI) 


5.9e-105 


362.1 


1 


45-280 


918 


EGF 


EGF-like domain 


3.2e-53 


190.2 


9 


33-68:74-1 10:116- 
151:161-197:208- 
243:247-282:288- 
323:329-362:368- 

A f\A 

404 


918 


CUB 


CUB domain 


Lle-16 


68.8 


1 


708-817 


919 


Parathyroid 


Parathyroid hormone family 


3.1e-82 


286.6 


1 


46-175 


920 


EGF 


EGF-like domain 


2.2e-22 


87.8 


9 


2-36:47-82:88- 
122:128-163:232- 
Ibo.Z /0-3UO.O iy- 
349:353-394:407- 
4jy 


921 


IL8 


Small cytokines 
(intecrine/chemokine), inter 


2.7e-09 


35.4 


1 


25-65 


922 


Zn_carbOpept 


Zinc carboxypeptidase 


3.1e-119 


409.5 


1 




922 


Propep_M14 


Carboxypeptidase activation 
peptide 


5.6e-20 


79.8 


1 


7-84 


923 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


i So fn 
i.oe-o / 


ZD 1 .0 


i 
i 




927 


FKBP 


FKBP-type peptidyl-prolyl 
cis-trans isomeras 


1.3e-44 


149.6 


2 


40-111:112-122 


930 


FAD_binding_4 


FAD binding domain 


6.5e-89 


308.8 


2 


11-176:186-381 


930 


FAD-oxidase_C 


FAD linked oxidases, C- 
terminal domain 


2.6e-63 


223.8 


1 


383-647 


933 


Fringe 


Fringe-like 


L4e-06 


-12.8 


1 


262-487 



WO 03/054152 PCT/US02/39555 



Table 4A 
451 



SEQ 
ID 


Model 


Description 


E-value 
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Repeats 
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937 


14-3-3 


14-3-3 protein 


l.le-124 


427.6 


1 


58-282 


938 


S_100 


S-100/ICaBP type calcium 
binding domain 


3.6e-23 


90.4 


1 


65-108 


938 


efhand 


EF hand 


A A A 1 1 

0.0012 


OK K 
ZJ.J 


i 


\\A 1/19 
1 l*f~ I'tZ 


940 


ig 


Immunoglobulin domain 


1.2e-33 


113.5 


5 


38-105:139- 

9fi6«9A1 901*196- 
zuo.z^f i-zy j . jzo- 

177-.41 9-487 


941 


LRR 


Leucine Rich Repeat 


3.6e-30 


113.7 


8 


42-63:64-87:88- 
1 19-1 1 1-1 16*1 17- 
160-161-184*1 85- 

1UVJ. lUl lO^. 1 Ow> 

208*209-232 


941 


LRRCT 


■ r-r 

Leucine rich repeat C- 

terminal domain 


1 Of- 07 
1 .Zc-U / 


75 Q 
JO.7 


1 


242-292 


943 


AMP-binding 


AMP-binding enzyme 


i.ze-io 


69 


I 


82-159 


944 


serpin 


Serpin (serine protease 
inhibitor) 


Hp. 1 ^ A 

/e- 1 


tjj. i 


I 


173-532 


947 


r> T TXT 

KUJN 


RUN domain 


OC"'T t t 


1 59 0 


I 


31-163 




r Y Vii 


r x ve zinc linger 


1.6e-18 


70.7 


1 


529-576 


950 


lectin c 


Lectin C-type domain 


2.4e-08 


41.1 


1 


130-234 


952 


Glyco_transf_8 


Glycosyl transferase family 

Q 
O 


0.01 


-47.7 


1 


1255-1526 


953 


TPR 


TPR Domain 


0.0044 


23.7 


1 


171-204 


956 


Branch 


Core-2/I-Branching enzyme 


< ^t* 1 fi9 


159 9 


1 


74-306 


957 


fn3 


Fibronectin type III domain 


1 1 #=> 1 ^ 

i . ie- ld 




i 
i 


30-108 


958 


PeptidaseJMlO 


Matrixin 


5.9e-121 


415.2 


1 


58-225 


958 


hemopexin 


Hemopexin 


ze-jo 


9H7 ^ 
ZU /.3 


A 
H 


105-147-140- 

393:398-445:447- 

487 


962 


chromo 


'chromo' (CHRromatin 
Organization Modifier) 


5.5e-09 


35.7 


1 


64-84 


964 


ig 


Immunoglobulin domain 


i .oe-uj 


99 6 
zz.u 


1 

1 


34-1 15 


967 


7tm_l 


7 transmembrane receptor 
^rnoaopsm iamnyj 


z . i e-zo 


oo.u 


9 


41-154-161-237 


968 


adh_zinc 


Zinc-binding 
dehydrogenases 


7.4e-60 


212.3 


1 


179-495 


970 


El_dehydrog 


Dehydrogenase El 
component 


9 1p 1 SI 

z. le-ioo 


699 7 
ozz. / 


1 
1 


129-425 


971 


ig 


Immunoglobulin domain 


0 1p-1 1 


1Q 7 




45-122 




rr-F A A T V 

ZI-JVliZ, 


iviiZj zinc linger 


1 9p-19 

J.ZrC— J 4* 


120.5 


1 


279-331 


973 


bAr 


SAP domain 


/ .HC-UU 


19 0 




2-36 


975 


WD40 


WD domain, G-beta repeat 


5e-08 


40.1 


3 


234-270:447- 
483:491-528 


977 


An peroxidase 


Animal haem peroxidase 


5.6e-140 


478.4 


1 


137-701 


978 


Defensin_propep 


Defensin propeptide 


1.9e-17 


71.4 


1 


32-82 


978 


defensins 


Mammalian defensin 


8.6e-05 


23.3 


1 


95-123 


981 


ig 


Immunoglobulin domain 


3.6e-06 


24.9 


1 


49-130 


982 


ig 


Immunoglobulin domain 


1.2e-09 


36.1 


1 


36-112 


985 


tsp_l 


Thrombospondin type 1 
domain 


W.UU / D 


17 1 


i 
i 


151-206 


987 


LRR 


Leucine Rich Repeat 


l.le-58 


208.4 


12 


78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:270-293:294- 

317:318-341:342- 
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Pncitinn 














365 


988 


fer4_NifH 


4Fe-4S iron sulfur cluster 
binding proteins 


U.UU1 j 


ICQ 


1 
1 


1 1 f)_1 


988 


ParA 


ParA iamily A 1 Pase 


U.UUO 


-U.o 


1 
1 


177-98^ 

i / /-ZOj 


989 


vwd 


von Willebrand factor type 
D domain 


l.oe-o / 


1 1A Q 

lo4.y 


1 
1 




989 


vwc 


von Willebrand factor type 
C domain 


5.3e-JS 


no a 


D 


SO 1 OS- 1 08- 

163:166-224:238- 

980-^01-^57 


989 


TIL 


m ~ 

Trypsin Inhibitor like 
cysteine rich domain 


o.ye-uo 


^9 6 


1 
1 




992 


ig 


Immunoglobulin domain 


Q /C P 17 


SO 0 
jy.\J 


2 


34-1 1 1-150-220 


994 


ig 


Immunoglobulin domain 


1.6e-28 


97.0 


3 


43-110:143- 
197:230-291 


995 


SCP 


S CP-like extracellular 
protein 


4.7e-41 


149.8 


1 


4-173 


996 


ig 


Immunoglobulin domain 


8.9e-06 


23,7 


2 


73-123:153-206 


999 


RNase PH 


3' exoribonuclease family 


1.5e-28 


108.3 


1 


264-469 


1000 


ig 


Immunoglobulin domain 


2.6e-05 


22.2 


1 


38-102 


1002 


zf-C2H2 


Zinc finger, C2H2 type 


7.6e-54 


192.3 


7 


183-205:211- 
233:239-261:267- 
289:295-317:323- 
345:351-373 


1003 


zf-C2H2 


Zinc finger, C2H2 type 


2.1e-114 


393.5 


16 


260-282:297- 

319:325-347:353- 

375:381-403:409- 

4 J 1 .4J /-4Dy ,4oD- 

487:493-515:521- 

C/11.^4Q ^91 -S77 
D4o . D4y- J I Y.J 1 /- 

599:605-627:633- 

6SS-661 -68*3 -680- 

DJ J . OD 1 UOJ ,U07 

711 


1007 


LIM 


LIM domain 


1.5e-37 


1 1Q 1 
LDO.Z 


D 


^QO 448-4S0 1 
S07-S10-S76 


1010 


zf-C2H2 


Zinc finger, C2H2 type 


4.2e-06 


33.7 


3 


2-24:29-52:154- 
177 


1011 


60s ribosomal 


60s Acidic ribosomal 
protein 


l.le-28 


94.4 


2 


1-22:25-89 


1012 


abhydrolase_2 


Phospholipase/ Carboxyleste 
rase 


1.3e-24 


95.2 


1 


9-213 


1016 


Anti_proliferat 


BTG1 family 


3.6e-73 


Zdo.j 


1 


1 1 9^ 


1017 


zf-C2H2 


Zinc finger, C2H2 type 


9.3e-71 


248.5 


11 


147-169:213- 

231:269-291:325- 

.54 /:3jo-o /j.Jol- 

403:409-431:437- 

459:465-487:493- 

515:521-543 


1017 


KRAB 


KRAB box 


1.7e-21 


84.8 


1 


42-122 


1020 


LIM 


LIM domain 


/c-jO 


1 ^9 6 


4 


78-136-1 ^9- 

197:200-256:259- 

318 


1024 


zf-C2H2 


Zinc finger, C2H2 type 


3.3e-53 


190.2 


10 


156-183:184- 

206:212-234:240- 

262:268-290:296- 

318:324-346:352- 

374:380-402:408- 
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1024 


KRAB 


KRAB box 


2.1e-42 


154.3 


1 


15-77 


1025 


RAG2 


Recombination activating 
protein 2 


r\ 
V 


1 JoU. 

2 


1 


1 ^77 


1026 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


3.7e-09 


33.9 


1 


16-54 


1029 


KeratinJB2 


Keratin, high sulfur B2 
protein 


3,8e-06 


16.4 


2 


5-149:150-281 


1031 


pro_isomerase 


Cyclophilin type peptidyl- 
prolyl cis-tr 


1.5e-106 


367.4 


1 


5-165 


1035 


zf-C2H2 


Zinc finger, C2H2 type 


9.9e-114 


391.2 


17 


100-122:132- 
154: 160-182: 188- 
210:216-238:244- 
zoo:z /Z-2y4:3UU- 

Q77.T7Q ^^A-l^^ 

^Q'^SA 406*419 
j / o.j o4~ t'UO.t* IZ— 

434:440-462:468- 
490:496-518:524- 
546-559-574 


1 AQ^ 




JSJv/vD DOX 


9 16 


O / .O 


i 

i 


4-54 


1038 


Sulfate_transp 


Sulfate transporter family 


4.3e-103 


355.9 


2 


2-284:441-751 


1038 


blAb 


OTA O /]a«m>ii'«> 

b l Ao domain 


a Co on 
4.oe-zu 


OA A 

oU.U 


— 


77A QR7 
/ /4-i'o / 


1 AO O 

1039 


ubiquitin 


Ubiquitin family 


i.4e-uy 


jy.O 


_J 


55 1 96 


1042 


ig 


Immunoglobulin domain 


1 Oa ia 
l.ze-oU 


1 A'J © 




69 19Q-16^ 
oz- izy . 1 Oj- 

229:264-316:349- 

400:433-501 


1043 


UPAR LY6 


u-PAR/Ly-6 domain 


o 1 « cn 

y. ieoy 


OAO *7 




£/X 1 OA 


1047 


MHCJ 


Class I Histocompatibility 
antigen, domains 


3.2e-147 


502.5 


i 


25-203 


1047 


ig 


Immunoglobulin domain 


0.057 


11.4 


i 


220-285 


1050 


ig 


Immunoglobulin domain 


Z.Je-Uy 


35.2 


— 


1Q 1 1 A 


1051 


PMP22_Claudm 


PMP- 

22/ tL Mr/JVlrzU/ CI auam 
family 


0.0019 


-4. / 


1 


/I 1 CM 

4-iy4 


1 AC A 

1054 


JSRCK 


Scavenger receptor 
cysteine-rich domain 


o.ze-ZD 


o^ ^ 




799 S9A 
/ZZ-oZU 


i ac^ 

1055 


P 2X_r eceptor 


Air FZX receptor 




1 ni ^ 

1U 13. 

9 




1JJ-JOO 


1057 


zona pellucida 


Zona pellucida-like domain 


1 la QA 


7Q 1 ^ 




ZDo-Jjo 


1057 


trefoil 


Trefoil (P-type) domain 


0.02 


9.1 


i 


224-262 


1058 


Aa_trans 


Transmembrane amino acid 
transporter protein 


C\ A ^ AO 


/to ^ 
42.5 




ou-joy 


1059 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


z.le-lJ 


44. o 




n ooa 


1060 


Na_sulph_symp 


Sodium: sulfate symporter 
transmembrane 


1 1 ^ 1 A O 

l.ze-143 


A OA *7 

490.7 




1o-jj4 


1064 


ERG4JBRG24 


Ergosterol biosynthesis 

Crd/FTi 09 4 fpmilv 

S2fl\\JZ>H Idlllliy 


l.le-103 


357.8 


i 


7-350 


1066 


ig 


Immunoglobulin domain 


4.7e-16 


56.8 




42-95:135- 
192:231-288 


1067 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


2.3e-33 


108.1 




28-266 


1069 


pkinase 


Protein kinase domain 


4.9e-68 


239.4 




298-578 


1069 


Activin_recp 


Activin types I and II 
receptor domain 


1.6e-27 


104.8 




20-107 
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1070 


PMP22__Claudm 


PMP- 

22/EMP/MP20/Claudin 
family 


0.00018 


13.2 


1 


3-177 


1071 


MHCJ 


Class I Histocompatibility 
antigen, domains 


4.4e~14 


55.1 


1 


24-196 


1071 


ig 


Immunoglobulin domain 


2.8e-07 


28.5 


1 


218-284 


1073 


sugar_tr 


Sugar (and other) 
transporter 


0.028 


126.7 


1 


48-528 


1074 


sugarjr 


Sugar (and other) 
transporter 


0.028 


126.7 


1 


136-616 


1075 


ank 


Ank repeat 


1.7e-45 


164.6 


6 


31-63:64-96:97- 

129:130-162:163- 

195:196-228 


1076 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.6e-12 


40.2 


1 


178-349 


1077 


sugar_tr 


Sugar (and other) 
transporter 


0.0032 


100.1 


1 


46-470 


1079 


G_glu_transpept 


Gamma- 

glutamyltranspeptidase 


2.8e-05 


144.9 


1 


122-500 


1080 


TPR 


TPR Domain 


2.3e-19 


77.8 


5 


28-61:68-101:108- 

141:148-181:188- 

221 


1081 


ACAT 


Sterol O-acyltransferase 


1.9e-32 


121.3 


1 


300-406 


1082 


WD40 


WD domain, G-beta repeat 


2.3e-16 


67.8 


7 


1015-1050:1059- 

1097:1115- 

1151:1158- 

1194:1203- 

1240:1246- 

1281:1293-1329 


1083 


pkinase 


Protein kinase domain 


4.9e-57 


202.9 


2 


248-492:537-564 


1083 


Activin_recp 


Activin types I and II 
receptor domain 


3.1e-36 


133.8 


1 


26-127 


1084 


pkinase 


Protein kinase domain 


4.9e-57 


202.9 


2 


310-554:599-626 


1084 


Activin_recp 


Activin types I and II 
receptor domain 


3.1e-36 


133.8 


1 


26-127 


1086 


C2 


C2 domain 


1.7e-06 


35.0 


1 


233-316 


1087 


Cache 


Cache domain 


1.5e-25 


96.2 


2 


557-650:960-985 


1088 


Glyco_hydro_3 1 


Glycosyl hydrolases family 
31 


4.9e-268 


903.8 


2 


1-92:114-636 


1090 


HMG_box 


HMG (high mobility group) 
box 


3.8e-32 


120.2 


1 


681-749 


1091 


serpin 


Serpin (serine protease 
inhibitor) 


3.1e-195 


662.0 


1 


315-683 


1092 


trypsin 


Trypsin 


0.0044 


12.4 


1 


406-526 


1094 


Aa_trans 


Transmembrane amino acid 
transporter protein 


0.0042 


-25.4 


1 


141-551 


1097 


ATP-synt C 


ATP synthase subunit C 


3.3e-18 


73.9 


1 


72-140 




aconitase 


Aconitase family (aconitate 
hydratase) 


1 a ^ 1 no 


651.7 


L 


1o2-z41:oz 1-/44 


1098 


Aconitase__C 


Aconitase C-terminal 
domain 


8.9e-72 


251.9 


1 


872-1043 


1103 


PAP2 


PAP2 superfamily 


6.3e-15 


63.0 


1 


89-236 


1104 


ig 


Immunoglobulin domain 


1.4e-20 


71.3 


2 


80-148:183-251 


1106 


TCTP 


Translationally controlled 
tumor protein 


3.5e-93 


323.0 


1 


1-166 
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1109 


efhand 


EF hand 


l.2e-l3 


58.8 


3 


33-61:102- 
1 10-1 66 

lJu. UO 1 W 


1110 


ATP1G1_PLM_ 
MATS 


ATP 1 G 1 /PLM/M AT8 
family 


l.8e-l3 


58.2 


1 


92-146 


1112 


ldljrecept_a 


Low-density lipoprotein 
receptor domain 


0.00073 


26.3 


1 


115-153 ! 


1112 


CUB 


CUB domain 


0.002 


-3.5 


1 


9-109 


1115 


7tmJ 


7 transmembrane receptor 
(rhodopsin family) 


3.9e-2l 


69.3 


1 


139-317 


1116 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


l.4e-24 


80.3 


3 


2-28:114-275:348- 
363 


1119 


tsp_l 


Thrombospondin type 1 
domain 


4.5e-38 


139.9 


10 


1 /I A IftO,'] A/C 

l4y-iyo:3Uo- 
364:571-626:631- 

6Q6.7H7 761-Rzll- 

00Q.Q7n 
OOi?.i? / U- 

1091-1 OQ0- 
iuz 1 . \.\jyy- 

1148:1219- 

1269*1342-1398 


1120 


HMG14 17 


HMG14 and HMG17 


l.3e-34 


128.4 


1 


2-86 


1125 


ig 


Immunoglobulin domain 


a o a 

4. /e-zu 


oy. / 




OH— IJJ . 1 O J- 

255:292-347 


1127 


ig 


Immunoglobulin domain 


1 1 A 

l.oe-10 


00. y 


1 
1 


49-1 1 9 


1131 


C2 


C2 domain 


5.7e-19 


/O.J 


z 


167 9^7-667-7*50 


1133 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


6.9e-29 


93.9 


1 


104-352 


1134 


MIP 


Major intrinsic protein 


3.9e-40 


125.5 


3 


80-189:197- 

969-^n.R ^9^ 


1136 


sugar_tr 


Sugar (and other) 
transporter 


0.024 


1 OA Q 

iz4.y 


1 


23-504 


1137 


ATP-synt_C 


ATP synthase subunit C 


3.3e-35 


130.4 


2 


14-79:90-155 


1138 


DHDPS 


Dihydrodipicolinate 
synthetase family 


4.4e-32 


1 OA A 

120. U 


1 


1A ^9^ 


1139 


SSF 


Sodium: solute symporter 
family 




1 HA 7 
1 /4. / 


z 


SO-461 -^6Q-95^ 


1143 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


O ^A 

z.zeoU 


1 AO O 

loz.z 


1 
1 


/ O-JJii 


1145 


PID 


Phosphotyrosine interaction 
domain (PTB/PID) 


i a«. c\A 

i.ye-y4 


52 /.Z 


z 


4RR-697-661-789 


1145 


WW 


WW domain 


2.5e-08 


41.1 


1 
1 


^7^-401 


1148 


C2 


C2 domain 


01^. ia 

Z. Ie-JU 


1 1 A A 
1 14.4 


z 


87- 1 6V940-^90 

O/l UJ .Z.'tVJ- ->Z\J 


1152 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


y.oe-31 


QQ 0 

yy.o 


1 
1 


161-41 0 


1153 


WD40 


WD domain, G-beta repeat 


A A1 "2 


00 1 
zz. 1 


4 


96-69-71-1 00-9?6- 
271:430-467 


1155 


aa__permeases 


Amino acid permease 


5.6e-06 


179.9 


1 


193-613 


1158 


dUTPase 


dUTPase 


5.1e-29 


109.8 


1 


46-167 


1158 


Mir 


JVLajor intrinsic proiem 




88.8 




2-56 


1158 


rvp 


Retroviral aspartyl protease 


2.1e-22 


85.1 


1 


179-280 


1158 


G-patch 


G-patch domain 


0.00095 


25.9 


1 


285-329 


1160 


laminin_EGF 


Laminin EGF-like 
(Domains III and V) 


9.5e-132 


451.1 


14 


299-353:356- 

423:426-466:490- 

532:535-578:584- 

627:630-680:683- 

727:1265- 
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1308:1311- 
1352:1355- 

1 a n i . 1 /l A/i 

1401:1404- 
1452:1686- 
1730:1733-1786 


1160 


laminin_Nterm 


Laminin N-terminal 
(Domain VI) 


4e-77 


269.6 


1 


47-297 


1160 


laminin_G 


Laminin G domain 


6.le-3l 


116.2 


3 


2625-2748:3015- 
313o:3Ioj-3313 


1160 


laminin B 


Laminin B (Domain IV) 


l.le-09 


45.6 


1 


1517-1651 


1161 


EGF 


EGF-like domain 


2.2e-26 


101.1 


5 


29-57:60-85: 104- 

1/1A.1/1'7 17C-1 Q< 

14U: 14/-1 /o. ioD- 
216 


1162 


interferon 


Interferon alpha/beta 
domain 


1 c« n 
l.oe-1 / 




1 


K 1 T\ 


1163 


IL1 


Interleukin-1 / 18 


o.7e-zl 


OO C 

oZ.D 


1 


1 Q 1 C,Q 


1164 


PDGF 


Platelet-derived growth 
lactor (rD Lrr ) 


1 O— C 1 

l.oe-51 


1 cm /i 


1 


co ion 


1165 


IL1 


Interleukin-1 / 18 


3.7e-23 


90.4 


1 


11-144 


1167 


Palm_thioest 


Palmitoyl protein 
thioesterase 


2.4e-21o 


732.1 


1 


Zo-ZoZ 


1168 


serpin 


Serpin (serine protease 
inhibitor) 


i.oe-zuz 


OoO.Z 


i 
l 


4/-41!) 


1170 


Defensm_propep 


Defensin propeptide 


l.Ze-Zo 


1 ao a 


1 


15 OA 

35-yu 


1170 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


103-131 


1172 


Y_phosphatase 


Protein-tyrosine 
phosphatase 


1 O „ 1 1 A 

1.8e-110 


o o r\ a 
380.4 


1 


ooi /inn 

z/ i-4yy 


1179 


Metallophos 


Calcineurin-like 
phosphoesterase 


l.le-11 


52.3 


1 


70-285 


1184 


MHC_I 


Class I Histocompatibility 
antigen, domains 


6.2e-06 


-5.9 


1 


29-205 


1187 


PLA2_B 


Lysophospholipase catalytic 
domain 


8.3e-53 


188.9 


1 


O C~f OAA 

357-800 


1 1 OO 

1187 


Cz 


C2 domain 


1 a/c 

l.oe-Uo 


1 


1 
l 


/I/: 1 oo 

4o-izy 


1 1 o r\ 

1189 


lipocalin 


Lipocalin / cytosolic fatty- 
acid binding 


Z.oe-3y 


1 /t/i a 
144. U 


i 
1 


3y-ioo 


1 1 QQ 


jvuniiz_r>r 1 1 


js.unitz/ij ovine pancreatic 
trypsin inhibito 


id. 7<= 91 


on n 


9 




1190 


Ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1191 


ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1200 


zf-DHHC 


DHHC zinc finger domain 


3.7e-15 


63.8 


1 


87-149 


1201 


KRAB 


KRAB box 


6.6e-22 


86.2 


1 


33-73 


1202 


KRAB 


KRAB box 


6.6e-22 


86.2 


1 


33-73 


1204 


trypsin 


Trypsin 


3e-31 


117.2 


1 


56-239 


1212 


LRR 


Leucine Rich Repeat 


2.5e-54 


193.9 


15 


73-96:97-122:123- 

149:150-175:176- 

195:197-220:221- 

lA&'OAH 99A^9Q9.. 
Z / 4-O.Z*+ /-Z /U.Zi'Z- 

317:318-337:339- 
362:363-388:389- 
408:410-433:434- 
459 


1213 


Zn carbOpept 


Zinc carboxypeptidase 


4.3e-62 


219.7 


1 


50-288 


1221 


Clq 


Clq domain 


3.4e-41 


150.3 


1 


134-258 


1221 


Collagen 


Collagen triple helix repeat 


2.6e-10 


47.7 


1 


41-100 
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Fosition 






— ; " 

(20 copies) 










1223 


pkinase 


Protein kinase domain 


9e-75 


261.8 


i 


158-417 


1223 


CI T TO 

SH3 


OTTO J A^ftJ^ 

brii domain 




1 A 




ATX AIR 1 
^ZJ-*f /o 


1223 


UBA 


UBA/TS-N domain 


0.037 


20.6 




987-1026 


1224 


vwa 


von Willebrand factor type 
A domain 


1.2e-05 


29.0 


1 


51-242 


1225 


abhydrolase 


alpha/beta hydrolase fold 


3.7e-12 


53.8 




111-390 


1227 


lectin c 


Lectin C-type domain 


9.6e-29 


108.9 




54-166 


1237 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


L5e-09 


-1.3 




242-385 


1257 


LRR 


Leucine Rich Repeat 


9e-65 


228.6 


14 


78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:270-293:294- 

111.0 1 O O /I 1 .O yfO 

317:318-341:342- 
365:366-389:390- 

A 1 1 

413 


1257 


LRRCT 


Leucine rich repeat C- 
terminal domain 


0.011 


22.3 


i 


423-475 


1263 


Pep_M12B_prop 
ep 


Reprolysin family 
propeptide 


1.8e-28 


108.1 


i 


75-191 


1263 


disintegrin 


Disintegrin 


2.7e-ll 


51.0 


i 


419-494 


1263 


Reprolysin 


Reprolysin (M12B) family 
zinc metallo 


7.3e-07 


-22.2 




206-402 


1266 


Clusterin 


Clusterin 


2.3e-298 


1004. 

O 





<-j i r\A 

2-394 


1267 


serpin 


Serpin (serine protease 
inhibitor) 


3.8e-140 


479.0 




0U-4ZD 


1268 


serpin 


Serpin (serine protease 
inhibitor) | 


o.8e-162 


551.2 






1269 


COX6C 


Cytochrome c oxidase 
subunit Vic 


2.3e-38 


140.9 


■ 


1-75 


1273 


Rhodanese 


Rhodanese-like domain 


1.5e-48 


174.7 




16-138:165-266 


1274 


trypsin 


Trypsin 


2.9e-80 


280.1 


1 


121-346 


1279 


Calpain_III 


Calpain large subunit, 
domain III 


1.5e-25 


98.3 




30-179 


1287 


zf~C2H2 


Zinc finger, C2H2 type 


4.4e-66 


233.0 


9 


173-195:201- 

223:229-251:257- 

279:285-307:313- 

335:341-363:369- 

391:397-419 


1287 


KRAB 


KRAB box 


4.9e-23 


90.0 


1 


14-54 


1294 


vwc 


von Willebrand factor type 
C domain 


1.3e-09 


45.4 


1 


69-127 


1295 


zf-C2H2 


Zinc finger, C2H2 type 


5e-50 


179.6 


8 


263-285:319- 

341:347-369:375- 

397:403-425:431- 

510 


1295 


KRAB 


KRAB box 


8.9e-26 


99.1 


1 


8-48 


1296 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-113 


389.6 


13 


190-212:218- 

240:246-268:274- 

296:302-324:330- 

352:358-380:386- 

408:414-436:442- 
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464:470-492:498- 


1296 


KRAB 


KRAB box 


6.7e-28 


106.1 


1 


14-54 I 


1299 


hemopexin 


Hemopexin 


7e-25 


CMC 1 

96.1 


4 


97Q ^99-^9A- 
167-^6Q-A1 S-41 7- 
tu 1 


1299 


Peptidase M10 


Matrixin [ 


o.4e-14 


JO.O 


1 
1 




1305 


DUF25 


Domain of unknown 
function DUF25 


5.3e-76 


265.9 


1 


40-162 


1307 


kazal 


Kazal-type serine protease 
inhibitor domain 


1.3e-12 


55.3 


1 


21-68 


1309 


Defensinpropep 


Defensin propeptide 


1.2e-26 


1 AO A 

102. 0 


1 
1 


7A 1 96 
1ZO 


1309 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


139-167 1 


1312 


ras 


Ras family 


2.7e-59 


210.4 


1 


£ZA OCO 

04-ZDJ 


1314 


zf-C2H2 


Zinc finger, C2H2 type 


l.le-51 


185.1 


8 


163-185:191- 

01Q.91Q 9A1 -9A7- 

z 1 o :z i y-z*+ i .zh / - 
269:275-297:303- 

381 


1315 


Keratin J32 


Keratin, high sulfur B2 
protein 


0.0035 


-30.0 


1 


13-154 


1316 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.6e-15 


63.8 


1 


23-154 


1317 


Keratin_B2 


Keratin, high sulfur B2 
protein 


8.2e-20 


79.2 


2 


zu-ioj:io4-zy_> 


1319 


zf-C2H2 


Zinc finger, C2H2 type 


2.9e-52 


187.1 


7 


33-55:61-83:89- 

111*117 1 ^Q- 1 
111.11 /-LDy. LHrD- 

167:173-195:201- 
223 


1321 


Keratin_B2 


Keratin, high sulfur B2 
protein 


0.00014 


-8.3 


1 


23-193 


1331 


is 


Immunoglobulin domain 


9.7e-06 


32.5 


1 


O/l 7Q 

Z4- /o 


1332 


Kunitz_BPTI 


Kunitz/Bovine pancreatic 
trypsin inhibit o 


2.2e-18 


74.5 


1 


76-126 


1333 


zf-C2H2 


Zinc finger, C2H2 type 


4.6e-55 


196.3 


7 


199-221:227- 

0/1Q-9*\^ 977-98^.. 
Z4y.ZDj-Z/ / .ZOO- 

305:311-333:339- 
361:367-389 


1333 


KRAB 


KRAB box 


5.8e-24 


93.0 


1 


0-40 


1336 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 


32-193 


1337 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 




1339 


PMP22_Claudin 


PMP- 

22/EMP/MP20/Claudin 
family 


0.0088 


-16.3 


1 


A OAO 


1340 


PMP22_Claudin 


PMP- 

9 9 AR AA P AN/TP9 0 /P1 m l H i n 
ZZ/ X2jrvi.tr/ IVJLJr ZU/ V^lauuiii 

family 


0.0019 


-4.7 


1 


4-194 


1343 


ig 


Immunoglobulin domain 


1.5e-13 


58.5 


2 


81-158:204-278 


1347 


ig 


Immunoglobulin domain 


1.2e-07 


38.9 


2 


168-226:276-330 


1348 


ig 


Immunoglobulin domain 


1.5e-18 


75.1 


3 


157-217:269- 
325:373-427 


1353 


LRR 


Leucine Rich Repeat 


2e-39 


144.4 


9 


120-143:144- 
167:168-191:192- 
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215:216-239:240- 
263:264-287:311- 
334:335-358 


1353 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0066 


23.1 


1 


92-119 


1356 


ig 


Immunoglobulin domain 


2.1e-17 


71.3 


3 


37-106:138- 
208:245-300 


1357 


CD36 


CD36 family 


3.9e-231 


781.3 


1 


5-445 


1358 


DUF139 


Cysteine rich repeat 
(DUF139) 


1.2e-08 


42.2 


4 


195-211:245- s 

261:288-304:316- 

332 


1359 


ig 


Immunoglobulin domain 


1.6e-06 


35.1 


2 


41-124:156-230 j 


1359 


Gag MA 


Matrix protein (MA), pl5 


0.0044 


-25.2 


1 


291-429 


1360 


ion trans 


Ion transport protein 


0.0031 


24.2 


1 


285-445 


1363 


spectrin 


Spectrin repeat 


9.5e-22 


85.7 


6 


17-121:124- 
226:229-340:372- 
476:678-785:788- 
896 


1366 


Clq 


Clq domain 


2.1e-31 


117.8 


1 


266-390 


1366 


Collagen 


Collagen triple helix repeat 
(20 copies) 


0.00029 


19.8 


1 


182-241 


1367 


rnaseA 


Pancreatic ribonuclease 


l.le-34 


128.7 


1 


27-132 


1368 


3HCDH_N 


3 -hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.0021 


-60.2 


1 


50-141 


1371 


voltage_CLC 


Voltage gated chloride 
channel 


3.1e-199 


675.3 


1 


92-528 


1371 


CBS 


CBS domain 


1.2e-24 


95.4 


2 


559-617:761-814 


1375 


Peptidase_Cl 


Papain family cysteine 
protease 


le-120 


414.5 


1 


114-332 


1376 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


2.6e-50 


180.6 


1 


48-454 


1378 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


4.9e-09 


32.5 


1 


250-500 


1380 


MAGE 


MAGE family 


1.4e-91 


317.7 


1 


3-231 


1381 


ion trans 


Ion transport protein 


0.066 


16.8 


1 


158-323 


1382 


ig 


Immunoglobulin domain 


7e-12 


52.9 


2 


37-128:160-241 


1385 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 


1 


29-204 


1388 


pro_isomerase 


Cyclophilin type peptidyl- 
prolyl cis-tr 


7e-88 


305.4 


1 


45-184 


1390 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.3e-06 


21.2 


1 


1-210 


1393 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.6e-12 


55.0 


1 


99-329 


1397 


fh3 


Fibronectin type III domain 


4.7e-78 


272.7 


5 


533-619:631- 
717:734-833:854- 


1397 


ig 


Immunoglobulin domain 


3.7e-42 


153.5 


5 


154-227:261- 

318:362-418:453- 

511:1253-1316 


1404 


ig 


Immunoglobulin domain 


2.9e-08 


40.9 


2 


42-99:139-198 


1409 


ig 


Immunoglobulin domain 


8.1e-30 


112.5 


3 


142-199:263- 
321:359-438 


1414 


mito can- 


Mitochondrial carrier 


3.5e~64 


226.7 


3 


40-130:137- 
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protein 








227:238-322 


1416 




Immunoglobulin domain 


o om 1 






57-112 


1417 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


6.6e-06 


33.1 


1 


27-67 


1418 


Synaptophysin 


Synaptophysin / 
synaptoporin 


O 1o I'X 


1 

jLOJ . 1 




27-273 


1419 


PMP22_Claudin 


PMP- 

22/EMP/MP20/Claudin 
family 


5.2e-18 


73.3 


1 


4-165 


1421 


Ammoniumjran 
sp 


Ammonium Transporter 
Family 


o.De-io 


7? (\ 




25-383 


1423 


rnaseA 


Pancreatic ribonuclease 


^f. ie-zy 


110 1 




32-136 


1427 


LRR i 


Leucine Rich Repeat 


^oe-zo 


1 00 0 


6 


3-26:27-50:51- 

74:76-99:100- 

123:124-144 


1427 


fn3 


Fibronectin type III domain 


2.7e-08 


41.0 


1 


334-417 


1427 


ifi 


Immunoglobulin domain 


2.4e-07 


37.9 


1 


225-283 


1427 


LRRCT 


Leucine rich repeat C- 
terminal domain 


7.2e-06 


32.9 


1 


164-209 


1428 


GRAM 


GRAM domain 


u.uo / 


1 1 1 
ij. l 




177-245 


1435 


Rhomboid 


Rhomboid family 


1.5e-45 


164.8 


1 


169-326 


1437 


Sema 


Sema domain 


/ e-ZUZ 


fS>A 1 


i 
i 


102-513 


1437 


tsp_l 


Thrombospondin type 1 
domain 


1 la AO 

1. ie-4Z 


1 ^ ^ 


u 


589-637*644- 
695:702-746:833- 
883:890-940:945- 
985 


1437 


PSI 


Plexin repeat 


2.5e-14 


61.0 


1 


531-578 


1438 


ank 


Ankyrin repeat 


2e-15 


64.7 


3 


336-368:383- 
418:468-498 


1438 


ion trans 


Ion transport protein 


9.4e-07 


35.9 


1 


615-817 


1439 


ank 


Ankyrin repeat 


i.Je-iz 






417-449:464- 
499:549-579 


1439 


ion trans 


Ion transport protein 


y.4e-u/ 


^ Q 


i 

i 


696-898 


1442 


LRR 


Leucine Rich Repeat 


3.2e-20 


80.6 


5 


57-80:81-104:105- 

128:129-152:153- 

176 


1442 


LRRCT 


Leucine rich repeat C- 
terminal domain 


Z.JQ-V 1 


"XH Q 


i 
i 


186-240 


1442 


LRRNT 


Leucine rich repeat N- 
terminal domain 


A AAA1 1 


Zo.O 


l 


27-55 


1443 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


7.2e-23 


89.4 


1 


166-476 


1443 


HMG_box 


HMG (high mobility group) 
box 


3.1e-05 


30.8 


1 


3-76 


1443 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.0066 


19.1 


1 


110-162 


1444 


DUF214 


Predicted permease 


U.UU4!) 


-1D.O 


i 
i 


136-276 


1 A K1 
14 D Z 


mno carr 


Mitochondrial carrier 
protein 


8.4e-58 


205.5 


2 


27-123:124-217 


1454 


mito_carr 


Mitochondrial carrier 
protein 


0.00049 


22.3 


2 


79-179:198-287 


1455 


WD40 


WD domain, G-beta repeat 


2.7e-26 


100.8 


4 


64-99:105- 

141:235-273:279- 

315 


1456 


Furin-like 


Furin-like cysteine rich 


6.1e-89 


308.9 


1 


172-328 
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region 










1456 


Re cep_L_domain 


Receptor L domain 




280.0 


2 


46-169:345-471 


1456 


pkinase 


Protein kinase domain 


7e-65 


229.0 


1 


986-1258 


1462 


Peptidase M10 


Matrixin 


1 *p 7Q 


977 7 




62-227 


1462 


hemopexin 


Hemopexin 


i 4^ 1 o 


to.u 


1 


309-350 


1463 


Armadillo_seg 


Armadillo^eta-catenin-like 
repeat 


0.028 


21.0 


4 


549-591:641- 

685:686-737:738- 

783 


1466 


C2 




C2 domain 


f\ ^-07 


36.4 


1 


219-304 


1468 


RTC 


KJN A 3 -terminal pnospnaxe 
cyclase 


o 0006 


123.2 


1 


3-221 


1476 


CN hydrolase 


Carbon-nitrogen hydrolase 


0.0014 


-83.3 


1 


64-341 


1477 


zf-C2H2 


Zinc finger, C2H2 type 




H 7 


3 


267-289:294- 
317:419-442 


1478 


Euk porin 


Eukaryotic porin 


3.3e-06 


-42.1 


1 


1-142 


1484 


BTB 


T> 'I'll ITl /™\ r 7 J ^v-^v-t rt 1 -n 

B 1 x3/r UZ domain 


1 8p-0Q 


44.9 


1 


52-194 


1487 


rvt 


Reverse transcriptase 


j.Dc-JO 




1 


241-507 j 


1489 


Galactosyl _T_2 


Gal actosyltr ansferase 


J.OC" 1 / -J 


595.3 


1 


97-404 ! 


1491 


GBP_C 


Guanylate-binding protein, 
C-terminal domain 


i o~ ao 


117 0 


1 


23-125 


1493 


NUDIX 


Mut l -like domain 


u.uuio 


23.5 


1 


297-439 


1496 


Big^.2 


Bacterial Ig-like domain 
(group 2) 




66 8 


1 


1177-1258 


1497 


iS 


Immunoglobulin domain 


0 Qp~1 1 


46.1 


1 


38-112 


1498 


aldo ket red 


Aldo/keto reductase family 


6.8e-37 


126.5 


2 


207-237:245-359 


1502 


Caveolin 


Caveolin 


1.6e-06 


13.3 


1 


90-262 


1504 


sushi 


Sushi domain (SCR repeat) 


8.2e-16 


66.0 


2 


223-281:286-344 


1511 


zf-C2H2 


Zinc finger, C2H2 type 


l.le-16 


68.9 


2 


341-363:369-391 


1511 


KRAB 


KRAB box 


6e-05 


22.1 


1 


231-275 


1516 


CBS 


CBS domain 


2.8e-33 


124.0 


3 


189-243:264- 
317:336-389 


1517 


zf-C2H2 


Zinc finger, C2H2 type 


2.1e-128 


440.0 


18 


279-301:307- 

329:335-357:363- 

385:391-413:419- 

AA'\ 'AA1 4£Q-47^- 
407- ^01-^9^-868- 
RQ0-8Q6-Q1 8-924- 
946-952-974-980- 
1002:1008- 
1030:1036- 
1058:1064- 
1086:1092-1114 


1517 


KRAB 


KRAB box 


5.3e-69 


242.7 


2 


74-136:684-735 


1518 


i£ 


Immunoglobulin domain 


1 la H7 

i.i e-u / 


90 8 
z,y. o 




38-102 


1521 


CoA trans 


Coenzyme A transferase 


1 1 1 




9 


49-261 '271-472 


1521 


TGF-beta 


Transforming growth factor 
beta like domain 


6.8e-64 


ZZ3. / 


i 
1 


\}D / — / *T 1 


1 ^91 




Reprolysin (M12B) family 
zinc metallo 


4.4e-95 


329.3 


1 


223-422 


1523 


Pep„M12B_prop 
ep 


Reprolysin family 
propeptide 


6.7e-30 


112.8 


1 


121-234 


1523 


disintegrin 


Disintegrin 


l.le-18 


59.5 


1 


437-513 


1523 


EGF 


EGF-like domain 


0.0092 


22.6 


1 


661-693 


1524 


DSPc 


Dual specificity 
phosphatase, catalytic doma 


3.6e-71 


249.9 


1 


102-241 
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1525 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


C 1 A 

5.6e-14 


A£* & 
40.0 


i 
i 




1527 


Spermine_synth 


Spermine/spermidine 
synthase 


6e-13 


A 


i 
l 


954-441 
ZJf -*+*t1 


1528 


efhand 


EF hand 


3.8e-10 


An 1 
4/.1 


A 
4 


173:329-357:365- 


1530 


rvt 


Reverse transcriptase 


6e-68 


Z3y.Z 


i 
1 




1531 


RPH3A_effector 


Rabphilin-3A effector 
domain 


1.9e-09 


-O.l 


1 


i^i 040 


1533 


TPR 


TPR Domain 


A 1« 1/1 

4.1e-lU 


AH A 

4 /.U 


D 


990-951-444- 
477:478-511 


1534 


SAM 


SAM domain (Sterile alpha 
motif) 


A A/1 £. 
U.U40 


ion 

1 O.U 


1 

1 




1538 


pkinase 


Protein kinase domain 


7.2e-42 


152.5 


2 


42-76:82-242 


1540 


zf-C2H2 


Zinc finger, C2H2 type 


U.UUU3J 


99 'X 

Z /. o 


1 
1 


795-750 


1541 


COLFI 


Fibrillar collagen C-terminal 
domain 


8.5e-216 


553.4 


1 


624-841 


1542 


EGF 


EGF-like domain 


2.6e-ll 


51.0 


3 


916-947:1004- 
1044-1 050-1 094 


1543 


ig 


Immunoglobulin domain 


1.3e-12 


45.7 


1 


38-112 


1544 


UCH-1 


Ubiquitin carboxyl-terminal 
hydrolases famil 


9.1e-15 


62.5 


1 


477-508 


1544 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


6.4e-13 


56.3 


1 


759-811 


1544 


zf-UBP 


Zn-fmger in ubiquitin- 
hydrolases and other 


0.00069 


18.9 


1 


350-420 


1546 


LRR 


Leucine Rich Repeat 


2.4e-30 


114.3 


9 


1 1 1 
loZ-130. 1DO- 

179:180-199:201- 

9£Q*971- 
ZZ4.ZDU-ZO:7.Z / 1- 

294:295-318:319- 

j4 1 .O^+Z-OOJ 


1546 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00082 


1 

ZO.l 


i 
l 

— 


69 84 


1551 


Peptidase_JS9 


Prolyl oligopeptidase family 


A 1o 1 A 

4. Je- iu 


jO.O 




397-469 


1552 


Orn_DAP_Arg_d 
eC 


Pyridoxal-dependent 
decarboxylase, C- 


1.5e-14 


55.5 


i 


150-202 


1552 


Orn_Ar&_deC_N 


Pyridoxal-dependent 
decarboxylase, py 


A At* 1 1 

4.4e-i i 




— : 


19-117 


1555 


rvt 


Reverse transcriptase 


5e-17 


67.6 


i 


616-718 


1557 


rvt 


Reverse transcriptase 


1 1 a £Q 


941 & 




247-520 


1558 


RhoGEF 


RhoGEF domain 


1.5e-25 


98.4 


i 


971-1155 


1558 


PDZ 


PDZ domain (Also known 
as DHR or GLGF) 


A A « 10 

4.4e-12 


DD.O 




15Q-914 


1558 


PH 


PH domain 


0.0017 


17.5 


i 


1199-1312 


1559 


pkinase 


Protein kinase domain 


1.9e-Z4 


OO 1 

yZ. 1 


— 


151-911 -10Q-419 


1560 


FH2 


Formin Homology 2 

Dfvm 9 in 

J_y Will CI 111 


A A — 111 

4.4e-lll 


1QO A 




5Q5-1 018 


1565 


HMG_CoA_synt 


Hydroxymethylglutaryl- 
coenzyme A synthas 


6.5e-300 


1009. 
7 




50-582 


1568 


GCV H 


Glycine cleavage H-protein 


1.6e-76 


267.6 




65-185 


1577 


Peptidase_Ml 


Peptidase family Ml 


1.3e-114 


332.5 




42-161:225- 
354:365-399 


1579 


rrm 


RNA recognition motif 


0.0068 


23.0 




823-891 


1580 


serpin 


Serpin (serine protease 


8.9e-135 


458.1 


3 


37-241:352- 
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inhibitor) 








475:504-533 


1581 


CH 


Calponin homology (CH) 
domain 


0.012 


13.5 


1 


O/l 101 
o4- 1 0 1 


1582 


LRR 


Leucine Rich Repeat 


2.2e-37 


137.6 


12 


/U-yi:y4-l 1 /: 1 lo- 

1/11.1/10 1 1 

141: 14Z-IOJ. 100- 
189:190-213:214- j 

0^7«9^52 9^1 -9£9_ 
Zj / .Zjo-ZOI .zuz,- 

285:286-310:311- 

jjJ.JjD-JJ7 


1582 


LRRCT 


Leucine rich repeat C- 
terminal domain 


8.1e-12 


DZ. / 


1 
1 


ooy-H-z, 1 


1582 


ig 


Immunoglobulin domain 


1 Oa HQ 

i.ye-Uo 


*\o 9 


1 
1 


438-499 


1582 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0099 


22.5 


1 


28-68 


1587 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


0.00027 


18.1 


1 


168-213 


1587 


IBR 


IBR domain 


u.uu / 1 


1 c\ 0 
1U.Z 


1 
1 


9^7-908 
Zj /-z,yo 


1588 


rvt 


Reverse transcriptase 


1.2e-05 


25.4 


1 


192-250 


1589 


rvt 


Reverse transcriptase 


le-10 


A A *\ 

44. Z 


1 


9A 


1591 


TPR 


TPR Domain 


3.8e-13 


0 /A 


D 


9*\ZL 9R7-9RR- 
191 •199-^S'S 


1592 


PHD 


PHD-finger 


U.U4Z 


z.y 


Z 

— 


fiR4_79 i v1 1 1 7- 

UO't / £*0 .111/ 

1163 


1595 


Peptidase_Ml 


*i 

Peptidase family Ml 


Op 11-5 

ze-i id 


19Q 1 

jz,y, 1 





100-279 


1597 


IPPc 


Inositol polyphosphate 
phosphatase family, c 


1 1 p K 


7 




50-112 


1600 


thyroglobulin_l 


Thyroglobulin type-1 repeat 


d. le-uo 


Id 1 




38-118 


1604 


ig 


Immunoglobulin domain 


0.014 


13.3 


1 


34-59 


1606 


zf-CXXC 


CXXC zinc finger 


1 He*. O 1 

1 . /e-z 1 


Q/l Q 

o4.y 


— 


Ju"7U 


1606 


F-box 


F-box domain 


/.ze-uo 


^9 Q 

oz.y 







1606 


PHD 


PHD-finger 


U.UoZ 


1 ^ 


— 


Z>\J 1 / 1 


1613 


rrm 


RNA recognition motif. 


z.ye-14 


OU.O 


— ■ 


1 ^1-991 
1 j i-^^j 1 


1614 


SRCR 


Scavenger receptor 
cysteine-rich domain 


z.ze-zo 


1A1 1 

1U1.1 




Jz," 1 tJ 


1617 


B56 


Protein phosphatase 2A 
regulatory B subunit 


6.1e-35 


124.7 


1 


146-218 


1618 


SI 


SI RNA binding domain 


1.2e-10 


44.3 


1 


352-429 


1619 


trypsin 


Trypsin 


j.le-oy 


1ZD.J 




30-122 


1620 


STphosphatase 


Ser/Thr protein phosphatase 


1.7e-71 


245.5 


1 


66-236 


1623 


Flgl 


Flagellar P-ring protein 


5.4e-81 


282.5 




OQ 911 

/y-zo 1 


1623 


FlgH 


Flagellar L-ring protein 


7.5e-34 


110.4 


l 


21-73 


1625 


S emialdhyde__dh 
C 


Semialdehyde 
dehydrogenase, dimerisat 


8.1e-61 


215.5 


1 


13 /-3Uo 


1625 


Semialdhyde_dh 


Semialdehyde 
dehydrogenase, NAD bindi 


2,6e-34 


109.5 


1 


20-128 


1628 


HTH_1 


Bacterial regulatory helix- 
turn-helix protei 


2.6e-33 


124.1 


1 


7-146 


1 &i 1 


r c,r -uti nzer s_\^ 


T>T?T> i iti1 i ^in cr pn7VmP 1 TA/i 
rC/ir -ULlllZiliig CLicyni^i) ± ±ly± 

barrel doma 


9.1e-64 


225.3 


-j 


53-163 


1632 


ACR tran 


AcrB/AcrD/AcrF family 


3.1e-15 


58.8 




188-254 


1632 


OTCace 


Aspartate/ornithine 
carbamoyltransferase, A 


1.7e-09 


35.8 




459-514 


1632 


OTCaceJNT 


Aspartate/ornithine 
carbamoyltransferase, c 


0.44 


8.5 




41 1-426 


1632 


Replicase 


Replicase family 


2.9 






399-576 
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102.2 






1634 


ribonuclease T2 


Ribonuclease T2 family 


43e-17 


70.2 


1 


78-138 


1638 


efhand 


EF hand 


5.6e-10 


46.6 


4 


38-66:74-102:111- 
139:147-175 


1638 


RrnaAD 


Ribosomal RNA adenine 
dimethylase 


5 


2.6 


1 


133-173 


1643 


Ribosomal L23 


Ribosomal protein L23 


1.2e-14 


53.9 


1 


197-276 


1645 


tsp_l 


Thrombospondin type 1 
domain 


0.0026 


21.0 


5 


420-476:480- 

534:541-601:607- 

663:669-718 


1647 


Arginosuc_synth 


Arginosuccinate synthase 


2.8e-68 


218.8 


2 


48-12o:128-lc9 


1651 


vwa 


von Willebrand factor type 
A domain 


l.le-33 


125.4 


1 


89-326 


1652 


ig 


Immunoglobulin domain 


6.1e-10 


37.1 


2 


O A 1 1 T.nn ooi 

34-1 17:220-293 


1654 


PP2C 


Protein phosphatase 2C 


3.4e-05 


21.8 


1 


86-138 


1658 


thiored 


Thioredoxin 


l.le-105 


354.6 


2 


324-432:459-570 


1662 


Hist_deacetyl 


Histone deacetylase family 


5.5e-85 


295.7 


2 


857-924:1247- 

1 AO A 

1434 


1666 


wap 


WAP -type (Whey Acidic 
Protein) 'four-disum 


1.5e-21 


85.1 


1 


ha 1 1 / 

72-1 16 


1667 


KRAB 


KRAB box 


4.3e-43 


156.6 


1 


1 f\C 1 £1Q 


1672 


ig 


Immunoglobulin domain 


le-13 


49.3 


3 


146-203:245- 

OO^.TI 1 /I AO 

Zyj.jD 1-4U.Z 


1674 


Transposase_22 


LI transposable element 


l.De-j 1 


10/I T 

lo4. / 


I 


o-zol 


1676 


Ribosomal S2 


Ribosomal protein S2 


9.5e-09 


34.8 


1 


51-134 


1678 


ig 


Immunoglobulin domain 


0.036 


1 O A 
1Z.U 


1 


Jj- /o 


1684 


adh short 


short chain dehydrogenase 


2.3e-32 


120.9 


1 


2-244 


1686 


pkinase 


Protein kinase domain 


2.5e-50 


180.6 


2 


553-770:875-896 


1687 


HRM 


Hormone receptor domain 


7.8e-05 


29.5 


1 


82-134 


1688 


SH3 


SH3 domain 


1.9e-21 


84.7 


1 


150-206 


1688 


SH2 


SH2 domain 


1.2e-08 


28.6 


1 


214-233 


1688 


RGS 


Regulator of G protein 
signaling domain 


0.0003 


19.5 


1 


75-100 


1693 


LRR 


Leucine Rich Repeat 


5.3e-34 


126.4 


10 


505-528:529- 

554:555-573:575- 

598:613-632:634- 

657:659-684:686- 

708:709-732:738- 

763 


1694 


SH3 


SH3 domain 


0.00093 


18.2 


1 


19-73 


1701 


cystatin 


Cystatin domain 


8.9e-ll 


41.4 


1 


32-78 


1705 


Ribosomal S26e 


Ribosomal protein S26e 


2.1e-16 


68.0 


1 


133-231 


1706 


ank 


Ank repeat 


4.9e-30 


113.2 


3 


381-413:414- 
446:450-482 


1706 


TPR 


TPR Domain 


1.4e-06 


35.3 


4 


3-36:43-76:164- 
197:205-238 


1706 


LRR 


Leucine Rich Repeat 


0.0026 


24.4 


4 


716-743:744- 

764:775-802:835- 

859 


1708 


zf-C2H2 


Zinc finger, C2H2 type 


3.6e-23 


90.4 


4 


51-74:205- 

227:233-255:261- 

284 


1709 


PepJVI12B_prop 
ep 


Reprolysin family 
propeptide 


0.048 


-17.2 


1 


120-196 


1710 


TIG 


IPT/TIG domain 


1.4e-101 


350.8 


10 


620-705:708- 
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— 


788:793-876:883- 

i/oo. 1U0*+- 

1169:1174- 
nc/i. i 0^6- 

1336:1341- 
1 49^*1494- 
151 1-1516-1602 


1711 


HK 


Hydroxyethylthiazole 
kinase family 


c nn 


JJ.U 




12-39 


1711 


pp-binding 


Phosphopantetheine 
attachment site 


5.4e-09 


38.4 


i 


220-255 


1712 


IGFBP | 


Insulin-like growth factor 
binding pr 


2.1e-26 


97.4 




40-99 


1712 


thyreoglobulin 1 


Thyroglobulintype-1 repeat 


4.6e-05 


29.5 


i 


264-323 


1715 


rvt 


Reverse transcriptase 


7.8e-zo 






1 8-147 


1717 


A2M 


Alpha-2-macroglobulin 
family 


0 


1259. 

A 

4 




776-879:880- 

1111:1112- 

1 172-1243-1530 


1717 


A2MJN 


Alpha-2-macroglobulin 
family N-terminal regi 


rv 
U 


1 flQ9 
u 


-j 


75-684 


1717 


TCTP 


Translationally controlled 
tumor protein 


0.0013 


-47.0 


i 


1172-1249 


1718 


cadherin 


Cadherin domain 


ze-zy 


1 1 1 9 

ill .-£r 




68-159:334- 
425:439-543 


1722 


p450 


Cytochrome P450 




179 0 




270-506 


1725 




Immunoglobulin domain 


0.0079 


14.2 


i 


34-116 


1726 


iS 


Immunoglobulin domain 


A O^ f\Z 

4.ze-lD 


Zl.D 


— 


R7-1 ^1 -145-163 


1727 


Synaptophysin 


Synaptophysin/ 
synaptoporin 


o o« oon 

z.ze-z/y 


1 /D.4 




19-107 


1729 


PA 


PA domain 


o a ~ on 
2.4e-z9 


i in o 





r 9 1-696 


1729 


Glyco_hydro_47 


Glycosyl hydrolase family 
47 


6.2e-13 


Zo.4 






1731 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1 la £.K 

l.le-oD 


Z 1U.O 


9 


50-1 17-118-293 


1732 


is 


Immunoglobulin domain 


1 7o 1 C 

l. /e-io 


o*+.o 


o 


49-125-169-239 


1734 


p450 


Cytochrome P450 


1 a i a 
le-io 


61 8 
Ol.O 




50-106-162- 

\J l.\J\J ■* M.\J*~i 

228:298-425 


1735 


profilin 


Profilin 


1 Co 1 < 


ou.z> 


1 
1 


966-1071 


1735 


Transposase_22 


LI transposable element 


0.00026 


i no 7 

1UZ. / 


1 


731-1016 


1742 


PAS 


PAS domain 


3.2e-07 


31.9 


3 


360-413:474- 
507-581-613 


1744 


Dynein_heavy 


Dynein heavy chain 


9.8e-251 


846.4 


1 


34-781 


1745 


SCP 


SCP-like extracellular 
protein 


9.2e-36 


132.2 


1 


4-180 


1749 


lipocalin 


Lipocalin / cytosolic fatty- 
acid binding pr 


£ C~ AC 

6.5e-05 


o i n 


i 
i 


49-76 


1 1 

1 /D 1 


QTYE 
olJr 


SnHium •Hipat'bn'X'vlate 

kjUUiUlIl.Uiuai u\j/\.y icii.v> 

symporter family 


1.8e-47 


171.2 


2 


1-34:315-396 


1753 


ig 


Immunoglobulin domain 


9.4e-19 


65.5 


5 


49-124:162- 
' 219:261-310:345- 
400:432-496 


1753 


fh3 


Fibronectin type III domain 


0.00052 


26.8 


1 


514-606 


1757 


IPPc 


Inositol polyphosphate 
phosphatase family, c 


3e-68 


240.1 


3 


64-350:363- 
621:753-796 
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1762 


SSF 


Sodium: solute symporter 
family 


4.2e-256 


864.2 


1 


39-477 


1763 


LRR 


Leucine Rich Repeat 


3.4e-37 


137.0 


11 


61-84:85-108:109- 

132:133-156:157- 

180:181-204:205- 

OOO OCO TT/T OT7 

228:253-276:277- 
300:301-324:349- 
3/1 


1763 


ig 


Immunoglobulin domain 


9.1e-09 


33.3 


1 


428-486 


1763 


LRRCT 


Leucine rich repeat C- 
terminal domain 


3.1e-07 


37.5 




358-41 1 


1763 


LRRNT 


Leucine rich repeat N~ 
terminal domain 


0.00023 


27.9 


1 


30-59 


1764 


Reprolysin 


Reprolysin (M12B) family 
zinc metalloprote 


l.le-21 


85.5 


1 


156-359 


1764 


tsp_l 


Thrombospondin type 1 
domain 


2.5e-06 


34.5 




422-491 


1768 


ANF_receptor 


Receptor family ligand 
binding region 


L9e-34 


125.9 


1 


68-174 


1769 


C2 


C2 domain 


2.5e-66 


233.8 


2 


249-337:383-471 


1771 


cadherin 


Cadherin domain 


2e-80 


280.6 


5 


140-235:249- 
343:359-451:465- 
ddo.ou /-oy / 


1775 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


3.2e-34 


110.9 


2 


34-130:147-330 


1778 


synaptobrevin 


Synaptobrevin 


2.3e-15 


64.5 


1 


104-192 


1780 


MHCJ 


Class I Histocompatibility 
antigen, domains 


8.1e-93 


321.8 


1 


ic 1 no 

25-178 


1780 


ig 


Immunoglobulin domain 


9.9e-0o 


30.0 


1 


194-259 


1782 


ABC tran 


ABC transporter 


7.2e-19 


76.1 


1 


44o-ozo 


1783 


SNF 


S odium :neurotransmitter 
symporter iamily 


0 


1 1 

1182. 

c 

D 


1 


203-747 


1784 


TIG 


IPT/TIG domain 


4.6e-33 


123.3 


2 


1032-1119:1121- 
1206 


1784 


Plexin_repeat 


Plexin repeat 


4.1e-08 


40.4 


3 


653-706:806- 
858:953-1009 


1784 


Sema 


Sema domain 


2.8e-06 


26.6 


2 


59-160:576-602 


1785 


CTF NFI 


CTF/NF-I family 


1.6e-167 


570.0 


1 


651-824 


1786 


TIR 


TIR domain 


le-38 


132.4 


1 


791-931 


1786 


LRR 


Leucine Rich Repeat 


4.4e-37 


136.6 


15 


143-166:192- 
215:216-241:242- 
266:267-290:385- 
408:409-432:433- 

A t* f A CI ^ OA A Ci t 

456:457-480:481- 
502:545-569:570- 
Dyb.jyy-oZZ.oZj- 
643:645-668 


1 / OO 




Jueucine ncn repeal \^>- 
terminal domain 


/ ,jo- 1j 




i 
i 




1788 


RPEL 


RPEL repeat 


1.2e-17 


72.1 


2 


222-247:266-291 


1789 


ACR tran 


AcrB/AcrD/AcrF family 


1.8e-191 


649.5 


1 


1-452 


1792 


Sulfatase 


Sulfatase 


0.00013 


-44.3 


1 


502-912 


1792 


3 5 exonuclease 


3-5' exonuclease 


0.0041 


-14.7 


1 


764-923 


1794 


DUF81 


Domain of unknown 
function DUF81 


3.2e-49 


176.1 


1 


8-166 
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1796 


LRR 


Leucine Rich Repeat 


7.6e-09 


42. o 




314-337:338-361 


1796 


LRRCT 


Leucine rich repeat C- 
terminal domain 


1.9e-08 


At C 

41.5 


1 


371-421 


1796 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.0041 


20.8 


1 
l 


007-939 


1796 


HRM 


Hormone receptor domain 


0.00/O 




i 


D31-OUU 


1796 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


0.018 


101.1 


1 


935-1252 


1798 


secY 


eubacterial secY protein 


9.6e-62 


218.5 


1 


1-287 


1801 


PTR2 


POT family 


3.5e-38 


130.8 


2 


4-72:364-505 


1802 


asp 


Eukaryotic aspartyl protease 


5.1e-13 


41.6 


1 


133-471 


1802 


Dakl 


Dakl domain 


6.7e-06 


132.2 


1 


695-1003 


1803 


PMP22_Claudin 


PMP- 

22/EMP/MP20/Claudin 
family 


4.5e-21 


83.4 


1 


4-181 


1804 


HC03_cotransp 


HC03- transporter family 


1.3e-69 


244.7 


3 


39-52:108- 

1 /ZO. 1*71 1*>0 

loo:171-22o 


1805 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


5.1e-17 


70.0 


1 


1278-1339 


1805 


UCH-1 


Ubiquitin carboxyl-terminal 
hydrolases famil 


9.3e-08 


39.2 


1 


785-820 


1805 


zf-UBP 


Zn-finger in ubiquitin- 
hydrolases and other 


0.0049 


15.8 


1 


688-727 


1806 


Nucleoside tran 


Nucleoside transporter 


5.7e~42 


152.8 


1 


127-446 


1810 


sugarjr 


Sugar (and other) 
transporter 


0.044 


132.3 


1 


637-1048 


1813 


zf-C2H2 


Zinc finger, C2H2 type 


le-38 


142.1 


7 


270-292:298- 
320:414-437:440- 
463:485-507:513- 
535:771-793 


1815 


trypsin 


Trypsin 


4.6e-56 


179.2 


1 


28-341 


1817 


TauD 


Taurine catabolism 

di oxygenase TauD, TfdA fa 


1.7e-196 


666.2 


1 


107-385 


1818 


BPD_transp 


Binding-protein-dependent 
transport system 


0.0048 


23.3 


1 


129-194 


1819 


Y_phosphatase 


Protein-tyrosine 
phosphatase 


L3e-103 


357.6 


1 


1277-1494 


1819 


fii3 


Fibronectin type III domain 


1.2e-51 


185.1 


7 


461-541:552- 
634:641-719:730- 
812:819-900:908- 
990:998-1087 
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912 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


2.4e-54 


194.0 


1 


124-227 


912 


GST_C 


Glutathione S-transferase, 
C-terminal domain 


7.1e-21 


82.8 


1 


410-525 


912 


GST_N 


Glutathione S-transferase, 
N-terminal domain 


2e-15 


64.7 


1 


324-400 


913 


tsp_l 


Thrombospondin type 1 
domain 


2.9e-23 


90.7 


2 


124-174:181-231 


913 


EGF 


EGF-like domain 


1.6e-20 


81.6 


7 


474-508:514- 
553:559-591:597- 
633:692-720:802- 
836:842-882 


913 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3 


-75.8 


1 


523-649 


913 


granulin 


Granulin 


3.2 


-15.2 




484-510 


913 


TIL 


Trypsin Inhibitor like 
cysteine rich domai 


6.4 


-13.2 




540-597:785-842 


913 


cadherin 


Cadherin domain 


6.5 


-10.4 


1 


903-995 


914 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


4.5e-50 


179.8 


1 


2-101 


915 


MIF 


Macrophage migration 
inhibitory factor (MIF) 


6.1e-68 


239.1 


i 


2-129 


916 


trypsin 


Trypsin 


2.3e-91 


316.9 


i 


445-675 


916 


kringle 


Kringle domain 


4.2e-45 


163.3 




286-367 


916 


fn2 


Fibronectin type II domain 


4.1e-25 


96.9 




108-148 


916 


EGF 


EGF-like domain 


6.2e-15 


63.0 




164-197:245-278 


916 


fhl 


Fibronectin type I domain 


4.8e-10 


46.8 


: 


202-237 


916 


Keratin_B2 


Keratin, high sulfur B2 
protein 


5.3 


-79.6 




104-211 


916 


DUF32 


Domain of unknown 
function DUF32 


9.6 


-154.4 




445-556 


917 


laminin_EGF 


Laminin EGF-like 
(Domains III and V) 


9.8e-159 


540.7 


12 


282-346:349- 

409:412-469:472- 

520:523-565:786- 

831:834-877:880- 

925:928-984:987- 

1036:1039- 

1094:1095-1139 


917 


laminin_Nter 
m 


Laminin N-terminal 
(Domain VI) 


5.9e-105 


362.1 


1 


45-280 


917 


DUF26 


Domain of unknown 
function DUF26 


3.2 


-13.0 


1 


1110-1155 


917 


DUF232 


Putative transcriptional 
regulator 


3.2 


-24.4 


1 


1149-1281 


917 


LIM 


LIM domain 


3.8 


-16.6 




1125-1189 


917 


zf-CXXC 


CXXC zinc finger 


4.5 


-8.4 


-\ 


367-414 


917 


Methyltransf 
D12 


D12 class N6 adenine- 
specific DNA met 


5.3 


-88.8 




1404-1654 


917 


VSP 


Giardia variant-specific 
surface prot 


8.4 


-265.1 




864-1168 


917 


EB 


EB module 


9.5 


-12.4 




1069-1125 


918 


EGF 


EGF-like domain 


3.2e-53 


190.2 


9 


33-68:74-110:116- 
151:161-197:208- 
243:247-282:288- 
323:329-362:368- 
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404 


918 


CUB 


CUB domain 


2.9e-17 


70.8 




TAO 01*7 

/Uo-o I / 


918 


TIL 


Trypsin Inhibitor like 
cysteine rich domai 


1.3 


-5.0 




219-288 


918 


granulin 


Granulin 


2.4 


-14.4 






918 


wap 


WAP-type (Whey Acidic 
Protein) 'four-disul 


4.7 


-8.7 




6-71 


918 


KeratinJB2 


Keratin, high sulfur B2 
protein 


6.6 


-81.2 


■ 


95-247 


918 


VSP 


Giardia variant-specific 
surface protein 


7.7 


-264.1 


1 


95-403 


919 


Parathyroid 


Parathyroid hormone family 


3.1e-82 


286.6 




46-175 


920 


EGF 


EGF-like domain 


2.2e-22 


87.8 


9 


2-36:47-82:88- 

122:128-163:232- 

263:276-306:319- 

349:353-394:407- 

439 


921 


IL8 


Small cytokines 
(intecrine/chemokine), inter 


0.19 


-3.2 


1 


25-65 


922 


Zn_carbOpep 
t 


Zinc carboxypeptidase 


8.7e-120 


411.4 


1 


149-429 


922 


Propep_M14 


Carboxypeptidase activation 
peptide 


5.6e-20 


79.8 


1 


7-84 


923 


fibrinogen^ 


Fibrinogen beta and gamma 
chains, C-term 


9.5e-59 


208.6 


1 


62-263 


927 


FKBP 


FKBP-type peptidyl-prolyl 
cis-trans isomeras 


2.4e-41 


150.8 


1 


40-122 


930 


FAD- 
oxidase G, 


FAD linked oxidases, C- 
terminal domain 


4.1e-88 


306.2 


1 


one £1 AH 

395-647 


930 


FAD_binding 
4 


FAD binding domain 


5.2e-78 


272.6 


2 


24-172:219-381 


931 


UPF0136 


Uncharacterised protein 
family (UPF0136) 


0.00031 


17.6 


1 


12-75 


933 


Fringe 


Fringe-like 


1.4e-06 


-12.8 


1 


262-487 


933 


Galactosyl^ 


Galactosyltransferase 


2.6 


-84.8 


1 


282-439 


937 


14-3-3 


14-3-3 protein 


9.5e-124 


424.5 


1 


50-282 


938 


S_100 


S-100/ICaBP type calcium 
binding domain 


3.6e-23 


90.4 


1 


65-108 


938 


efhand 


EF hand 


0.0012 


25.6 


1 


114-142 


940 


ig 


Immunoglobulin domain 


2.9e-35 


130.6 


5 


38-105:139- 

206:241-293:326- 

377:412-487 


941 


LRR 


Leucine Rich Repeat 


2e-30 


114.5 


8 


42-63:64-87:88- 
112:113-136:137- 
160:161-184:185- 
208:209-232 


941 


LRRCT 


Leucine rich repeat C- 
Terminal uomam 


3.6e-07 


37.2 


1 


242-292 


941 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.46 


15.7 


1 


11-40 


944 


serpin 


Serpin (serine protease 
inhibitor) 


1.6e-133 


457.0 


1 


173-532 


947 


RUN 


RUN domain 


8e-44 


159.0 


1 


31-163 


947 


FYVE 


FYVE zinc finger 


3.5e-13 


57.2 


1 


529-598 


947 


Tektin 


Tektin family 


1.9 


-229.2 


1 


154-473 1 
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947 


filament 


Intermediate filament 
protein 


4.7 


-202.1 


i 


216-501 


947 


PAPSjreduct 


Phosphoadenosine 
phosphosulfate reductase 


4.9 


-69.1 


i 


305-467 


947 


metalthio 


Metallothionein 


7.3 


-11.4 


i 


576-635 


950 


lectin c 


Lectin C-type domain 


2.4e-08 


41.1 


1 


1 3U-Z34 


951 


lectin c 


Lectin C-type domain 


8 


-36.4 


1 


ZlJ-Zol 


951 


FAD- 
oxidase C 


FAD linked oxidases, C- 
terminal domain ! 


9.7 


-84.1 


1 




oU-zo / 


952 


Glyco transf_ 
8 


Glycosyl transferase family 
8 


0.01 


-47.7 




IZjj-IjZO 


952 


Glyco hydro 
20 


Glycosyl hydrolase family 
20, catalyti 


9.8 


-224.0 





Dy 1-0 /4 


953 


TPR 


TPR Domain 


0.0042 


23.8 




171-204 


954 


DENN 


DENN (AEX-3) domain 


4.2 


-67.5 




1 CL 1 AH 

10-14Z 


956 


Branch 


Core-2/I-Branching enzyme 


2.2e-77 


270.5 


i 


46-310 


957 


m3 


Fibronectin type III domain 


l.le-15 


65.6 




30-108 


958 


Peptidase_Ml 
0 


Matrixin 


5.9e-121 


415.2 


i 


58-225 


958 


hemopexin 


Hemopexin 


2e-58 


207.5 


4 


305-347:349- 

393:398-445:447- 

4o7 


958 


Astacin 


Astacin (Peptidase family 
M12A) 


0.28 


-108.1 


1 


lzo-zoj 


958 


PG binding_ 
1 


Putative peptidoglycan 
binding domain 


2.4 


-9.5 


i 


44-112 


960 


Robl_LC7 


Roadblock/LC7 domain 


1.8e-30 


114.7 




C/Z 1/17 

JO- 14/ 


962 


chromo 


'chromo' (CHRromatin 
Organization Modifier) 


2.7e-05 


31.1 


i 


47-84 


962 


SET 


SET domain 


1.2 


-48.2 


i 


243-307 


964 


iff 


Immunoglobulin domain 


0.0047 


23.6 




O A 1 1 < 

34-1 1 o 


967 


7trn_l 


7 transmembrane receptor 
(rhodopsin family) 


3.5e-13 


57.2 


i 


41-237 


968 


adh zinc 


Zinc-binding dehydrogenase 


2.4e-61 


217.2 


— 


1*7/1 AQ<\ 


968 


DUF142 


Domain of unknown 
function DUF142 


6.6 


-67.7 


1 


Z4o-33Z 


970 


El_dehydrog 


Dehydrogenase El 
component 


2.1e-183 


622.7 


i 


129-425 


970 


transketolase 


Transketolase, thiamine 
diphosphate bin 


2.6 


-227.4 


i 


173-410 


971 


ig 


Immunoglobulin domain 


5.8e-09 


43.2 


i 


45-122 


973 


zf-MIZ 


MIZ zinc finger 


3.2e-32 


120.5 




279-331 


973 


SAP 


SAP domain 


2.4e-05 


31.2 


i 


2-36 


973 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


5.7 


-6.3 


1 


283-325 


975 


WD40 


WD domain, G-beta repeat 


4.3e-09 


43.6 




Zj4-Z /U.440- 
4o3.4yj.-DZo 


r\T~I 

977 


An__p eroxidas 
e 




6.9e-137 


468.2 


-j 


137-701 


978 


Defensin_pro 
pep 


Defensin propeptide 


3.8e-19 


77.0 




32-82 


978 


defensins 


Mammalian defensin 


0.0015 


25.3 




95-123 


981 


ig 


Immunoglobulin domain 


0.00022 


28.0 




49-130 


982 


ig 


Immunoglobulin domain 


5.8e-07 


36.6 




36-112 


983 


Acetyltransf 


Acetyltransferase (GNAT) 


0.22 


15.5 




260-334 
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family 










985 


tsp_l 


Thrombospondin type 1 
domain 


0.0075 


17.1 


1 


151-206 


985 


Furin-like 


Furin-like cysteine rich 
region 


5.1 


-94.5 


1 


37-149 


985 


TNFR_c6 


TNFR/NGFR cysteine-rich 
region 


9.6 


-0.7 


1 


114-148 


987 


LRR 


Leucine Rich Repeat 


5.5e-59 


209.4 


12 


78-101:102- 
125: 126-149:150- 
173:174-197:198- 
22 1 :222-245 :24o- 
269:270-293 : 294- 

on.-iio 1A1-1AO 

31 /:3 15-341.34Z- 

365 


987 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.23 


18.0 


1 


Z4-3Z 


987 


Keratin_B2 


Keratin, high sulfur B2 
protein 


6.1 


-80.6 


1 


11 A AAO 

334-44/ 


988 


ParA 


ParA family ATPase 


0.029 


0.0 


1 


177-283 


988 


APS kinase 


Adenylylsulfate kinase 


0.92 


-112.5 


1 


102-226 


988 


ArsA_ATPas 
e 


Anion-transporting ATPase 


0.97 


-180.2 


1 


103-336 


989 


vwd 


von Willebrand factor type 
D domain 


l.le-38 


141.9 


1 


364-514 


989 


vwc 


von Willebrand factor type 
C domain 


5.3e-35 


129.7 


5 


c r\ 1 nr. 1 AO 

50-105:108- 
163: 166-ZZ4:Z3o- 
289:301-357 


989 


TIL 


Trypsin Inhibitor like 
cysteine rich 


8.9e-06 


32.6 


1 


629-6 oZ 


989 


Keratin_B2 


Keratin, high sulfur B2 
protein 


1 


-68.4 


1 


1£A 1QQ 

Zo4-3yo 


989 


TILa 


TILa domain 


1.4 


-2.7 


1 


1A1 1C1 

3U1-303 


989 


fhl 


Fibronectin type I domain 


2.3 


4.7 


1 


166-198 


989 


Metallothio P 
EC 


Plant PfiC family 
metallothionein 


4.8 


-38.2 


1 


con cc 1 

5&7-651 


990 


Cys_knot 


Cystine-knot domain 


3.2 


-31.3 


1 


007 1M 

ZZ /-3 Id 


992 


ig 


Immunoglobulin domain 


2.9e-13 


57.5 


2 


34-111:150-220 


994 


ig 


Immunoglobulin domain 


1.2e-26 


102.0 


3 


43-1 1U. 143- 
197:230-291 


995 


SCP 


SCP-like extracellular 
protein 


5e-41 


149.7 


1 


A 1 H 

4-1 /3 


995 


granulin 


Granulin 


6.9 


-17.5 


1 


i /u-iyy 


996 


ig 


Immunoglobulin domain 


0.014 


22.0 


1 


LJD-ZVO 


999 


RNase_PH 


3' exoribonuclease family, 
domain 1 


6.1e-23 


89.6 


1 


272-408 


999 


RNase_PH_C 


3' exoribonuclease family, 
domain 2 


1.4e-13 


58.5 


2 


96-157:392-469 


999 


PNPase 


Polyribonucleotide 
nucleotidyltransferase 


O 1 a. f»< 

o.ie-Uj 


ZD.y 


i 
i 


189-269 


1000 


ig 


Immunoglobulin domain 


0.00054 


26.7 


1 


38-102 


1002 


zf-C2H2 


Zinc finger, C2H2 type 


7.6e-54 


192.3 


7 


183-205:211- 
233:239-261:267- 
289:295-317:323- 
345:351-373 


1002 


zf-BED 


BED zinc finger 


3.9 


-3.7 


1 


224-262 



WO 03/054152 



PCT/US02/39555 



Table 4B 
472 



SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


1002 


LIM 


LIM domain 


7.8 


-19.2 


1 


185-249 


1003 


zf-C2H2 


Zinc finger, C2H2 type 


2.3e-114 


393.3 


16 


260-282:297- 

319:325-347:353- 

375:381-403:409- 

431:437-459:465- 

487:493-515:521- 

543:549-571:577- 

t: c\c\ . rnc /TOT, /CQ 1 

oDr>:ooi-uoJ.ooy- 
711 


1003 1 


zf-BED 


BED zinc finger 


0.41 


5.0 


1 


A(\A 


1003 


TFIIS 


Transcription factor S-II 
(TFIIS) 


0.75 


2.4 


2 


353-391:468-503 


1003 


GATA 


GATA zinc finger 


1.8 


-6.6 


1 


AO C /I O 1 


1003 


PHD 


PHD-fmger 


5.5 


-16.7 


1 


326-389 


1003 


E6 


Early Protein (E6) 


6.9 


-67.7 


1 


467-554 


1003 


Bowman- 
Birkjeg 


Bowman-Birk serine 
protease inhibitor 


8 


-18.8 


1 


355-411 


1007 


LIM 


LIM domain 


1.3e-38 


141.7 


3 


390-448:450- 
507:510-576 


1007 


E7 


E7 protein, Early protein i 


0.7 


-39.5 


1 


235-422 


1008 


Transposase 
8 


Transposase 


2.9 


-12.7 


1 


53-148 


1010 


zf-C2H2 


Zinc finger, C2H2 type 


4.8e-06 


33.5 


3 


2-24:29-52:154- 
177 


1010 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


7.1 


-7.0 


1 


4-34 


1011 


60s ribosoma 
1 


60s Acidic ribosomal 
protein 


1.2e-20 


82.1 . 


1 


1-89 


1012 


abhydrolase_ 
2 


Phospholipase/Carboxyleste 
rase 


1.3e-24 


95.2 


1 


9-213 


1016 


Anti_prolifera 
t 


BTG1 family 


1.7e-73 


257.6 


1 


11-253 


1017 


zf-C2H2 


Zinc finger, C2H2 type 


9.8e-71 


248.4 


11 


147-169:213- 
231:269-291:325- 
347:353-375:381- 
4U3 :4Uy-43 1 .43 /- 
459:465-487:493- 

c 1 c . en i ca o 


1017 


KRAB 


KRAB box 


2.6e-26 


100.8 


1 


42-82 


1017 


DM-domain 


DM DNA binding domain 


0.26 


0.8 


1 


AC\1 CIO 


1017 


zf-BED 


BED zinc finger 


1.8 


-0.6 


2 


338-376:394-432 


1017 


LIM 


LIM domain 


3.4 


-16.2 


1 


495-549 


1017 


zf-TRAF 


TRAF-type zinc finger 


5 


-6.6 


1 


427-480 


1017 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


9.5 


-8.1 


1 


177-218 


1017 


Bowman- 
Birk leg 


Bowman-Birk serine 
protease inhibitor 


9.6 


-19.6 


1 


147-203 


1020 


LIM 


LIM domain 


2.3e-34 


127.6 


4 


78-136:139- 

197:200-256:259- 

315 


1020 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


3.3 


-4.2 


1 


106-142 


1020 


TNFR_c6 


TNFR/NGFR cysteine-rich 
region 


8.2 


-0.1 


1 


89-127 
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1024 


zf-C2H2 


Zinc finger, C2H2 type 


4.3e-53 


189.8 


10 


156-183:184- 

206:212-234:240- 

262:268-290:296- 

318:324-346:352- 

374:380-402:408- 

430 


1024 


KRAB 


KRAB box 


2.7e-26 


100.8 


1 


15-55 


1024 


TFIIS 


Transcription factor S-II 
(TFIIS) 


2.5 


-2.0 


1 


129-166 


1024 


zf-BED 


BED zinc finger 


3.9 


-3.8 


1 


225-263 


1024 


fer4 


4Fe-4S binding domain 


8.3 


-3.2 


1 


176-199 


1025 


RAG2 


Recombination activating 
protein 2 


0 


1380.2 


1 


1-527 


1025 


PHD 


PHD-fmger 


2.9 


-14.2 


1 


419-483 


1026 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


l.le-06 


35.6 


1 


16-54 


1029 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.8e-06 


16.4 


2 


5-149:150-281 


1031 


pro_isomeras 
e 


Cyclophilin type peptidyl- 
prolyl cis-tr 


4.1e-107 


369.3 


1 


5-165 


1035 ! 


zf-C2H2 


Zinc finger, C2H2 type 


1.2e-113 


391.0 


17 


100-122:132- 

154:160-182:188- 

210:216-238:244- 

266:272-294:300- 

322:328-350:356- 

378:384-406:412- 

434:440-462:468- 

490:496-518:524- 

546:552-574 


1035 


KRAB 


KRAB box 


7.1e-24 


92.8 


1 


4-44 


1035 


zf-BED 


BED zinc finger 


0.25 


6.9 


2 


78-123:537-575 


1035 


LIM 


LIM domain 


1.5 


-13.2 




330-394 


1035 


TFIIS 


Transcription factor S-II 
(TFIIS) 


2.4 


-1.9 


1 


244-282 


1035 


zf-MIZ 


MIZ zinc finger 


8.5 


-20.9 


\ 


457-507 


1038 


Sulfatejxans 
P 


Sulfate transporter family 


4.3e-103 


355.9 




2-284:441-751 


1038 


STAS 


STAS domain 


4.8e-20 


80.0 


; 


774-987 


1038 


BenE 


Benzoate membrane 
transport protein 


1.3 


-317.8 




422-745 


1038 


UPF0118 


Domain of unknown 
function DUF20 


4.9 


-125.1 




454-754 


1038 


Transp__cyt_p 
ur 


Permease for 
cytosine/purines, uracil 


5.3 


-195.7 


- x 


403-760 


1038 


voltage_CLC 


Voltage gated chloride 
channel 


5.5 


-238.5 


i 


378-680 


1038 


FecCD 


FecCD transport family 


7.9 


-219.3 


i 


440-686 


1039 


ubiquitin 


Ubiquitin family 


1.9e-08 


41.5 




ZO-1ZO 


1042 


ig 


Immunoglobulin domain 


3.1e-30 


113.9 




62-129:163- 

229:264-316:349- 

400:433-501 


1043 


UPAR LY6 


u-PAR/Ly-6 domain 


7.7e-31 


115.9 




88-158 


1044 


7tm 5 


7TM chemoreceptor 


0.17 


-139.3 




29-293 


1044 


sugarjx 


Sugar (and other) 
transporter , 


0.27 


-153.3 




1-419 
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1044 


Competence 


Competence protein 


1.8 


-92.8 




10-207 


1044 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


2.6 


-160.0 


i 


185-410 


1044 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


6.6 


-392.2 


1 


188-508 


1044 


COX1 


Cytochrome C and Quinol 
oxidase polypepti 


7.4 


-311.2 


i 


1-396 


1045 


Lipase_chap 


Proteobacterial lipase 
chaperone prot 


3.1 


-124.4 


1 


95-336 


1045 


Cys_MetJVle 
ta PP 


Cys/Met metabolism PLP- 
dependent enzy 


8.2 


-317.8 


1 


40-279 


1046 


PAP2 


PAP2 superfamily 


7.7 


-31.4 




88-175 


1047 


MHC_I 


Class I Histocompatibility 
antigen, domains 


7.9e-148 


504.5 


i 


25-203 


1047 


ig 


Immunoglobulin domain 


0.79 


16.0 


i 


220-285 


1050 


ig 


Immunoglobulin domain 


8.7e-07 


36.0 




38-110 


1051 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


0.0019 


-4.7 


1 


4-194 


1052 


GASA 


Gibberellin regulated 
protein 


2.6 


-47.4 


1 


1-69 


1054 


SRCR 


Scavenger receptor 
cysteine-rich domain 


6.2e~25 


96.3 


1 


722-820 


1055 


P2X_receptor 


ATP P2X receptor 


S.7e-303 


1019.3 


4 — 


13-388 


1055 


Metallothio 5 


Metallothionein family 5 


7.1 


-0.8 




124-161 


1057 


zona_pellucid 
a 


Zona pellucida-like domain 


3.1e-81 


283.3 


1 


268-538 


1057 


trefoil 


Trefoil (P-type) domain 


0.02 


9.1 




224-262 


1058 


Aa_trans 


Transmembrane amino acid 
transporter prot 


2.4e-22 


87.7 




4-388 


1058 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


2.6 


-159.9 


i 


120-362 


1058 


DUF286 


Caenorhabditis protein of 
unknown functio 


2.7 


-94.1 


1 


83-246 


1058 


PUCC 


PUCC protein 


8.4 


-281.4 




83-365 


1058 


Competence 


Competence protein 


8.4 


-105.8 




99-341 


1059 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.5e-05 


10.6 




33-401 


1060 


Na_sulph_sy 
mp 


Sodium: sulfate symporter 
transmembran 


4.5e-143 


488.7 


1 


1-554 


1060 


Na_H_antipor 
ter 


Na+/H+ antiporter family 


5.8 


-114.4 


1 


212-546 


1060 


LrgA 


LrgA family 


7.6 


-55.7 


i 


37-141 


1061 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


9.5 


-171.4 




129-355 


1063 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 




29-204 


1064 


ERG4 ERG2 
4 


Ergosterol biosynthesis 
ERG4/ERG24 family 


l.le-103 


357.8 




7-350 


1065 


Sulfatase 


Sulfatase 


0.28 


-144.2 




66-431 


1065 


CRCB 


CrcB-like protein 


2.6 


-51.2 




728-827 


1065 


Phosphodiest 


Type I phosphodiesterase / 


2.7 


-167.6 




43-368 
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nucleotide p 










1065 


Cad 


Cadmium resistance 
transporter 


7.5 


-136.9 


1 


a t n Cite 
412-30D 


1065 


Metalloenzy 
me 


Metalloenzyme superfamily 


9.3 


-43.8 


1 


i on inn 

189-308 


1066 


ig 


Immunoglobulin domain 


1.2e-15 


65.4 




42-95:135- 
192:231-2ob 


1067 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.5e-28 


108.3 


1 


28-266 


1068 


UPF0136 


Uncharacterised protein 
family (UPF0136) 


4e-45 


163.4 


1 


12-108 


1069 


pkinase 


Protein kinase domain 


1.6e-65 


231.1 




298-580 


1069 


Activin_recp 


Activin types I and II 
receptor domain 


6.2e-28 


106.2 


i 


20-107 


1069 


toxin 


Snake toxin 


2.8 


-19.3 


i 


33-101 


1069 


UPAR LY6 


u-PAR/Ly-6 domain 


4 


-19.7 


1 


47-101 


1070 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


0.00018 


13.2 




3-177 


1070 


UPF0136 


Uncharacterised protein 
family 


3.6 


-32.2 


1 


67-164 


1071 


MHCJ 


Class I Histocompatibility 
antigen, domains 


1.2e-12 


55.5 


1 


24-202 


1071 


ig 


Immunoglobulin domain 


3.6e-05 


30.6 




2 18-284 


1072 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


3.1 


-60.4 


i 


36-207 


1073 


sugar_tr 


Sugar (and other) 
transporter 


0.032 


-126.5 


1 


48-528 


1073 


PUCC 


PUCC protein 


4.5 


-274.8 




ion /iio 


1073 


Competence 


Competence protein 


6.5 


-103.7 


1 


115-316 


1073 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


9.6 


-276.3 




72-523 


1074 


sugarjr 


Sugar (and other) 
transporter 


0.032 


-126.5 


1 


136-616 


1074 


PUCC 


PUCC protein 


4.5 


-274.8 


1 


218-520 


1074 


Competence 


Competence protein 


6.5 


-103.7 


1 


203-404 


1074 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


9.6 


-276.3 




160-611 


1075 


ank 


Ankyrin repeat 


L7e-45 


164.6 


6 


31-63:64-96:97- 

129:130-162:163- 

195:196-228 


1076 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.7e-05 


-2.6 


1 


1 1 8-374 


1076 


DUF216 


Domain of unknown 
function DUF 


1.6 


-42.6 


1 


193-368 


1077 


sugar_tr 


Sugar (and other) 
transporter 


0.0039 


-100.2 


1 


46-470 


1077 


OATP„C 


Organic Anion Transporter 
Polypeptide (OATP) 


0.018 


-180.1 


1 


64-361 


1077 


PUCC 


PUCC protein 


6.2 


-278.1 


1 


85-480 


1077 


PAP2 


PAP2 superfamily 


7.9 


-31.5 


1 


353-476 


1078 


UvrD- 
helicase 


UvrD/REP helicase 


7.1 


-248.3 


1 


299-774 


1078 


DUF246 


Plant protein family 


9.1 


-257.7 


1 


584-865 
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1079 


G_glu_transp 
ept 


Gamma- 

glutamyltranspeptidase 


3.2e-05 


-146.3 


1 


122-499 


1079 


DLH 


Dienelactone hydrolase 
family 


2.5 


-92.8 


1 


189-376 


1079 


DUF312 


Short repeats of unknown 
function 


33 


0.8 


1 


415-453 


1080 


TPR 


TPR Domain 


2e-19 


78.0 


5 


28-61:68-101:108- 

141:148-181:188- 

221 


1081 


ACAT 


Sterol O-acyltransferase 


1.9e-32 


121.3 


1 


300-406 


1082 


WD40 


WD domain, G-beta repeat 


2.5e-20 


81.0 


7 


1015-1050:1059- 

1097:1115- 

1151:1158- 

1194:1203- 

1240:1246- 

1 oo 1 . 1 ooo 1 ion 

1281:1293-1329 


1082 


V1R 


Vomeronasal organ 
pheromone receptor family 


3.2 


-162.0 


1 


421-676 


1083 


pkinase 


Protein kinase domain 


1.4e-54 


194.8 


1 


248-Doo 


1083 


Activin_recp 


Activin types I and II 
receptor domain 


5.8e-38 


139.6 


1 


26-127 


1083 


UPAR LY6 


u-PAR/Ly-6 domain 


7.9 


-22.8 


i 


48-121 


1084 


pkinase 


Protein kinase domain 


1.4e-54 


194.8 




310-630 


1084 


Activin_recp 


Activin types I and II 
receptor domain 


5.8e-38 


139.6 


i 


26-127 


1084 


UPAR LY6 


u-PAR/Ly-6 domain 


7.9 


-22.8 


i 


48-121 


1086 


C2 


C2 domain 


1.7e-06 


35.0 




233-316 


1087 


Cache 


Cache domain 


2.7e-24 


94.2 


"i 


557-650 


1087 


vwa 


von Willebrand factor type 
A domain 


0.13 


-30.6 


1 


361-537 


1088 


Glyco hydro 
31 


Glycosyl hydrolases family 
31 


1.6e-224 


759.3 


1 


1-636 


1090 


SSrecog 


Structure-specific 
recognition protein 


2.7e-153 


522.7 


1 


343-555 


1090 


HMG_box 


HMG (high mobility group) 
box 


3.6e-32 


120.3 




681-749 


1090 


DUF304 


Bacterial membrane flanked 
domain 


4.8 


-12.0 




144-223 


1091 


serpin 


Serpin (serine protease 
inhibitor) 


8.1e-196 


663.9 


1 


315-683 


1092 


trypsin 


Trypsin 


0.00023 


-4.5 


1 


334-526 


1093 


Competence 


Competence protein 


9.1 


-106.5 




9-260 


1094 


Aajxans 


Transmembrane amino acid 
transporter pro 


1.5e-07 


4.2 


1 


115-561 


1094 


TrpJTyrjper 
m 


Tryptophan/tyrosine 
permease family 


2.7 


-283.0 


1 


118-558 


1094 


oxidored_ql 


NADH- 

u Diquinone/piasioquinone 


8.1 


-170.0 


1 


117-338 


1097 


ATP-synt C 


ATP synthase subunit C 


3.3e-18 


73.9 




72-140 


1098 


aconitase 


Aconitase family (aconitate 
hydratase) 


1.9e-177 


602.9 




162-744 


1098 


Aconitase_C 


Aconitase C-terminal 
domain 


8.9e-72 


251.9 




872-1043 


1103 


PAP2 


PAP2 superfamily 


6.3e-15 


63.0 




89-236 


1104 


ig 


Immunoglobulin domain 


1.3e-18 


75.3 


2 


80-148:183-251 
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1106 


TCTP 


Translationally controlled 
tumor protein 


3.5e-93 


323.0 


1 


1 1 cc 

l-loo 


1109 


efhand 


EF hand 


1.3e-13 


58.7 


3 


33-61:102- 
130:1 Jo-loo 


1110 


ATP1G1 PL 
M MAT8 


ATP 1 G1/PLM/MAT8 
family 


1.8e-13 


58.2 




no 1 a a 


1112 


ldl_recept_a 


Low-density lipoprotein 
receptor domain 


0.00073 


26.3 




1 1 C 1 CI 

115-153 


1112 


CUB 


CUB domain 


0.0032 


-3.1 




9-109 


1115 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.8e-05 


9.0 


1 


139-349 


1116 


7tm 1 


7 transmembrane receptor 


7.5e-21 


82.7 




114-363 


1116 


oxidored q5 
N 


NADH-ubiquinone 
oxidoreductase chain 4 


4.2 


-55.0 


■ 


213-282 


1116 


Abi 


CAAX amino terminal 
protease family 


9.4 


-28.2 




171-254 


1116 


PhaGJMnhG 
YufB 


Na+/H+ antiporter subunit 


9.8 


-57.7 




281-376 


1119 


tsp_l 


Thrombospondin type 1 
domain 


4.5e-38 


139.9 


10 


149-198:306- 

364:571-626:631- 

696:707-761:841- 

i 

889:970- 
1021:1099- 
1148:1219- 
1269:1342-1398 


1120 


HMG14 17 


HMGUand HMG17 


1.3e-34 


128.4 


1 


2-86 


1124 


lectin c 


Lectin C-type domain 


5.1 


-34.2 


1 


216-281 


1125 


ig 


Immunoglobulin domain 


2.1e-17 


71.3 


3 


84-153:185- 
255:292-347 


1136 


sugar_tr 


Sugar (and other) 
transporter 


0.029 


-125.4 


1 


23-504 


1136 


OATP__C 


Organic Anion Transporter 
Polypeptide (OATP) 


2.8 


-225.4 


1 


116-381 


1136 


PUCC 


PUCC protein 


5.8 


-277.4 


1 


83-476 


1136 


Frizzled 


Frizzled/Smoothened family 
membrane region 


8.3 


-247.2 


1 


54-264 


1137 


ATP-synt_C 


ATP synthase subunit C 


3.3e-35 


130.4 


2 


14-79:9U-lDD 


1138 


DHDPS 


Dihydrodipicolinate 
synthetase family 


7.2e-31 


116.0 


1 


34-325 


1139 


SSF 


Sodium: solute symporter 
family 


1.5e-48 


174.7 


2 


jU-4o l .DOy-yJJ 


1139 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


7.6 


-393.3 


1 


62-347 


1139 


Competence 


Competence protein 


8.9 


-106.3 


1 
1 


04-Zo / 


1140 


Cdc37 


Cdc37 family 


5.4 


-122.6 


1 


317-539 


1143 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.3e-44 


161.6 


1 


78-332 


1144 


Peptidase Ao 


bignal peptidase (prasej n 


7 9 


-70.3 


1 


117-211 


1145 


PID 


Phosphotyrosine interaction 
domain (PTB/PID) 


1.7e-99 


343.9 


2 


488-627:661-782 


1145 


WW 


WW domain 


1.7e-08 


41.7 


1 


373-401 


1145 


KdpC 


K+-transporting ATPase, c 
chain 


3.6 


-97.4 


1 


60-245 


1146 


PRA1 


Prenylated rab acceptor 
(PRA1) 


0.0071 


-32.6 


1 


1-153 
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1147 


DUF214 


Predicted permease 


9.5 


-68.5 


1 


n i c i 


1148 


C2 


C2 domain 


2.1e-30 


114.4 


2 


87-165:240-320 


1149 


Reticulon 


Reticulon 


0.29 


-68.6 


1 


o5-2oo 


1150 


Reticulon 


Reticulon 


1.8 


-81.1 


1 


79-266 


1152 


7tm 1 


7 transmembrane receptor 


2.8e-27 


104.1 


1 


1 /T 1 A 1 A 

161-410 


1152 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


5.3 


-166.1 


1 


90-329 


1153 


WD40 


WD domain, G-beta repeat 


9.8e-06 


32.5 


4 


26-62:73-109:236- 
271:430-467 


1155 


aa permeases 


Amino acid permease 


5.6e-06 


-179.9 


1 


193-613 


1155 


7tm 5 


7TM chemoreceptor 


1.6 


-156.9 


1 


199-448 


1155 


oxidorecLql 


NADH- 

Ubiquinone/plastoquinone 


3.7 


-162.9 


1 


306-552 


1155 


Spore_perme 
ase 


Spore germination protein 


5.6 


-193.9 


1 


180-533 


1155 


Aajrans 


Transmembrane amino acid 
transporter p 


6.4 


-205.1 


1 


179-589 


1155 


BPD_transp_ 
2 


Branched-chain amino acid 
transport sy 


6.5 


-123.5 


1 


An in 
47-32 / 


1157 


PRA1 


Prenylated rab acceptor 
(PRA1) 


l.le-46 


168.5 


1 


23-149 


1158 


dUTPase 


dUTPase 


5.1e-29 


109.8 


1 


46-167 


1158 


rvp 


Retroviral aspartyl protease 


4.1e-22 


86.9 


1 


179-280 


1158 


G-patch 


G-patch domain 


0.0011 


25.7 


1 


285-329 


1158 


MIP 


Major intrinsic protein 


0.048 


-83.0 


1 


1-111 


1159 


synaptobrevin 


Synaptobrevin 


0.49 


-3.1 


1 


96-184 


1160 


laminin__EGF 


Laminin EGF-like 
(Domains III and V) 


9.5e-132 


451.1 


14 : 


299-353:356- 

423:426-466:490- 

532:535-578:584- 

627:630-680:683- 

727:1265- 

1308:1311- 

1352:1355- 

1401:1404- 

1452:1686- 

1730:1733-17o6 


1160 


laminin_Nter 
m 


Laminin N-terminal 
(Domain VI) 


4e-77 


269.6 


i 


47-297 


1160 


laminin_G 


Laminin G domain 


6e-31 


116.2 


3 


2625-2748:3015- 
3138:3185-3313 


1160 


laminin B 


Laminin B (Domain IV) 


l.le-09 


45.6 




1517-1651 


1160 


MCPsignal 


Methyl-accepting 
chemotaxis protein 


0.61 


-119.3 


1 


20H-2237 


1160 


TSPN 


Thrombospondin N- 
terminal -like domain 


1.7 


-47.9 


1 


3143-3311 


1160 


filament 


Intermediate filament 
protein 


2.2 


-193.4 


i 


1826-2081 


1160 


DSL 


Delta serrate ligand 


3.3 


-17.5 




472-519 


1160 


OEP 


Outer membrane efflux 
protein 


3.7 


-29.3 




2020-2211 


1160 


EB 


EB module 


6.5 


-10.6 




665-713 


1160 


Apidaecin 


Apidaecin 


7 


7.3 




907-934 


1160 


Acetatejtinas 
e 


Acetokinase family 


8.3 


-242.2 




2093-2412 



WO 03/054152 PCT/US02/39555 



Table 4B 
479 



SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


1160 


spectrin 


Spectrin repeat 


8.4 


-20.0 


1 


210j~22k)j 


1161 


EGF 


EGF-like domain 


2.2e-26 


101.1 


5 




2y-j /:oU-oo. 1U4- 
140:147-178:185- 

210 


1161 


EB 


EB module 


1.7 


-4.2 




i a 1 1 70 
Id 1-1 /o 


1161 


metalthio 


Metallothionein 


7.7 


-11.7 


i 


64-133 


1162 


interferon 


Interferon alpha/beta 
domain 


1.5e-17 


71.7 




16-171 


1163 


IL1 


Interleukin-1 / 18 


8.7e-21 


82.5 




in i cr\ 

iy-i5y 


1164 


PDGF ! 


Platelet-derived growth 
factor (PDGF) 


1.8e-51 


184.4 


i 


52-130 


1165 


IL1 


Interleukin-1 / 18 


3.7e-23 


90.4 




11-144 


1167 


Palm_thioest 


Palmitoyl protein 
thioesterase 


2.4e-216 


732.1 


i 


28-282 


1168 


serpin 


Serpin (serine protease 
inhibitor) 


1.6e-202 


686.2 


i 


47-415 


1170 


Defensh\_pro 
pep 


Defensin propeptide 


1.2e-26 


102.0 


i 


38-90 


1170 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


i 


103-131 


1172 


Y_phosphatas 
e 


Protein-tyrosine 
phosphatase 


1.8e-110 


380.4 


1 


271-499 


1179 


Metallophos 


Calcineurin-like 
phosphoesterase 


l.le-11 


52.3 


i 


70-285 


1184 


MHCJ 


Class I Histocompatibility 
antigen, domains 


6.2e-06 


-5.9 


i 


29-205 


1187 


PLA2_B 


Lysophospholipase catalytic 
domain 


8.3e-53 


188.9 


1 


i-y t-*J OAA 

357-800 


1187 


C2 


C2 domain 


1.6e-06 


35.1 


i 


46-129 


1189 


lipocalin 


Lipocalin / cytosolic fatty- 
acid binding 


2.6e-39 


144.0 


1 


O O 1 o o 


1189 


Kunitz_BPTI 


Kunitz/Bovine pancreatic 
trypsin inhibito 


4.7e-23 
* 


90.0 





in no.oT/i 1 1 n 
25Y-2I5. 2/H-DLy 


1190 


ig 


Immunoglobulin domain 


4e-07 


37.1 




152 1 1 6 
35-110 


1191 


ig 


Immunoglobulin domain 


4e-07 


37.1 


1 


38-116 


1196 


UPF0185 


Uncharacterised protein 
family (UPF01 85) 


3.6e-41 


150.2 




1 < 1 no 


1200 


zf-DHHC 


DHHC zinc finger domain 


3.7e-15 


63.8 




87-149 


1201 


KRAB 


KRAB box 


6.6e-22 


86.2 




33-73 


1202 


KRAB 


KRAB box 


6.6e-22 


86.2 




33-73 


1204 


trypsin 


Trypsin 


3e-31 


117.2 




56-239 


1208 


lipocalin 


Lipocalin / cytosolic fatty- 
acid binding pr 


0.12 


9.2 


"1 


38-186 


1212 


LRR 


Leucine Rich Repeat 


2.5e-54 


193.9 


15 


73-96:97-122:123- 

149:150-175:176- 

195:197-220:221- 

246:247-270:292- 

317:318-337:339- 

362-363-388*389- 

408:410-433:434- 

459 


1212 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.24 


17.9 


1 


43-71 


1212 


SCAN 


SCAN domain 


7.2 


-38.8 


1 


88-187 


1213 


Zn_carbOpep 
t 


Zinc carboxypeptidase 


1.9e-61 


217.6 


1 


50-285 
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1214 


ig 


Immunoglobulin domain 


5.5 


Ci 1 

y. i 


— 


l /-OH 


1216 


death 


Death domain 


2.4 


-0.1 




qq i on 


1217 


death 


Death domain 


1.5 


2.0 


i 


73-160 


1221 


Clq 


Clq domain 


3.4e-41 


150.3 


1 


1 1A KQ 

134-2DO 


1221 


Collagen 


Collagen triple helix repeat 
(20 copies) 


2.6e-10 


47.7 




A 1 1 AA 

41-100 


1223 


pldnase 


Protein kinase domain 


9e-75 


261.8 


1 


ICO A"\H 

158-41 / 


1223 


SH3 


SH3 domain 


0.019 


7.4 




4Z3-4 /o 


1223 


UBA 


UBA/TS-N domain 


0.035 


20.7 


i 


987-1026 


1224 


vwa 


von Willebrand factor type 
A domain 


1.2e-05 


29.0 




51-242 


1225 


abhydrolase 


alpha/beta hydrolase fold 


3.7e-12 


53.8 




111-390 


1225 


abhydrolase_ 
2 


Phospholipase/Carboxyleste 
rase 


2.3 


-88.8 




64-270 


1227 


lectin c 


Lectin C-type domain 


9.6e-29 


108.9 


; 


54-166 


1237 


fibrinogen_C 


Fibrinogen beta and gamma 
chains, C-term 


1.5e-09 


-1.3 




242-385 


1257 


LRR 


Leucine Rich Repeat 


9e-65 


228.6 


14 


78-101:102- 

125:126-149:150- 

173:174-197:198- 

221:222-245:246- 

269:2^ '0-293:29 4- 

317:318-341:342- 

365:366-389:390- 

413 


1257 


LRRCT 


Leucine rich repeat C- 
terminal domain 


0.011 


22.3 


1 


423-475 


1257 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.23 


18.0 


1 


24-52 


1263 


Pep_M12B_p 
ropep 


Reprolysin family 
propeptide 


1.8e-28 


108.1 


1 


75-191 


1263 


disintegrin 


Disintegrin 


2.7e-ll 


51.0 


1 


419-494 


1263 


Reprolysin 


Reprolysin (M12B) family 
zinc metallo 


7.3e-07 


-22.2 


1 


206-402 


1263 


EGF 


EGF-like domain 


2.7 


10.3 




635-663 


1266 


Clusterin 


Clusterin 


2.3e-298 


1004.6 


1 


2-394 


1267 


serpin 


Serpin (serine protease 
inhibitor) 


3.8e-140 


479.0 


1 


80-425 


1268 


serpin 


Serpin (serine protease 
inhibitor) 


6.8e-162 


551.2 


1 


98-461 


1269 


COX6C 


Cytochrome c oxidase 
subunit Vic 


2.3e-38 


140.9 


1 


1-75 


1270 


Cadherin_C_t 
erm 


Cadherin cytoplasmic 
region 


4 


-79.4 




13-130 


1271 


spectrin 


Spectrin repeat 


8.2 


-19.9 




O AT /in 

30 7-41 / 


1272 


W2 


eIF4-gamma/eIF5/eIF2- 
epsilon 


7.3 


-26.9 


1 


325-395 


1272 


spectrin 


Spectrin repeat 


Q 1 
O.J 


-10 0 





327-437 


1273 


Rhodanese 


Rhodanese-like domain 


1.5e-48 


174.7 




16-138:165-266 


1274 


trypsin 


Trypsin 


2.9e-80 


280.1 




121-346 


1279 


CalpainJII 


Calpain large subunit, 
domain III 


1.5e-25 


98.3 




30-179 


1284 


Ribosomal L 
35p 


Ribosomal protein L35 


2.4 


-18.1 




122-179 


1285 


Ribosomal L 


Ribosomal protein L35 


2.4 


-18.1 




122-179 
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35p 












1287 


zf-C2H2 


Zinc finger, C2H2 type 


4.4e-66 


233.0 


9 


173-195:201- 

223:229-251:257- 

279:285-307:313- 

391:397-419 


1287 


KRAB 


KRAB box 


4.9e-23 


90.0 




14-54 


1287 


zf-BED 


BED zinc finger 


2.4 


-1.9 




186-224 


1287 


LIM 


LIM domain 


9.3 


-19.9 




175-239 


1292 


EGF 


EGF-like domain 


0.36 


17.3 


1 


30-65 


1294 


vwc 


von Willebrand factor type 
C domain 


L3e-09 


45.4 


1 


69-127 


1295 


zf-C2H2 


Zinc finger, C2H2 type 


5e-50 


179.6 


8 


263-285:319- 

341:347-369:375- 

397:403-425:431- 

453:459-481:487- 

510 


1295 


KRAB 


KRAB box 


8.9e-26 


99.1 


1 


8-48 


1295 


TFIIS 


Transcription factor S-II 
(TFIIS) 


5.7 


-5.0 


1 


431-469 


1295 


zf-TRAF 


TRAF-type zinc finger 


7.3 


-8.4 


1 


369-419 


1295 


LIM 


LIM domain 


7.7 


-19.2 


1 


321-385 


1296 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-113 


389.6 


13 


190-212:218- 
240:246-268:274- 
296:302-324:330- 
352:358-380:386- 

J An A *t A A J" A A S\ 

408:414-436:442- 
464:470-492:498- 
520:526-548 


1296 


KRAB 


KRAB box 


6.7e-28 


106.1 


1 


14-54 


1296 


zf-BED 


BED zinc finger 


0.51 


4.1 


1 


511-549 


1296 


PHD 


PHD-finger 


2.4 


-13.4 


1 


275-338 


1296 


TFIIS 


Transcription factor S-II 
(TFIIS) 


7.6 


-6.1 


1 


330-368 


1296 


LIM 


LIM domain 


8.1 j 


-19.4 


1 


388-456 


1299 


hemopexin 


Hemopexin 


7e-25 


96.1 


4 


279-322:324- 

367:369-415:417- 

461 


1299 


Peptidase_Ml 
0 


Matrixin 


8.4e-14 


58.8 


1 


1-176 


1299 


Astacin 


Astacin (Peptidase family 
M12A) 


1.4 


-117.0 


1 


91-238 


1301 


ig 


Immunoglobulin domain 


0.67 


16.4 


1 


33-116 


1305 


DUF25 


Domain of unknown 
function DUF25 


5.3e-76 


265.9 


1 


40-162 


1307 


kazal 


Kazal-type serine protease 
inhibitor domain 


1.3e-12 


55.3 


1 


21-68 


1309 


Defensin__pro 
pep 


Defensin propeptide 


1 .2e-2o 


1 AO A 

102.0 


1 


HA 1 9£ 


1309 


defensins 


Mammalian defensin 


7.3e-14 


59.5 


1 


139-167 


1312 


ras 


Ras family 


2.7e-59 


210.4 


1 


64-253 


1312 


GTPJBFTU 


Elongation factor Tu GTP 
binding domain 


0.91 


-79.4 


1 


61-226 


1312 


arf 


ADP-ribosylation factor 
family 


4.3 


-116.7 


1 


46-226 
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1314 


zf~C2H2 


Zinc finger, C2H2 type 


l.le-51 


1 o« 1 

185.1 


Q 
O 


1 £7 1 PC.1Q1 

103-ioj : iy i- 
213:219-241:247- 

ZOy.Z Ij-Zy /.5U5- 

325:331-353:359- 

3ol 


1314 


LIM 


LIM domain 


1.8 


-13.9 


1 


221-285 


1314 


PHD 


PHD-finger 


2.5 


-13. 5 


1 
1 


5U4-50 / 


1314 


zf-BED 


BED zinc finger 


2.8 


-2.4 


1 


232-270 


1315 


Keratin_B2 


Keratin, high sulfur B2 
protein 


0.0035 


-30.0 


1 


1 1 1 <A 

13- 1j4 


1316 


Keratin_B2 


Keratin, high sulfur B2 
protein 


3.6e-15 


63.8 


1 


23-154 


1317 


Keratin_B2 


Keratin, high sulfur B2 
protein 


8.2e-20 


79.2 


2 


20-163:164-295 


1318 


Keratin_B2 


Keratm, high sulfur B2 
protein 


5.5 


-79.9 


1 


3-1 /Z 


1319 


zf-C2H2 


Zinc finger, C2H2 type 


2.9e-52 


18.7.1 


7 


33-55:ol-o3:o9- 

1 1 1.1 17 1 1Q. 1 AC 

iii:ii /-i5y. 14 j- 
167:173-195:201- 

ZZj 


1319 


PHD 


PHD-finger 


1.6 


-11.8 


1 


146-209 


1319 


zf-BED 


BED zinc finger 


3.4 


-3.2 


1 


74-112 


1321 


Keratin_B2 


Keratin, high sulfur B2 
protein 


0.00014 


-8.3 


1 


23-193 


1331 


ig 


Immunoglobulin domain 


9.7e-06 


32.5 


1 


24-/0 


1332 


Kunitz_BPTI 


Kunitz/Bovine pancreatic 
trypsin inhibito 


2.2e-18 


74.5 


1 


76-126 


1333 


zf-C2H2 


Zinc finger, C2H2 type 


4.6e-55 


196.3 


7 


199-221:227- 
z4y.ZDD-z/ /.Zo5- 
305:311-333:339- 

301.50 /-3oy 


1333 


KRAB 


KRAB box 


D.oe-24 , 


Q1 f\ 

y3.u 


1 


O-40 


1333 


zf-BED 


BED zinc finger 


0.31 


6.1 


2 


240-278:352-390 


1336 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 


32-193 


1336 


TIG 


IPT/TIG domain 


1.4 


4.5 


1 


199-291 


1337 


vwa 


von Willebrand factor type 
A domain 


0.0038 


-8.1 


1 


32-193 


1337 


TIG 


IPT/TIG domain 


1.4 


4.5 


1 


199-291 


1339 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


0.0088 


-16.3 


1 


4-203 


1340 


PMP22_Clau 
din 


t» TV jm 

PMP- 

22/EMP/MP20/Claudin 
family 


a An i o 
0.0019 


a n 
-4.7 


1 


A 1 QA 

4-iy4 


1341 


PsaL 


Photo system I reaction 
centre subunit XI 


A & 
4.0 


111 o 

-1 11. 0 


i 


zi-iuo 


1 j4j 


ig 


T-m rv» unA rrl /*» \~n 1 1 1 yi /i /vm Qt-n 
lllllllUIlOgllJUUllil ULrlllcil.il 




JO.J 


9 




1344 


MARCKS 


MARCKS family 


6.7 


-95.4 


1 


124-395 


1347 


ig 


Immunoglobulin domain 


1.2e-07 


38.9 


2 


168-226:276-330 


1348 


ig 


Immunoglobulin domain 


1.5e-18 


75.1 


3 


157-217:269- 
325:373-427 


1348 


rhv 


picornavirus capsid protein 


1.4 


-52.0 


1 


28-258 


1353 


LRR 


Leucine Rich Repeat 


2e-39 


144.4 


9 


120-143:144- 
167:168-191:192- 
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215:216-239:240- 
263:264-2o 1- 
334:335-358 


1353 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0066 


23.1 


1 


92-119 


1353 


LRRCT 


Leucine rich repeat C- 
terminal domain 


1.5 


4.9 


1 


368-418 


1353 


Bgal_small_C 


Beta galactosidase small 
chain, C termin 


6.6 


-44.4 


1 


105-174 


1354 | 


oxidored_q4 


NADH- 

ubiquinone/plastoquinone 
oxidoreduct 


5.8 


-35.2 


1 


60-142 


1356 


ig 


Immunoglobulin domain 


2.1e-17 


71.3 


3 


37-106:13o- 


1357 


CD36 


CD36 family 


3.9e-231 


781.3 


1 


< AA^. 


1358 


DUF139 


Cysteine rich repeat 
(DUF139) 


1.2e-08 


42.2 


4 


195-211:245- 

261:288-304:316- 

332 


1359 


ig 


Immunoglobulin domain 


1.6e-06 


35.1 


2 


a 1 io/i.K^ Tin 
41-124: ijo-ZD\) 


1359 


Gag MA 


Matrix protein (MA), pl5 


0.0044 


-25.2 


1 


on 1 a on 

291-429 


1360 


ion trans 


Ion transport protein 


0.0031 


24.2 


1 


O O £T A A C 

285-445 


1360 


DUF214 


Predicted permease 


8.6 


-67.8 


1 


202-378 


1363 


spectrin 


Spectrin repeat 


9.5e-22 


85.7 


6 


17-121:124- 
226:229-340:3 IZ- 
476:678-785:788- 
896 


1363 


Remorin_C 


Remorin, C-terminal region 


3.5 


-39.8 


1 


s~ C C\ l A A 

659-744 


1366 


Clq 


Clq domain 


2.1e-31 


117.8 


1 


266-390 


1366 


Collagen 


Collagen triple helix repeat 
(20 copies) 


0.00029 


19.8 


1 


182-241 


1367 


rnaseA 


Pancreatic ribonuclease 


l.le-34 


128.7 


1 


27-132 


1368 


3HCDH_N 


3-hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.0021 


-60.2 


1 


50-141 


1369 


3HCDH_N 


3-hydroxyacyl-CoA 
dehydrogenase, NAD 
binding 


0.25 


-91.5 


1 


50-153 


1371 


voltage_CLC 


Voltage gated chloride 
channel 


3.1e-199 


675.3 


1 


92-528 


1371 


CBS 


CBS domain 


4.9e-16 


66.7 


2 


559-617:761-814 


1371 


PH04 


Phosphate transporter 
family 


4.1 


-199.1 


1 


129-495 


1371 


K trans 


K+ potassium transporter 


5.8 


-566.6 


1 


0/C7 HCCi 

267-759 


1371 


7tm 5 


7TM chemoreceptor 


6.3 


-167.6 


1 


254-486 


1372 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


0.2 


-84.7 


1 


82-199 


1373 


ldh_C 


lactate/malate 
aenydrogenase, aipna/ oeta 
C-t 


0.69 


-61.5 


1 


38-200 


1374 


disintegrin 


Disintegrin 


7.2 


-30.4 


1 


241-306 


1375 


Peptidase_Cl 


Papain family cysteine 
protease 


le-120 


414.5 


1 


114-332 


1376 


7tm 1 


7 transmembrane receptor 


2.6e-50 


180.6 


1 


48-454 


1376 


7tm 5 


7TM chemoreceptor 


1 


-153.3 


1 


26-453 


1376 


Sec62 


Translocation protein Sec62 


1.4 


-124.2 


1 


307-477 
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1376 


DUF280 


Caenorhabditis protein of 
unknown function 


5.6 


-141.0 


i 


93-274 


1376 


Sre 


C. elegans Sre G protein- 
coupled chemorece 


7.2 


-136.9 


1 

— 


1 caq 


1376 


Colicin im 


Colicin immunity protein 


9.6 








1378 


7tm_2 


7 transmembrane receptor 
(Secretin family) 


4.9e-09 


32.5 


i 


250-500 


1378 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.38 


4.4 


i 


197-248 


1378 


Competence 


Competence protein 


5.9 


-102.8 


i 


240-490 


1380 


MAGE 


MAGE family 


1.4e-91 


317.7 


1 


O T2 1 
3~231 


1381 


ion trans 


Ion transport protein 


0.065 


16.8 




158-323 


1381 


ABC2_memb 
rane 


ABC-2 type transporter 


6.1 


-130.1 


i 


150-339 


1381 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 


6.2 


-167.5 


i 


51-322 


1381 


OATP_C 


Organic Anion Transporter 
Polypeptide 


9.5 


-236.4 


i 


80-321 


1382 


ig 


Immunoglobulin domain 


7e-12 


52.9 




3 /-12o. 10U-241 


1385 


MHCJ 


Class I Histocompatibility 
antigen, domains 


9.3e-06 


-8.4 




OA OA/1 


1388 


pro_jsomeras 
e 


Cyclophilin type peptidyl- 
prolyl cis-tr 


7e-88 


305.4 




AK 1 Q/1 


1390 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


5.3e-06 


21.2 




1 O 1 A 

1-21U 


1391 


Reticulon 


Reticulon 


6.4 


-89.9 




CO OC7 


1392 


Collagen 


Collagen triple helix repeat 


0.92 


-48.2 


i 


27-84 


1392 


MDM2 


p53-associated protein 
(MDM2) 


5.1 


-12.8 


1 


no kc 

92-155 


1392 


Cytidylyltrans 


Phosphatidate 
cytidylyltransferase 


9.4 


-91.4 


1 


141-242 


1393 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


1.6e-12 


55.0 




r\c\ oon 

99-329 


1393 


DUF40 


Domain of unknown 
function DUF40 


5.7 


-131.9 


i 


97-267 


1395 


trefoil 


Trefoil (P-type) domain 


3.9 


-11.1 


i 


28-53 


1397 


fh3 


Fibronectin type III domain 


4.7e-78 


272.7 


5 


533-619:631- 

7l7:734-833:o54- 

939:951-1039 


1397 


ig 


Immunoglobulin domain 


3.7e-42 


153.5 


5 


1 C A OOO.O/C 1 

154-227:261- 

3 1 o. 302-4 lo.4D.5- 

ri i ,i oca 1 ai £ 
D 1 1 : 12D3-1 j 10 


1398 


DUF300 


Domain of unknown 
function 


1.4e-152 


520.3 


1 




1404 


is 


Immunoglobulin domain 


2.9e-08 


40.9 


>-> 

2 


AO QQ- 1 T.Q 1 QS 


1409 


ig 


Immunoglobulin domain 


8.1e-30 


112.5 


3 


142-199:263- 

l0 1 .ICQ 
321:33^-435 


1410 


Tissue fac 




0.26 


-135.2 


1 


1-267 


1410 


HRM 


Hormone receptor domain 


8.4 


-13.9 


1 


28-101 


1412 


FecCD 


FecCD transport family 


7.2 


-218.5 


1 


144-342 


1413 


MetallothioJP 
ro 


Prokaryotic metallothionein 


6.2 


-8.0 


1 


464-511 


1414 


mito_carr 


Mitochondrial carrier 
protein 


3.5e-64 


226.7 


3 


40-130:137- 
227:238-322 


1416 


ig 


Immunoglobulin domain 


0.0011 


25.6 


1 


57-112 
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1417 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


6.6e-06 


33.1 


1 


27-67 


1418 


Synaptophysi 
n 


Synaptophysin / 
synaptoporin 


6.9e~71 


248.9 


1 


99 979 


1419 


PMP22_Clau 
din 


PMP- 

22/EMP/MP20/Claudin 
family 


5.2e-18 


73.3 


1 


A 1 


1421 


Ammonium_t 
ransp 


Ammonium Transporter 
Family 


8.5e-18 


/Z.o 


i 
i 


Zj"JOJ 


1421 


FecCD 


FecCD transport family 


1.8 


-206.7 


1 


97-311 


1421 


Bac_chlorC 


Bacteriochlorophyll C 
binding protein 


5.8 


-12.0 


1 


212-289 


1423 


rnaseA 


Pancreatic ribonuclease 


/i i„ on 

4. le-zy 


1 1A 1 

1 1U. I 


i 
i 


32-136 


1424 


PCI 


PCI domain 


8.5 


Q 1 

-y.i 


i 
i 




1424 


Nrap 


Nrap protein 


9.7 


-257.5 


1 


162-764 


1425 


CbiM 


CbiM 


O A 

3.4 


-OO.O 


i 
i 


1 0- 1 jo 


1427 


LRR 


Leucine Rich Repeat 


4.5e-26 


i nn n 
1UU.U 


0 


^ 96-97-^ft'S 1 - 
74-76-QQ-1 DO- 
19^- 194-1 44 


1427 


fh3 


Fibronectin type III domain 


2. /e-Uo 


Aid 
41. U 


i 


^4-417 


1427 


ig 


Immunoglobulin domain 


O A ^ n*7 

2.4e-U / 


5 /.y 


i 
i 


99^-98^ ' 


1427 


LRRCT 


Leucine rich repeat C- 
terminal domain 


7.2e~Uo 




i 
i 


1 64-90Q 


1428 


GRAM 


GRAM domain 


0.067 


13.1 


1 


177-245 


1429 


YGGT 


YGGT family 


7.4 


-24.4 


1 


147-217 


1431 


Papilloma_E5 


Papillomavirus E5 


6.7 


-34.9 


1 


1 no 
jZ- IUo 


1435 


Rhomboid 


Rhomboid family 


1.5e-45 


164.8 


1 


1 £Q 19£ 


1435 


efhand 


EF hand 


3.8 


11.0 


1 


38-66 


1437 


Sema 


Sema domain 


7e-202 


684.1 


1 


1 no -c 1 1 
1UZO Id 


1437 


tsp_l 


Thrombospondin type 1 
domain 


l.le-42 


155.3 


6 


Doy-oJ /:o44- 
695:702-746:833- 

QQ'i-CQn Q4fVQ4^ 


1437 


PSI 


Plexin repeat 


2.5e-14 


61.0 


1 


531-578 


1438 


ank 


Ankyrin repeat 


2e-15 


64.7 


3 


336-368:383- 

41 S-4£8 4QS 
41o.40o-' 1 fyo 


1438 


ion trans 


Ion transport protein 


9.4e-07 


35.9 


1 


615-817 


1439 


ank 


Ankyrin repeat 


3.3e-12 


54.0 


•-> 
3 


41 /-44y.404- 

499:549-579 


1439 


ion trans 


Ion transport protein 


9.4e-07 


35.9 


1 


oyo-oyo 


1442 


LRR 


Leucine Rich Repeat 


3.2e-20 


80.6 


5 


57-80:81-104:105- 

1 9Q«1 9Q 1 *n9«1 ^ 

IZo. izy-l jZ. 1 Do- 
176 


1442 


LRRCT 


Leucine rich repeat C- 
terminal domain 


2.3e-07 


37.9 


1 


10/C o /in 

1 OO-240 


1442 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00013 


28.8 


1 


Z /-DD 


1443 


7tm_2 


/ transmembrane receptor 
(Secretin family) 


7 9p 9^ 


KQ 4 


I 


166-476 


1443 


HMG_box 


HMG (high mobility group) 
box 


3.1e-05 


30.8 


1 


3-76 


1443 


GPS 


Latrophilin/CL-l-like GPS 
domain 


0.0066 


19.1 


1 


110-162 


1443 


7tm„l 


7 transmembrane receptor 
(rhodopsin family) 


1.9 


-107.5 


1 


255-473 
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1443 


Frizzled 


Frizzled/Smoothened family 
membrane region 


3.3 


-241.3 


1 


156-459 


1444 


DUF214 


Predicted permease 


0.0051 


-16.4 


1 


136-275 


1444 


secY 


eubacterial secY protein 


1.1 


-230.8 


1 


13-270 


1448 


ribonuc_red_s 
m 


Ribonucleotide reductase, 
small chain 


5.6 


-142.1 


1 


529-805 


1452 


mito_carr 


Mitochondrial carrier 
protein 


8.4e-58 


205.5 


2 


27-123:124-217 


1454 


mito_carr 


Mitochondrial carrier 
protein 


0.00049 


22.3 


2 


79-179:198-287 


1455 


WD40 


WD domain, G-beta repeat 


2.7e-26 


100.8 


4 


64-99:105- 

141 :235-273:2 /y- 


1456 


Furin-like 


Furin-like cysteine rich 
region 


6.1e-89 


308.9 


1 


172-328 


1456 


Recep_L_do 
main 


Receptor L domain 


3.1e-80 


280.0 


2 


46-169:345-471 


1456 


pkinase 


Protein kinase domain 


7e-65 


229.0 


1 


986-1258 


1456 


fh3 


Fibronectin type III domain 


0.2 


11.5 


2 


604-794:514-909 


1456 


fer4 


4Fe-4S binding domain 


3.7 


-1.0 


1 


oo o o/n 

238-267 


1456 


Keratin_B2 


Keratin, high sulfur B2 
protein 


5.2 


-79.5 


1 


143-285 


1456 


TIL 


Trypsin Inhibitor like 
cysteine rich d 


6.3 


-13.2 


1 


221-279 


1462 


Peptidase Ml 
0 


Matrixin 


1.5e-79 


277.7 


1 


62-227 


1462 


hemopexin 


Hemopexin 


1.4e-10 


48.6 


1 


309-350 


1463 


Armadillq_se 
g 


Armadillo/beta-catenin-like 
repeat 


0.028 


21.0 


4 


549-591:641- 

685:686-737:738- 

783 


1466 


C2 


C2 domain 


6.5e~07 


36.4 


1 


219-304 


1468 


RTC 


RNA 3'-terminaI phosphate 
cyclase 


6.1e-05 


-118.8 


1 


9-196 


1476 


CN_hydrolas 
e 


Carbon-nitrogen hydrolase 


0.0014 


-76.2 


1 


64-341 


1477 


zf-C2H2 


Zinc finger, C2H2 type 


4.8e-06 


33.5 


3 


267-289:294- 
317:419-442 


1477 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


7.1 


-7.0 


1 


269-299 


1478 


Eukporin 


Eukaryotic porin 


3.3e-06 


An 1 

-42.1 


l 


1 1 AO 
1-14Z 


1482 


ras 


Ras family 


0.24 


-121.5 


1 


389-513 


1484 


BTB 


BTB/POZ domain 


2.3e-13 


57.8 


1 




1487 


rvt 


Reverse transcriptase 


8.1e-57 


202.2 


1 


OA 1 ^ftO 

Z4 1 O U / 


1489 


GalactosylJT 
2 


Galactosyltransferase 


4.7e-177 


601.6 


1 


97-404 


1489 


TT ORF2 


TT viral ORF2 


9 


-98.0 


1 


339-453 


1490 


Orexin 


Prepro-orexin 


8.3 


-32,3 


1 


1 1 OO 

1-128 


1491 


<jrJ3r_C 


Guanylate-binding protein, 
C-terminal domain 




QA A 


i 
i 


2-155 


1493 


NUDIX 


NUDIX domain 


0.0011 


21.7 


1 


297-439 


1496 


Big_2 


Bacterial Ig-like domain 
(group 2) 


4.5e-16 


66.8 


1 


1177-1258 


1497 


ig 


Immunoglobulin domain 


1.9e-10 


48.2 


1 


38-112 


1498 


aldo ket red 


Aldo/keto reductase family 


3.7e-08 


-8.7 


1 


108-367 


1502 


Caveolin 


Caveolin 


1.6e-06 


13.3 


1 


90-262 
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1504 


sushi 


Sushi domain (SCR repeat) 


8.2e-16 


66.0 


2 


223-281:286-344 


1511 


zf-C2H2 


Zinc finger, C2H2 type 


l.le-16 


68.9 


2 


341-363:369-391 


1511 


KRAB 


KRAB box 


4.9e-09 


43.5 


1 


230-268 


1511 


zf-BED 


BED zinc finger 


4.1 


-3.9 


1 


354-392 


1516 


CBS 


CBS domain 


1.4e-30 


115.0 


3 


189-243:264- 
317:336-389 


1517 


zf-C2H2 


Zinc finger, C2H2 type 


2.4e-128 


439.8 


18 


279-301:307- 

329:335-357:363- 

385:391-413:419- 

441:447-469:475- 

497:503-525:868- 

890:896-918:924- 

946:952-974:980- 

1002:1008- 

1 AO A. 1 AO*C 

1U3U. lloo- 
1058:1064- 

1 nQ^. 1 AAO 111/1 

lUoo. iuyz- 1114 


1517 


KRAB 


KRAB box 


5.6e-40 


1 A £1 O 

146.2 


2 


HA 1 1 A »£Q 1 ^1 ^ 
/4-114.0ol-/ 13 


1517 


PHD 


PHD-fmger 


1.2 


-10.7 




QQH Q£f\ 

oy /-you 


1517 


SCAN 


SCAN domain 


1.6 


-32.1 


i 


544-624 


1517 


LIM 


LIM domain 


5.4 


-17.9 




-ion a A1 

337-401 


1517 


BolA 


BolA-like protein 


6.1 


-25.1 


i 


283-347 


1517 


TFIIS 


Transcription factor S-II 
(TFIIS) 


9.1 


-6.8 




nro nAA 

952-990 


1518 


ig 


Immunoglobulin domain 


le-06 


35.7 


1 


to 1 AO 

38-102 


1521 


CoA trans 


Coenzyme A transferase 


l.le-161 


550.5 




42-20L.J. 1 1-4 11 


1521 


TGF-beta 


Transforming growth factor 
beta like domain 


6.8e-64 


225.7 




/COT HA 1 

637- /41 


1523 


Reprolysin 


Reprolysin (M12B) family 
zinc metallo 


6.6e-95 


328.7 




223-422 


1523 


Pep_M12B_p 
ropep 


Reprolysin family 
propeptide 


2e-27 


104.5 




121-248 


1523 


disintegrin 


Disintegrin 


1.9e-14 


61.5 




437-513 


1523 


EGF 


EGF-like domain 


0.0092 


22.6 


i 


661-693 


1523 


VSP 


Giardia variant-specific 
surface prot 


4.9 


-258.5 


- 


428-750 


1523 


Keratin_B2 


Keratin, high sulfur B2 
protein 


7.2 


-81.7 




427-538 


1524 


DSPc ! 


Dual specificity 
phosphatase, catalytic doma 


3.6e-71 


249.9 


1 


102-241 


1527 


Spermine syn 
th 


Spermine/spermidine 
synthase 


8.2e-13 


56.0 


1 


254-440 


1528 


efhand 


EF hand 


3.4e-10 


47.3 


4 


65-91:145- 

173:329-357:365- 

393 


1529 


Epimerase 


NAD dependent 
epimerase/ dehydratase 

-£V» W% ill/ 

iamny 


0.66 


-192.0 


1 


10-451 


1530 


rvt 


Reverse transcriptase 


7.7e-69 


242.1 


1 


125-395 


1531 


RPH3A_effec 
tor 


Rabphilin-3A effector 
domain 


1.9e-09 


-6.1 


1 


161-339 


1533 


TPR 


TPR Domain 


3.7e-10 


47.2 


3 


220-253:444- 
477:478-511 


1533 


CENP-B 


CENP-B protein 


1.2e-06 


-7.7 


1 


541-671 


1534 


SAM 


SAM domain (Sterile alpha 


0.045 


18.0 


1 


688-752 
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motif) 










1534 


Atrophin-1 


Atrophin-1 family 


7.9 


-679.9 


-j 


731-1402 


1538 


pkinase 


Protein kinase domain 


1.5e-20 


81.7 




42-242 


1540 


zf-C2H2 


Zinc finger, C2H2 type 


0.00035 


27.3 


1 


725-750 


1541 


COLFI 


Fibrillar collagen C-terminal 
domain 


3.9e-163 


555.3 


1 


624-841 


1541 


Collagen 


Collagen triple helix repeat 
(20 copies) 


2.2 


-55.6 




538-597 


1541 


TT ORF2 


TT viral ORF2 


3.9 


-93.3 


i 


508-610 


1542 


EGF 


EGF-like domain 


2.6e-ll 


51.0 




916-947:1004- 
1044:1050-1094 


1542 


granulin 


Granulin 


6.3 


-17.2 


1 


1019-1074 


1542 


TIL 


Trypsin Inhibitor like 
cysteine rich domain 


9.4 


-15.2 




938-1004 


1543 


ig 


Immunoglobulin domain 


1.5e-10 


48.5 




38-112 


1544 


UCH-1 


Ubiquitin carboxyl-terminal 
hydrolases famil 


8.5e-15 


62.6 




477-508 


1544 


UCH-2 


Ubiquitin carboxyl-terminal 
hydrolase family 


7.3e-13 


56.2 


i 


759-811 


1544 


zf-UBP 


Zn-fmger in ubiquitin- 
hydrolases and other 


0.011 


20.7 


1 


350-420 


1545 


spectrin 


Spectrin repeat 


2.8 


-14.5 


1 


696-803 


1546 


LRR 


Leucine Rich Repeat 


l.le-30 


115.3 


9 


132-155:156- 

179:180-199:201- 

224:250-269:271- 

294:295-318:319- 

341:342-365 


1546 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00082 


26.1 


1 


62-84 


1547 


Allantoicase 


Allantoicase repeat 


7e-70 


245.6 


2 


116-271:376-476 


1547 


DCX 


Doublecortin 


3.3e-17 


70.6 


1 


611-671 


1551 


Peptidase S9 


Prolyl oligopeptidase family 


1.7e-06 


35.0 


1 


389-469 


1551 


DPPIV_N_ter 
m 


Dipeptidyl peptidase IV 
(DPP IV) N-termi 


1.4 


-267.4 


1 


1-385 


1551 


DLH 


Dienelactone hydrolase 
family 


4.7 


-96.4 


1 


355-597 


1552 


Orn DAPAr 
g deC 


Pyridoxal-dependent 
decarboxylase, C- 


1.3e-05 


23.1 


1 


84-202 


1555 


rvt 


Reverse transcriptase 


0.00015 


7.9 


i 


505-718 


1557 


rvt 


Reverse transcriptase 


1.4e-69 


244.6 




247-520 


1557 


RcbX 


RbcX protein 


1.8 


-79.3 


1 


1-108 


1558 


RhoGEF 


RhoGEF domain 


9.1e-26 


99.0 




971-1155 


1558 


PDZ 


PDZ domain (Also known 
as DHR or GLGF) 


4.2e-12 


53.6 


1 


159-234 


1558 


PH 


PH domain 


0.081 


19.5 


1 


1199-1312 


1558 


RGS 


Regulator of G protein 
signaling domain 


2.5 


-34.3 




465-577 


1559 


pkinase 


Protein kinase domain 


9e-ll 


20.4 




151-443 


1560 


FH2 


Formin Homology 2 
Domain 


4e-105 


362.7 




595-1038 


1561 


Keratin_B2 


Keratin, high sulfur B2 
protein 


9.6 


-83.6 




136-235 


1565 


HMG_CoA_s 
ynt 


Hydroxymethylglutaryl- 
coenzyme A synthas 


6.5e-300 


1009.7 




50-582 


1568 


GCV H 


Glycine cleavage H-protein 


4.3e-77 


269.5 




65-185 
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1570 


DLH 


Dienelactone hydrolase 
family 


0.039 


-69.3 


1 


168-431 


1577 


Peptidase_Ml 


Peptidase family Ml 


5.6e-74 


259.2 


1 


42-399 


1577 


DUF59 


Domain of unknown 
function DUF59 


1.8 


-24.4 




188-265 


1579 


rrm 


RNA recognition motif. 


0.0082 


22.8 




823-891 


1579 


zf-C2H2 


Zinc finger, C2H2 type 


9.5 


9.5 




584-608 


1580 


serpin 


Serpin (serine protease 
inhibitor) 


2.1e-107 


370.3 


"1 


37-541 


1581 


CH 


Calponin homology (CH) 
domain 


0.012 


13.5 


1 


84-181 


1582 


LRR 


Leucine Rich Repeat 


9.6e-38 


138.8 


12 


70-93:94-117:118- 

141:142-165:166- 

189:190-213:214- 

237:238-261:262- 

285:286-310:311- 

335:336-359 


1582 


LRRCT 


Leucine rich repeat C- 
terminal domain 


5.6e-12 


53.2 


1 


369-421 


1582 




Immunoglobulin domain 


1.9e-05 


31.5 


1 


438-499 


1582 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.0099 


22.5 




28-68 


1583 


Nucleoplasmi 
n 


Nucleoplasmin 


4.4 


-94.1 


1 


131-199 


1587 


IBR 


IBR domain 


1.3e-13 


58.7 


1 


237-298 


1587 


zf-C3HC4 


Zinc finger, C3HC4 type 
(RING finger) 


0.0053 


20.0 


1 


168-213 


1591 


TPR 


TPR Domain 


4.1e-13 


57.0 


3 


254-287:288- 
321:322-355 


1592 


PHD 


PHD-fmger 


0.042 


2.9 


2 


684-725:1117- 
1163 


1592 


DNA ligase 
ZBD 


NAD-dependent DNA 
ligase C4 zinc tinge 


5.2 


0.3 


1 


904-927 


1593 


Collagen 


Collagen triple helix repeat 
(20 copies) 


5.9 


-64.0 


1 


302-360 


1593 


LIM 


LIM domain 


5.9 


-18.2 


1 


841-870 


1595 


Peptidase_Ml 


Peptidase family Ml 


6.4e-44 


159.3 


1 


7-279 


1600 


thyreoglobulin 
1 


Thyroglobulin type-1 repeat 


3.1e-06 


34.1 


1 


38-118 


1604 


ig 


Immunoglobulin domain 


7 


8.2 


1 


34-92 


1605 


PHD 


PHD-finger 


0.59 


-7.8 




roc a a a 

585-644 


1606 


zf-CXXC 


CXXC zinc finger 


1.7e-21 


84.9 


i 


50-96 


1606 


F-box 


F-box domain 


5e-06 


33.4 




545-602:713-762 


1606 


PHD 


PHD-fmger 


0.062 


1.3 


-i 


207-271 


1606 


rubredoxin 


Rubredoxin 


3.6 


-9.0 


'1 


229-276 


1611 


L27 


L27 domain 


3.5 


5.2 


-\ 


285-341 


1612 


fh3 


Fibronectin type III domain 


0.68 


5.6 




357-443 


1613 


rrm 


RNA recognition motif. 


3.8e-14 


60.4 




1 J 1-ZZl 


1614 


SRCR 


Scavenger receptor 
cysteine-rich domain 


2.2e-26 


101.1 




52-145 


1616 


ldl_recept_a 


Low-density lipoprotein 
receptor domain 


0.12 


8.2 




68-114 


1616 


Tropomyosin 


Tropomyosin 


7.3 


-118.6 




216-441 


1617 


B56 


Protein phosphatase 2A 
regulatory B subunit 


3.3 


-315.0 




146-232 



WO 03/054152 



PCT/US02/39555 



Table 4B 
490 


SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


1618 


SI 


SI RNA binding domain 


6.7e-10 


46.3 


i 


352-429 ! 


1618 


HHH 


Helix-hairpin-helix motif 


3.8 


12.3 




173-202 


1619 


trypsin 


Trypsin 


5.9e-09 


43.2 


i 


30-190 


1623 


Flgl 


Flagellar P-ring protein 


6.7e-19 


76.2 


4 — 


46-257 


1623 


FlgH 


Flagellar L-ring protein 


1.5 


-75.1 




1-73 


1624 


Dihydroorota 
se 


Dihydroorotase-like 


9.9 


-195.7 


1 


1-187 


1625 


Semialdhyde_ 
dhC 


Semialdehyde 
dehydrogenase, dimerisat 


1.3e-54 


194.8 


1 


137-338 


1625 


Semialdhyde 
dh 


Semialdehyde 
dehydrogenase, NAD bindi 


2.3e-23 


91.1 


l 


7-128 


1626 


FDX-ACB 


Ferredoxin-fold anticodon 
binding domain 


le-43 


158.6 


i — 


83-176 


1628 


HTH_1 


Bacterial regulatory helix- 
turn-helix 


1.5e-13 


58.4 


-i — 


7-66 


1628 


LysR_substra 
te 


LysR substrate binding 
domain 


2.5e-08 


41.1 


1 


90-281 


1628 


Hpt 


Hpt domain 


7.8 


-35.7 


-\ — 


65-156 


1631 


PEP- 

utilizers C 


PEP-utilizing enzyme, TIM 
barrel doma 


1.5e-10 


-43.4 




58-174 

■ 


1632 


OTCace 


Aspartate/ornithine 
carbamoyltransferase, A 


0.24 


-67.2 


l 


432-546 


1632 


Replicase 


Replicase family 


2.9 


-102.2 


-\ 


399-576 


1634 


ribonuclease 
T2 


Ribonuclease T2 family 


0.00089 


-47.2 




31-165 


1638 


efhand 


EF hand 


5.9e-10 


46.5 




38-66:74-102:111- 
139:147-175 


1639 


PCI 


PCI domain 


0.013 


19.5 




165-246 


1641 


Cys knot 


Cystine-knot domain 


0.011 


-2.5 


1 


227-337 


1641 


DAN 


DAN domain 


0.28 


-29.8 


4 


199-327 


1643 


Ribosomal L 
23 


Ribosomal protein L23 


9.6e-13 


55.8 




197-276 


1645 


tsp 1 


Thrombospondin type 1 
domain 


0.0026 


21.0 


5 


420-476:480- 

534:541-601:607- 

663:669-718 


1647 


Arginosuc_sy 
nth 


Arginosuccinate synthase 


1.3e-07 


-97.2 


1 


45-196 


1651 


vwa 


von Willebrand factor type 
A domain 


l.le-33 


125.4 


1 


89-326 


1652 


ig 


Immunoglobulin domain 


2.4e-09 


44.5 


2 


34-117:220-296 


1658 


thiored 


Thioredoxin 


6.9e-104 


358.5 


2 


324-432:459-570 


1658 


glutaredoxin 


Glutaredoxin 


7.4 


-4.2 


1 


482-544 


1662 


Hist deacetyl 


Histone deacetylase family 


5.9e-61 


215.9 


1 


857-1434 


1666 


wap 


WAP-type (Whey Acidic 
Protein) 'four-disulfi 


7.4e-15 


62.8 


1 


72-116 


1667 


KRAB 


KRAB box 


le-26 


102.2 


1 


106-146 1 


1672 


ig 


Immunoglobulin domain 


3.5e-ll 


50.6 


3 


146-203:245- 
295:331-405 


1674 


Transposase 
22 


LI transposable element 


1.5e-51 


184.7 


1 


6-281 


1676 


Ribosomal S 

2 


Ribosomal protein S2 


0.19 


-72.3 


1 


48-145 


1677 


Patatin 


Patatin-like phospholipase 


1.4e-36 


135.0 


1 


132-301 


1678 


ig 


Immunoglobulin domain 


7.6 


7.9 


1 


35-126 


1678 


Gemini mov 


Geminivirus putative 


9.4 


-42.6 


1 


51-131 
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ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 






movement protein 










1684 


adh short 


short chain dehydrogenase 


1.3e-32 


121.8 




2-244 


1685 


UPARLY6 


u-PAR/Ly-6 domain 


0.0039 


12.2 




131-213 


1686 


pkinase 


Protein kinase domain 


2e-43 


157.7 


> 


553-900 1 


1686 


Huntingtin 


Huntingtin 


7.3 


-163.8 




520-893 


1687 


HRM 


Hormone receptor domain 


7.8e-05 


29.5 




82-134 


1688 


SH3 


SH3 domain 


5.4e-22 


86.5 


1 


150-206 


1690 


ACAT 


Sterol Oacyltransferase 


6 


-109.5 




181-304 


1693 


LRR 


Leucine Rich Repeat 


4.3e-34 


126.7 


10 


505-528:529- 

554:555-573:575- 

598:613-632:634- 

657:659-684:686- 

708:709-732:738- 

763 


1694 


SH3 


SH3 domain 


0.00089 


19.6 


1 


19-73 


1697 


Metallothio 1 
1 


Metallothionein family 1 1 


5.8 


-11.1 


1 


36-81 


1701 1 


cystatin 


Cystatin domain 


0.01 


11.8 


1 


32-125 


1704 


TPR 


TPR Domain 


9 


6.5 


1 


122-155 


1705 


Ribosomal_S 
26e 


Ribosomal protein S26e 


2.1e-16 


68.0 


1 


133-231 


1706 


ank 


Ankyrin repeat 


5.5e-30 


113.1 


3 


381-413:414- 
446:450-482 


1706 


TPR 


TPR Domain 


1.6e-06 


35.1 


4 


3-36:43-76:164- 
197:205-238 


1706 


LRR 


Leucine Rich Repeat 


0.0014 


25.4 


4 


716-743:744- 

764:775-802:835- 

859 


1706 


Tropomoduli 
n 


Tropomodulin 


2.6 


-158.7 


1 


248-839 


1708 


zf-C2H2 


Zinc finger, C2H2 type 


3.6e-23 


90.4 


4 


51-74:205- 

227:233-255:261- 

284 


1708 


zf-BED 


BED zinc ringer 


7.7 


-6.3 


1 


246-285 


1709 


Pep_M12B_p 
ropep 


Reprolysin family 
propeptide 


0.055 


-20.1 


1 


120-202 


1710 


TIG 


IPT/TIG domain 


1.3e-100 


347.7 


10 


620-705:708- 

788:793-876:883- 

988:1084- 

1169:1174- 

1254:1257- 

1336:1341- 

1423:1424- 

1511:1516-1602 


1711 


pp-binding 


Phosphopantetheine 
attachment site 


0.048 


12.0 


1 


201-255 


1711 


HK 


Hydroxyethylthiazole 
kinase family 


2.9 


-161.9 




12-232 


1711 


ketoacyl-synt 


Beta-ketoacyl synthase, N- 
terminal doma 


4 


-114.1 




265-397 


1712 


IGFBP 


Insulin-like growth factor 
binding pr 


8.8e-26 


99.1 




40-99 


1712 


thyroglobulin 
1 


Thyroglobulin type-1 repeat 


4.6e-05 


29.5 




264-323 


1714 


Paralemmin 


Paralemmin 


4.2e-26 


100.1 1 


101-439 
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ID 
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E-value 


Score 


Repeats 


Position 


1715 


rvt 


Reverse transcriptase 


3.1e-08 


40.8 


1 


2-147 


1717 


A2M_N 


Alpha-2-macroglobulin 
family N-terminal r 


0 


1082.3 


1 


52-684 


1717 


A2M 


Alpha-2-macroglobulin 
family 


0 


1184.5 


1 


776-1530 


1717 


TCTP 


Translationally controlled 
tumor protein 


0.0013 


-47.0 


1 


1172-1249 


1717 


prenyltrans 


Prenyltransferase and 
squalene oxidase re 


7.6 


3.0 


1 


1296-1332 


1717 


TSPN 


Thrombospondin N- 
terminal -like domain 1 


8.2 


-57.8 


1 


34-202 


1718 


cadherin 


Cadherin domain 


9.7e-29 


108.9 




68-159:341- 
425:439-543 


1721 


MB OAT 


MBOAT family 


0.38 


-121.1 




3-173 


1722 


p450 


Cytochrome P450 


9.4e-48 


172.1 


i 


86-527 


1723 


DUF270 


Protein of unknown 
function, DUF270 


0.12 


-272.7 


1 


9-284 


1725 


ig 


Immunoglobulin domain 


0.75 


16.2 


1 


t> a 11/: 
34-1 10 


1729 


PA 


PA domain 


1.4e-29 


111.7 




521-ozo 


1729 


Glyco hydro 
47 


Glycosyl hydrolase family 
47 


6.2e-13 


28.4 


i 


99-364 


1731 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


3.1e-62 


220.2 


1 


50-293 


1731 


7tm 5 


7TM chemoreceptor 


1.7 


-157.3 


i 


31-289 


1731 


7tm 3 


7 transmembrane receptor 


6.1 


-155.5 


1 


39-302 


1732 


ig 


Immunoglobulin domain 


9.3e-16 


65.8 




49-125:169-240 


1732 


Adeno E3 C 
Rl 


Adenovirus E3 region 
protein CR1 


8.5 


-24.2 


1 


171-251 


1734 


p450 __j 


Cytochrome P450 


0.0069 


-119.1 


i 


51-492 


1735 


profilin 


Profilin 


5.1e-06 


33.3 




955-1071 


1735 


Transposase_ 
22 


LI transposable element 


0.00026 


-102.7 


i 


731-1016 


1735 


KE2 


KE2 family protein 


7.8 


-47.5 


i 


765-838 


1739 


IL3 


Interleukin-3 


8.7 


-38.1 


1 


121-224 


1742 


PAS 


PAS domain 


0.024 


21.2 


1 


348-413 


1743 


transmembran 
e4 


Tetraspanin family 


0.87 


-77.9 




5-194 


1744 


Dynein_heav 

y 


Dynein heavy chain 


2e-253 


855.3 


i 


4-781 


1745 


SCP 


S CP-like extracellular 
protein 


9.8e-36 


132.1 


1 


4-180 


1745 


granulin 


Granulin 


0.32 


-8.1 




173-206 


1749 


sugar_tr 


Sugar (and other) 
transporter 


0.26 


-152.9 


i 


260-692 


1749 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 


1.6 


-155.3 


i 


391-674 


1749 


FecCD 


FecCD transport family 


4.7 


-214.9 


i 


473-686 


1749 


Herpes HEP 
A 


Herpesvirus DNA 
helicase/primase compl 


C. A 

j A 






77-605 


1749 


PsbT 


Photosystem II reaction 
centre T prote 


5.9 


5.1 




545-572 


1749 


Bac_rhodopsi 
n 


Bacteriorhodopsin 


6.8 


-137.3 




295-496 


1749 


C4dic_mal_tr 
an 


C4-dicarboxylate 
transporter/malic aci 


9.9 


-183.4 




265-477 



WO 03/054152 PCT/US02/39555 



Table 4B 
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ID 
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E-value 


Score 


Repeats 


Position 


1750 


ATP-synt A 


ATP synthase A chain 


8.1 


-80.8 


i 


837-958 


1751 


SDF 


S odium : dicarboxyl ate 
symporter family 


2.7 


-254.2 


1 


275-426 


1753 


ig 


Immunoglobulin domain 


9.9e-19 


75.7 




49-124:162- 

219:261-310:345- 

400:432-496 


1753 


fn3 


Fibronectin type III domain 


0.00052 


26.8 


J 


514-606 


1755 


sugar_tr 


Sugar (and other) 
transporter 


0.13 


-144.2 




1-366 


1755 


PUCC 


PUCC protein 


2.3 


-267.4 


i 


21-314 


1755 


Nuc_H_symp 
ort 


Nucleoside H+ symporter 


4.4 


-267.2 


1 


1-356 


1755 


7tm 5 


7TM chemoreceptor 


4.8 


-165.5 


! 


59-281 


1755 


Competence 


Competence protein 


5.1 


-101.6 




16-209 


1755 


Ammonium t 
ransp 


Ammonium Transporter 
Family 


5.4 


-242.0 


1 


67-380 


1755 


BenE 


Benzoate membrane 
transport protein 


8.1 


-335.2 


1 


69-351 


1757 


Exo_endo__ph 

OS 


Endonuclease/Exonuclease/ 
phosphatase fa 


1.3e-20 


81.9 


1 


367-788 


1758 


Kjetra 


K+ channel tetramerisation 
domain 


2.9 


-38.6 


i 


114-214 


1759 


HK 


Hydroxyethylthiazole 
kinase family 


3 


-162.1 


i — 


303-502 


1760 


Peptidase C2 
0 


Type IV leader peptidase 
family 


6.6 


-187.3 




29-187 


1762 


SSF 


Sodium: solute symporter 
family 


4.2e-256 


864.2 


— 


39-477 


1762 


MVIN 


Virulence factor MVIN 


7.1 


-250.2 




68-548 


1762 


PNTB 


NAD(P) transhydrogenase 
beta subunit 


8.1 


-394.0 




11-300 


1762 


7tm_5 


7TM chemoreceptor 


9.1 


-170.5 




75-352 


1762 


oxidored_ql 


NADH- 

Ubiquinone/plastoquinone 
(complex I) 


9.1 


-170.9 




145-463 


1763 


LRR 


Leucine Rich Repeat 


2e-37 


137.7 


n 


61-84:85-108:109- 

132:133-156:157- 

180:181-204:205- 

228:253-276:277- 

300:301-324:349- 

371 


1763 


LRRCT 


Leucine rich repeat C- 
terminal domain 


1.9e-07 


38.2 


l 


358-411 


1763 


ig 


Immunoglobulin domain 


8.5e-05 


29.4 


l 


428-486 


1763 


LRRNT 


Leucine rich repeat N- 
terminal domain 


0.00023 


27.9 


l 


30-59 


1764 


Reprolysin 


Reprolysin (M12B) family 
zinc metalloprote 


le-21 


85.6 


l 


156-359 


1764 


tsp_l 


Thrombospondin type 1 
domain 


2.5e-06 


34.5 


l 


422-491 


1769 


C2 


C2 domain 


2.5e-66 


233.8 


2 


249-337:383-471 


1771 


cadherin 


Cadherin domain 


2.7e-79 


276.8 


5 


140-235:249- 

343:359-451:465- 

556:607-697 


1774 


transmembran 


Tetraspanin family 


6.4e-59 


209.2 


1 


458-702 
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SEQ 
ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 




e4 












1775 


7tm_l 


7 transmembrane receptor 
(rhodopsin family) 


7.9e-U5 




— 






1775 


Srg 


C.elegans Srg family 
integral membrane prote 








1 1 \J\C\f\ 

1 1 1-JvJO 


1778 


synaptobrevin 


S ynapt obre vin 


Alt* 'XO 


ion 1 

1ZU. 1 




1 1 fi-1 OR 


1779 


trefoil 


Trefoil (P-type) domain 


3.9 


-11.1 


i 


229-254 


1780 


MHC_I 


Class I Histocompatibility 
antigen, domains 


o £. ~ on 

z.oe-oy 






1 QQ 


1780 


ig 


Immunoglobulin domain 


3.3e-0o 


34.0 




1 Qzl O^Q 


1782 


ABC tran 


ABC transporter 


2.4e-19 


111 


i 


448-626 


1783 


SNF 


S odiurmneurotransmitter 
symporter famil 


0 


1189,5 




ZU3- /4/ 


1783 


Herpes_glyco 
P 


Herpesvirus glycoprotein M 


7.2 


-262.5 




4oU-oUI 


1783 


UPF0118 


Domain of unknown 
function DUF20 


8 


-131.6 




240-582 


1783 


Cytq_ox__2 


Cytochrome oxidase subunit 
II 


9 


-243.9 




467-690 


1819 


Y_phosphatas 
e 


Protein-tyrosine 
phosphatase 


1.2e-103 


357.8 




1 OT7 1 AC\A 


1819 


fh3 


Fibronectin type III domain 


1 Oct ^1 

i.zeo i 


IOC 1 

loD. 1 


/ 


634:641-719:730- 
812:819-900:908- 
990:998-1087 


1821 


7tm__l 


7 transmembrane receptor 
(rhodopsin family) 


4.4 


-116.1 


1 


206-413 


1822 


Metallophos 


Calcineurin-like 
phosphoesterase 


1.5 


2.4 


1 


66-252 
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LO 


u> 


VO 
OJ 


1— 1 

u> 


vo 
t— » 


oJ 


^ a s 


g* 

5 


c? 

s 


laut 


I—* 

I 


laut 


Si 


w 














CHAI 
NID 


O 


-J 
o 


^4 
OO 
4> 


Oh 
Oh 

u> 


4^ 
a\ 

CO 


4^ 


STAR 
TAA 


G\ 
LO 
-J 


Oh 


to 


Ov 
OJ 

-o 


on 


Oh 
\o 
oo 




» 

4^ 
CO 

to 

Oh 


U> 

4^ 
CO 

tb 
CJ\ 


Oh 

z 

ON 


3.4e-15 


8.5e-19 


Oh 
CD 


Psi Blast 


1 0.17 


0.32 


0.04 


0.01 


0.17 


0.14 


Verify 
score 


p 

'>—* 
Oh 


i 

p 

oo 


0.54 


-0.13 


0.03 


-0.09 


PMF 
score 














SEQFOL 
D score 


| PHOSPHOLIPASE A2; 


PHOSPHOLIPASE A2; 
CHAIN: NULL; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 


PHOSPHOLIPASE A2; 
CHAIN: NULL; 


Compound 


| HYDROLASE HYDROLASE, ' 


HYDROLASE HYDROLASE, 
PHOSPHOLIPASE A2, PLATELET 
AGGREGATION INHIBITOR, 2 
PBPB HEADER MODRES 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


HYDROLASE PLA2, 
PHOSPHATIDE SN-2 
ACYLHYDROLASE; HYDROLASE, 
PHOSPHOLIPASE A2, LIPID 
DEGRADATION, PRESYNAPTIC 2 
NEUROTOXIN, VENOM 


PDB annotation 
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l 5 l 



•§■ 



gs 



2 Q 

s 
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4^ 
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^ a £2 



r 



© 

3 



o 
o 
-J 



on 



0 CM 

g o 

1 a 



§1 



TO 
O 

o 
a 

a 



« o g 
o 



H 
CO 

|c 

w 
o 



H< 2 > O 

S ^ 



< jug a 

Jt 1 do 
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•-a tr 1 
H W 

s 




H trj 

8 



WO 03/054152 



PCT/US02/39555 



499 



vo 
t— » 


VO 
i— » 

u> 


VO 

i — » 
u> 


VO 




^ H w 

5 s g 


ldx5 


I — 1 


>—»■ 

ft 


1— I 

B- 




B l 


i—i 




I— 1 


)— i 




CHAI 
NID 


OO 
4^ 




o 


ft 
4^ 




STAR 
TAA 


OO 

oo 

ON 


oo 

LO 
0\ 


vo 
u> 


Ui 
CO 






OO 

z 

oo 


2.8e-14 


6.8e-29 


►— » 

? 
i — » 
-J 




Psi Blast 


-0.12 


cb 
O 
CO 


0.29 


0.24 




Verify 
score 


0.07 


oro 


0.86 


p 

CO 




PMF 
score 












SEQFOL 
D score 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, 0, 
P; THROMBOMODULIN; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, J,K,L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; . | 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, O, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N,0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


Compound 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


PDB annotation 
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VO 
OJ 


vo 
>— * 


VO 

i — * 

U) 


vo 
i — » 
u> 




2 a w 




§ 


lemn 


lemn 




Sg 
w 












CHAI 
NID 


•^3 
VO 
On 


o\ 
oo 
u> 


on 
on 
N> 


o 




STAR 
TAA 


OO 

on 
-o 


u5 


ON 

>— * 
1—* 


on 
o 
vo 






le-19 


5.6e-10 


U) 

4* 
CD 


JO 

bo 
? 
on 




Psi Blast 


-0.23 


0.02 


-0.45 


0.40 




Verify 
score 


0.83 


-0.05 


0.74 


0.86 




PMF 
score 












SEQFOL 
D score 


FIBRILLIN; CHAIN: 
NULL; 


FIBRILLIN; CHAIN: 
NULL; 


FIBRILLIN; CHAIN: 
, NULL; 


FIBRILLIN; CHAIN: 
NULL; 


CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


Compound i 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 


PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 
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SEQFOL 
D score 


| LAMININ; CHAIN: NULL: 


| LAMININ; CHAIN: NULL: 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR; CHAIN: A, B: 


INSULIN-LIKE GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: A; 


LDL RECEPTOR; CHAIN: 
A; 


LDL RECEPTOR; CHAIN: 
A; 


LDL RECEPTOR; CHAIN: 

A; 


LDL RECEPTOR; CHAIN: 
A; 


LDL RECEPTOR; CHAIN: 
A; 




Compound 








GLYCOPROTEIN GLYCOPROTEIN 


1 GLYCOPROTEIN GLYCOPROTEIN 


LIPID TRANSPORT LDL 
RECEPTOR; BETA-PROPELLER 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


MATRIX PROTEIN I 
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SEQFOL 
D score 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, 
C; 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, 
C; 


VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I1VM0 3 


t 
3 

§ 

■d 

i 

Z 


PHOSPHOLIPASE A2; 
CHAIN: A, B; 


PHOSPHOLIPASE A2; 
CHAIN: A, B; 


COAGULATION FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L; 

COAGULATION FACTOR 
VIIA (HEAVY CHAIN); 
CHAIN: H; TRIPEPTIDYL 
INHIBITOR; CHAIN: C; 


Compound 






BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 


BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 




LIPID DEGRADATION 
PHOSPHOLIPASE A2, LIPID 
DEGRADATION, HYDROLASE 


LIPID DEGRADATION 
PHOSPHOLIPASE A2, LIPID 
DEGRADATION, HYDROLASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 
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217.96 
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SEQFOL 
D score 


UROKINASE-TYPE 
PLASMINOGEN 


UROKINASE-TYPE 
PLASMINOGEN 
ACTIVATOR; CHAIN: A; 
UROKINASE-TYPE 
PLASMINOGEN 
ACTIVATOR; CHAIN: B; 


ELASTASE; CHAIN: P; 


ELASTASE; CHAIN: P; 


ELASTASE; CHAIN: P; 


FACTOR VII; CHAIN: 
NULL; 


PLASMINOGEN; CHAIN: 
A; 


PLASMINOGEN; CHAIN: 
A; 


PLASMINOGEN; CHAIN: 
A, 




Compound 


BLOOD CLOTTING SELECTIVE, SI 
SITE INHIBITOR, STRUCTURE- 


BLOOD CLOTTING SELECTIVE, SI 
SITE INHIBITOR, STRUCTURE- 
BASED DRUG DESIGN, 2 
UROKINASE, TRYPSIN, 
THROMBIN 


SERINE PROTEASE PPE; SERINE 
PROTEASE, HYDROLASE 


SERINE PROTEASE PPE; SERINE 
PROTEASE, HYDROLASE 


SERINE PROTEASE PPE; SERINE 
PROTEASE, HYDROLASE 


BLOOD COAGULATION BLOOD 
COAGULATION, EGF, 
HYDROLASE, SERINE PROTEASE 


HYDROLASE SERINE PROTEASE, 
FIBRINOLYSIS, LYSINE-BINDING 
DOMAIN, 2 PLASMINOGEN, 
KRINGLE 2, HYDROLASE 


HYDROLASE SERINE PROTEASE, 
FIBRINOLYSIS, LYSINE-BINDING 
DOMAIN, 2 PLASMINOGEN, 
KRINGLE 2, HYDROLASE 


HYDROLASE SERINE PROTEASE, 
FIBRINOLYSIS, LYSINE-BINDING 
DOMAIN, 2 PLASMINOGEN, 
KRINGLE 2, HYDROLASE 


HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
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PMF 1 






170.95 


60.11 




65.46 




SEQFOL 
D score 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T, U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T, U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 
CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T,U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 
CHAIN: C; 


HUMAN MATRIX 
METALLOPROTEINASE 
2; CHAIN: A; 


METALLOPROTEINASE 
2; CHAIN: A; 


HUMAN MATRIX 


PLASMINOGEN; ICEA 7 
CHAIN: A, B; ICEA 8 


ACTIVATOR; CHAIN: A; 
UROKINASE-TYPE 
PLASMINOGEN 
ACTIVATOR; CHAIN: B; 


Compound 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


HYDROLASE COL-2; BETA SHEET, 
ALPHA HELIX, HYDROLASE 


HYDROLASE COL-2; BETA SHEET, 
ALPHA HELIX, HYDROLASE 


SERINE PROTEASE KIPG; ICEA 10 


BASED DRUG DESIGN, 2 
UROKINASE, TRYPSIN, 
THROMBIN 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



511 



vo 

ON 


vo 
I—* 
ON 


vo 

t— * 
ON 


vo 
t— ' 

ON 


VO 

(— * 

ON 


VO 

> — i 

ON 


vo 

ON 




^ a ss 


Idva 


Idva 






>— * 
Ou 








S3 

w 


r 


r 


w 


CO 


> 


> 


> 




CHAI 
NID 




ON 
bO 


1 




to" 
vo 


oo 






STAR 
TAA 


s 


to 
o 


ON 
£3 


oo 


ON 

oo 
to 


ON 
OO 

to 


OO 






4.2e-19 


L4e-27 


? 
OO 


4.2e-82 


7e-95 


7e-95 


to 

bo 

CD 
1 

oo 
vo 




Psi Blast 
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64.99 
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179.88 


187.88 




176.67 




SEQFOL 
D score 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: 
H, I; DES-GLA FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L, M; (DPN)-PHE- 
ARG; CHAIN: C, D; 


DES-GLA FACTOR VILA 
(HEAVY CHAIN); CHAIN: 
H, I; DES-GLA FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L, M; (DPN)-PHE- 
ARG; CHAIN: C,D; 
PEPTIDE E-76; CHAIN: X, 
Y; 


DELTA- 

CHYMOTRYPSIN; 
CHAIN: A, C; DELTA- 
CHYMOTRYPSIN; 
CHAIN: B, D; 


DELTA- 

CHYMOTRYPSIN; 
CHAIN: A, C; DELTA- 
CHYMOTRYPSIN; 
CHAIN: B, D; 


PLASMINOGEN; CHAIN: 
A, B, C, D; 


PLASMINOGEN; CHAIN: 
A, B, C, D; 


PLASMINOGEN; CHAIN: 
A, B, C, D; 


CHAIN: C; | 


Compound 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


KETONE 


HYDROLASE DELTA- 
CHYMOTRYPSIN, PEPTIDIC 
INHIBIOR, CHLOROMETHYL 


HYDROLASE DELTA- 
CHYMOTRYPSIN, PEPTIDIC 
INHIBIOR, CHLOROMETHYL 
KETONE 


BLOOD CLOTTING 
PLASMINOGEN, CATALYTIC 
DOMAIN 


BLOOD CLOTTING 
PLASMINOGEN, CATALYTIC 
DOMAIN 


BLOOD CLOTTING 
PLASMINOGEN, CATALYTIC 
DOMAIN 
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SEQFOL 
D score 


ENTEROPEPTIDASE; 
CHAIN: A; 

ENTEROPEPTIDASE; 
CHAIN: B; VAL-ASP- 
ASP-ASP-ASP-LYS 
PEPTIDE; CHAIN: C; 


ENTEROPEPTIDASE; 
CHAIN: A; 

ENTEROPEPTIDASE; 
CHAIN: B;VAL-ASP- 
ASP-ASP-ASP-LYS 
PEPTIDE; CHAIN: C; 


FIBRONECTIN; CHAIN: 
A; 


FIBRONECTIN; CHAIN: 
A; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, O, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E,F,G,H; 


PEPTIDE E-76; CHAIN: X, 
Y; 


Compound 


HYDROLASE/HYDROLASE 
INHIBITOR ENTEROKINASE, 
HEAVY CHAIN; ENTEROKINASE, 
LIGHT CHAIN; 
ENTEROPEPTIDASE, 
TRYPSINOGEN ACTIVATION, 2 
HYDROLASE/HYDROLASE 
INHIBITOR 


HYDROLASE/HYDROLASE 
INHIBITOR ENTEROKINASE, 
HEAVY CHAIN; ENTEROKINASE, 
LIGHT CHAIN; 
ENTEROPEPTIDASE, 
TRYPSINOGEN ACTIVATION, 2 
HYDROLASE/HYDROLASE 
INHIBITOR 


EXTRACELLULAR MATRIX 
GLYCOPROTEIN 
EXTRACELLULAR MATRIX 
GLYCOPROTEIN 


EXTRACELLULAR MATRIX 
GLYCOPROTEIN 
EXTRACELLULAR MATRIX 
GLYCOPROTEIN 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 
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170.59 




178.10 


175.26 


SEQFOL 
D score 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L; 
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 
SOLUBLE TISSUE 
FACTOR; CHAIN: T; 
5L15; CHAIN: I; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L; 
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 
SOLUBLE TISSUE 
FACTOR; CHAIN: T; 
5L15; CHAIN: I; 


TRYPSIN II, ANIONIC; 
CHAIN: A; PANCREATIC 
TRYPSIN INHIBITOR; 
CHAIN: I; 


TRYPSIN II, ANIONIC; 
CHAIN: A; PANCREATIC 
TRYPSIN INHIBITOR; 
CHAIN: I; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


ELASTASE; 1ELT 4 
CHAIN: NULL; 1ELT5 


ELASTASE; 1ELT4 
CHAIN: NULL; 1ELT5 


Compound 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, RECEPTOR ENZYME, 3 
INHIBITOR, GLA, EGF, COMPLEX 
(SERINE 4 

PROTEASE/COFACTOR/LIGAND), 
BLOOD CLOTTING 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 
PROTEASE, COMPLEX, CO- j 
FACTOR, RECEPTOR ENZYME, 3 
INHIBITOR, GLA, EGF, COMPLEX 
(SERINE 4 

PROTEASE/COFACTOR/LIGAND), 
BLOOD CLOTTING 


HYDROLASE/HYDROLASE 
INHIBITOR BPTI SERINE 
PROTEASE, TRYPSIN PRECURSOR 


HYDROLASE/HYDROLASE 
INHIBITOR BPTI SERINE 
PROTEASE, TRYPSIN PRECURSOR 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 


SERINE PROTEINASE 


SERINE PROTEINASE 
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Verify 
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PMF 
score 




178.61 


189.59 




186.43 


177.71 


171.91 


177.29 


169.75 


SEQFOL 
D score 


COAGULATION FACTOR 
XA-TRYPSIN CHIMERA; 
CHAIN: A; D-PHE-PRO- 
ARG- 

CHLOROMETHYLKETO 


COAGULATION FACTOR 
XA-TRYPSIN CHIMERA; 
CHAIN: A; D-PHE-PRO- 
ARG- 

CHLOROMETHYLKETO 
NE (PPACK) WITH 
CHAIN: I; 


TRYPSIN; CHAIN: A; 


TRYPSIN; CHAIN: A; 


TRYPSIN; CHAIN: A; 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L ! 


Compound 


COMPLEX 

(PROTEASE/INHIBITOR) TRYPSIN, 
COAGULATION FACTOR XA, 
CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 


COMPLEX 

(PROTEASE/INHIBITOR) TRYPSIN, 
COAGULATION FACTOR XA, 
CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 
COMPLEX 

(PROTEASE/INHIBITOR) 


HYDROLASE SERINE PROTEASE, 
HYDROLASE 


HYDROLASE SERINE PROTEASE, 
HYDROLASE 


HYDROLASE SERINE PROTEASE, 
HYDROLASE 




HYDROLASE ANTI-PARALLEL 
i BETA-BARREL 




HYDROLASE ANTI-PARALLEL 
BETA-BARREL 


HYDROLASE ANTI-PARALLEL 
BETA-BARREL 


HYDROLASE ANTI-PARALLEL 
BETA-BARREL 
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Psi Blast 
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Verify 
score 
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PMF 
score 




73.08 




68.60 


82.88 


176.84 


174.02 


181.30 




SEQFOL 
D score 


AP0LIP0PR0TEIN(A); 
CHAIN: A; 


APOLIPOPROTEIN(A); 
CHAIN: A; 


PLASMINOGEN; CHAIN: 
A, B;M PROTEIN; 
CHAIN: C, D; 


PLASMINOGEN; CHAIN: 
A, B;M PROTEIN; 
CHAIN: C, D; 


NKG2-D; CHAIN: A; 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H7.0) 
IGCT 3 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H 7.0) 
IGCT 3 


COAGULATION FACTOR 
XA-TRYPSIN CHIMERA; 
CHAIN: A; D-PHE-PRO- 
ARG- 

CHLOROMETHYLKETO 
NE (PPACK) WITH 
CHAIN: I; 


NE (PPACK) WITH 
CHAIN: I; 


Compound 


HYDROLASE APO(A); 
ALIPOPROTEIN(A), KRINGLE, 


HYDROLASE APO(A); 
ALIPOPROTEIN(A), KRINGLE, 
PROTEIN-LIGAND INTERACTION, 
2 LYSINE BINDING, CRYSTAL 
STRUCTURE 


BLOOD CLOTTING HUMAN 
PLASMINOGEN KRINGLE-2, 
KRINGLES, VEK-30 


BLOOD CLOTTING HUMAN 
PLASMINOGEN KRINGLE-2, 
KRINGLES, VEK-30 


APOPTOSIS HOMODIMER, CIS- 
PROLINE 






COMPLEX 

(PROTEASE/INHIBITOR) TRYPSIN, 
COAGULATION FACTOR XA, 
CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 
COMPLEX 

(PROTEASE/INHIBITOR) 


COMPLEX 

(PROTEASE/INHIBITOR) 
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Verify 
score 
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PMF 
score 


76.64 








79.24 






SEQFOL 
D score 


PLASMINOGEN; CHAIN: 


| LAMININ; CHAIN: NULL; 


PLASMINOGEN 
ACTIVATION 
PLASMINOGEN 
ACTIVATOR 
(UROKINASE-TYPE, - 
KRINGLE DOMAIN) 
1KDU3(U-PAK)(NMR, 
MINIMIZED AVERAGE 
STRUCTURE) IKDU 4 


PLASMINOGEN 
ACTIVATION 
PLASMINOGEN 
ACTIVATOR 
(UROKINASE-TYPE, 
KRINGLE DOMAIN) 
1KDU3(U-PAK)(NMR, 
MINIMIZED AVERAGE 
STRUCTURE) IKDU 4 


PLASMINOGEN 
ACTIVATION 
PLASMINOGEN 
ACTIVATOR 
(UROKINASE-TYPE, 
KRINGLE DOMAIN) 
1KDU3(U-PAK)(NMR, 
MINIMIZED AVERAGE 
STRUCTURE) IKDU 4 


MATRIX 

METALLOPROTEINASE 
2; CHAIN: A; 




Compound 


1 SERINE PROTEASE KRINGLE, 


GLYCOPROTEIN GLYCOPROTEIN | 








HYDROLASE 72 KDA TYPE IV 
COLLAGENASE, GELATINASE A; 
BETA SHEET, ALPHA HELIX 


PROTEIN-LIGAND INTERACTION, 
2 LYSINE BINDING, CRYSTAL 
STRUCTURE 
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PMF 
score 




58.07 


187.81 


182.06 




101.45 




SEQFOL 
D score 


COAGULATION FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L; 

COAGULATION FACTOR 


COAGULATION FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L; 

COAGULATION FACTOR 
VIIA (HEAVY CHAIN); 
CHAIN: H; TRIPEPTIDYL 
INHIBITOR; CHAIN: C; 


PROCARBOXYPEPTIDAS 
EA; CHAIN: A, B; 
PROPROTEINASEE; 
CHAIN: C; 

CHYMOTRYPSINOGEN 
C; CHAIN: D; 


PROCARBOXYPEPTIDAS 
EA; CHAIN: A, B; 
PROPROTEINASEE; 
CHAIN: C; 

CHYMOTRYPSINOGEN 
C; CHAIN: D; 


HYDROLASE CALCIUM- 
FREE PHOSPHOLIPASE 
A=2=(E.C.3.1.1.4)1PP2 4 


HYDROLASE(SERINE 
PROTEASE) TISSUE 
PLASMINOGEN 
ACTIVATOR KRINGLE 2 
(E.C.3.4.21.68) IPML 3 


PROTEASE) TISSUE 
PLASMINOGEN 
ACTIVATOR KRINGLE 2 
(E.C.3.4.21.68) IPML 3 


Compound 


SERINE PROTEASE FVIIA; FVIIA; 1 
BLOOD COAGULATION, SERINE 
PROTEASE 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


TERNARY COMPLEX (ZYMOGEN) 
TC, PCPA-TC; TERNARY 
COMPLEX (ZYMOGEN), SERINE 
PROTEINASE, C-TERMINAL 2 
PEPTIDASE 


TERNARY COMPLEX (ZYMOGEN) 
TC, PCPA-TC; TERNARY 
COMPLEX (ZYMOGEN), SERINE 
PROTEINASE, C-TERMINAL 2 
PEPTIDASE 
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Psi Blast 


0.34 
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Verify 
score 
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1.00 






-0.20 
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PMF 
score 




75.13 




187.74 


184.42 






50.59 




SEQFOL 
D score 


APOLIPOPROTEIN; 
CHAIN: NULL; 


APOLIPOPROTEIN; 
CHAIN: NULL; 


TRYPSIN; CHAIN: E; 
TRYPSIN INHIBITOR; 
CHAIN: I 


TRYPSIN; CHAIN: E; 
TRYPSIN INHIBITOR; 
CHAIN: I 


TRYPSIN; CHAIN: E; 
TRYPSIN INHIBITOR; 
CHAIN: I 


HYDROLASE(SERINE 
PROTEASE) 
PROTHROMBIN 
FRAGMENT 1 
(RESIDUES 1 - 156) 
COMPLEX WITH 2PF2 3 
CALCIUM 2PF2 4 


HYDROLASE(SERINE 
PROTEINASE) 
PROTHROMBIN 
FRAGMENT 1 
(RESIDUES 1 - 156) 2PF1 
3 


HYDROLASE(SERINE 
PROTEINASE) 
PROTHROMBIN 
FRAGMENT 1 
(RESIDUES 1 - 156) 2PF1 
3 




Compound 


KRINGLE KRINGLE, LYSINE 
BINDING SITE, 


KRINGLE KRINGLE, LYSINE 
BINDING SITE, 
APOLIPOPROTEIN(A) 


HYDROLASE/HYDROLASE 
INHIBITOR SERINE PROTEINASE, 
TRYPSIN INHIBITOR 


HYDROLASE/HYDROLASE 
INHIBITOR SERINE PROTEINASE, 
TRYPSIN INHIBITOR 


HYDROLASE/HYDROLASE 
INHIBITOR SERINE PROTEINASE, 
TRYPSIN INHIBITOR 
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Psi Blast 
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0.16 




Verify 
score 


-1202.08 
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-1202.08 




1.00 






-0.09 




PMF 
score 








188.52 




185.89 


67.77 






SEQFOL 
D score 


FIBRILLIN; CHAIN: 
NULL; 


TNF-RELATED 
APOPTOSIS INDUCING 
LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; 
CHAIN: A; 


TNF-RELATED 
APOPTOSIS INDUCING 
LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; 
CHAIN: A; 


BETA TRYPSIN; CHAIN: 
NULL; 


BETA TRYPSIN; CHAIN: 
NULL; 


BETA TRYPSIN; CHAIN: 
NULL; 


PLASMINOGEN; CHAIN: 
A, B; 


APOLIPOPROTEIN; 
CHAIN: NULL; 




Compound 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN- 1 
FRAGMENT, MATRIX PROTEIN 


APOPTOSIS TRAIL; DR5; LIGAND- 
RECEPTOR COMPLEX, TRIMERIC 
JELLY-ROLL, TNF-R 2 
SUPERFAMILY, APOPTOSIS 


APOPTOSIS TRAIL; DR5; LIGAND- 
RECEPTOR COMPLEX, TRIMERIC 
JELLY-ROLL, TNF-R 2 
SUPERFAMILY, APOPTOSIS 


SERINE PROTEASE HYDROLASE, 
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN. SIGNAL 


SERINE PROTEASE HYDROLASE, 
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN. SIGNAL 


SERINE PROTEASE HYDROLASE, 
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN. SIGNAL 


SERINE PROTEASE SERINE 
PROTEASE, KRINGLE 5, HUMAN 
PLASMINOGEN. FIBRINOLYSIS 


KRINGLE KRINGLE, LYSINE 
BINDING SITE, 
APOLIPOPROTEINrA^ 
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SEQFOL 
D score 


| INSULIN-LIKE GROWTH 


INSULIN-LIKE GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: A; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN:A,B,C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


Compound 


HORMONE RECEPTOR HORMONE 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 




IMMUNE SYSTEM RET A. MODULE 1 


| IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
j SIGNALLING PROTEIN 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 
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SEQFOL 
D score 


ANTISTASIN; CHAIN: 
NULL; 


ANTISTASIN; CHAIN: 
NULL; 


NULL; 


1 ANTISTASIN: CHAIN: 1 


ANTISTASIN; CHAIN: 
NULL; 


ANTISTASIN; CHAIN: 
NULL; 


GLYCOPROTEIN I; 
CHAIN: A; 


Compound 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


CONSENSUS REPEAT, SUSHI, 
COMPLEMENT CONTROL 
PROTEIN, 2 N-GLYCOSYLATION, 
MULTI-DOMAIN, MEMBRANE 
ADHESION 
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SEQFOL 
D score 


COAGULATION FACTOR 
VILA (LIGHT CHAIN); 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


LAMININ; CHAIN: NULL; | 


Compound 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 
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Verify 
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PMF 
score 




52.74 


54.30 


69.15 




SEQFOL 
D score 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, 0 5 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M, N, 0, 
P; THROMBOMODULIN; 
CHAIN: I,J,K,L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: 
H, I; DES-GLA FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L, M; (DPN)-PHE- 
ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, 
Y; 


ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, 
Y; 


Compound 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 
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Verify 
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PMF 
score 


71.76 










SEQFOL 
D score 


FIBRILLIN; CHAIN: 
NULL; 


THROMBIN LIGHT 
CHAIN; CHAIN:A,B,C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N,0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-I^GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, I, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; ! 


Compound 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD 141 
ANTIGEN; EGR-CMK SERINE I 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 
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D score 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


FAB ANTIBODY LIGHT 
CHAIN; CHAIN: L; FAB 
ANTIBODY HEAVY 
CHAIN; CHAIN: H; 


IGGl ANTIBODY 1696 
(LIGHT CHAIN); CHAIN: 
L; IGGl ANTIBODY 1696 
(VARIABLE HEAVY 
CHAIN); CHAIN: H; IGGl 
ANTIBODY 1696 
(CONSTANT HEAVY 
CHAIN); CHAIN: I; 


IGGl ANTIBODY 1696 
(LIGHT CHAIN); CHAIN: 
L; IGGl ANTIBODY 1696 
(VARIABLE HEAVY 
CHAIN); CHAIN: H; IGGl 
ANTIBODY 1696 
(CONSTANT HEAVY 
CHAIN); CHAIN: I; 


HEAVY CHAIN V 
REGIONS; CHAIN: C; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: D; 


Compound 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


IMMUNE SYSTEM ANTI-PRION 
FAB 3F4; ANTI-PRION FAB 3F4 
ANTI-PRION ANTIBODY, FAB 3F4 




IMMUNE SYSTEM 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl FAB 
FRAGMENT, CROSS-REACTIVITY, 
HIVl PROTEASE, ENZYME 2 
INHIBITION. IMMUNOGLOBULIN 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl FAB 
FRAGMENT, CROSS-REACTIVITY, 
HTVl PROTEASE, ENZYME 2 
INHIBITION, IMMUNOGLOBULIN 
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SEQFOL 
D score 


T-CELL SURFACE 
GLYCOPROTEIN CD4; 
CHAIN: A, B; 


E8 ANTIBODY; CHAIN: 
L,H; CYTOCHROME C; 
CHAIN: F; 


E8 ANTIBODY; CHAIN: 
L, H; CYTOCHROME C; 
CHAIN: F; 


TRL9 FAB; CHAIN: L,H; 


SM3 ANTIBODY; CHAIN: 
L, H; PEPTIDE EPITOPE; 
CHAIN: P; 


L; IMMUNOGLOBULIN 
IGGl-KAPPA ANTIBODY 
(HEAVY CHAIN); CHAIN: 
H; 


Compound 


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 


COMPLEX 

(ANTIBODY/ELECTRON 
TRANSPORT) FAB E8; CYT C, 
ANTIGEN; IMMUNOGLOBULIN, 
IGGl KAPPA, FAB FRAGMENT, 
HORSE 2 CYTOCHROME C, 
COMPLEX 

(ANTIBODY/ELECTRON 
TRANSPORT) 


COMPLEX 

(ANTIBODY/ELECTRON 
TRANSPORT) FAB E8; CYT C, 
ANTIGEN; IMMUNOGLOBULIN, 
IGGl KAPPA, FAB FRAGMENT, 
HORSE 2 CYTOCHROME C, 
COMPLEX 

(ANTIBODY/ELECTRON 
TRANSPORT) 


IMMUNOGLOBULIN TR1 .9, ANTI- 1 
THYROID PEROXIDASE, 
AUTOANTIBODY, 2 
IMMUNOGLOBULIN 


COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR 
ANTIBODY, 2 COMPLEX 
(ANTIBODY/PEPTIDE EPITOPE) 
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SEQFOL 
D score 


RAB 

GERANYLGERANYLTRA 
NSFERASE ALPHA 
SUBUNIT; CHAIN: A, C; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 1 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


U2RW 
CHAIN 
CHAIN 
CHAIN 


U2RN; 
CHAIN: 
CHAIN: 
CHAIN 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


U2RNA 
CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: A, C;U2B"; 
CHAIN: B, D; 


Compound 


P > P ffl 

o o w & 
to to a 


l HAIRPIN IV; 
:Q,R;U2A'; 
: A, C; U2 B"; 
:B,D; 


.HAIRPIN IV; 
:Q,R;U2A'; 
: A, C; U2 B"; 
:B, D; 


L HAIRPIN IV; 

Q,R;U2A'; 
:A,C;U2B"; 

B,D; 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 
GERANYLGERANYLTRANSFERAS 
E, 2.0 A 2 RESOLUTION, N- 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIB0NUCLE0PROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP.RIBONUCLEOPROTEIN 


(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 
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SEQFOL 
D score 


COAGULATION FACTOR 
X BINDING PROTEIN; 
CHAIN: A; 


NKG2-D TYPE II 
INTEGRAL MEMBRANE 
PROTEIN; CHAIN: B, A; 
MHC CLASS I CHAIN- 
RELATED PROTEIN A; 
CHAIN: C; 


NKG2-D; CHAIN: A; 


BOTROCETIN ALPHA 
CHAIN; CHAIN: A, C; 
BOTROCETIN BETA 
CHAIN; CHAIN: B, D; 


ASIALOGLYCOPROTEIN 
RECEPTOR 1; CHAIN: A; 


FLAVOCETIN-A: ALPHA 
SUBUNIT; CHAIN: A; 
FLAVOCETIN-A: BETA 
SUBUNIT; CHAIN: B 


COAGULATION FACTOR 
IX-BINDING PROTEIN A; 
CHAIN: A; 

COAGULATION FACTOR 
IX-BINDING PROTEIN B; 
CHAIN: B; 


CHAIN: A; 


Compound 


HYDROLASE/HYDROLASE 
INHIBITOR CALCIUM BRIDGING, 
DOMAIN SWAPPING 


IMMUNE SYSTEM NKG2D; MIC-A, 
MIC, PERBll; ACTIVATING NK 
CELL RECEPTOR, NKG2D, C- 
TYPE-LECTIN LIKE, MC- 2 A, 
MHC-I, COMPLEX, IMMUNE 
SYSTEM 


APOPTOSIS HOMODIMER, CIS- 
PROLINE 


TOXIN PLATELET 
COAGGLUTININ ALPHA; 
PLATELET COAGGLUTININ BETA 
VON WILLBRAND FACTOR 
MODULATOR, C-TYPE LECTIN, 
METAL- 2 BINDING, LOOP 
EXCHANGED DIMER 


SIGNALING PROTEIN HEPATIC 
LECTIN HI; C-TYPE LECTIN CRD 


MEMBRANE PROTEIN C-TYPE 
LECTIN-LIKE DOMAINS 


COLLAGEN BINDING PROTEIN IX- 
BP; IX-BP; COAGULATION 
FACTOR IX-BINDING, 
HETERODIMER, VENOM, HABU 2 
SNAKE, C-TYPE LECTIN 
SUPERFAMILY, COLLAGEN 
BINDING PROTEIN 


PROTEIN, SOLUTION BACKBONE 
FOLD, C- 2 TYPE LECTIN, 
ANTIFREEZE PROTEIN 
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SEQFOL 
D score 


SERINE/THREONINE 
PROTEIN PHOSPHATASE 
5; CHAIN: NULL; 


GLYCOSYL 

TRANSFERASE; CHAIN: 
A; 


SEA RAVEN TYPE II 
ANTIFREEZE PROTEIN; 
CHAIN: A; 


COAGULATION 
FACTORS IX/X-BINDING 
PROTEIN; CHAIN: A, B, 
C,D,E,F; 


COAGULATION FACTOR 
X BINDING PROTEIN; 
CHAIN: A; 

COAGULATION FACTOR 
X BINDING PROTEIN; 
CHAIN: B; 

COAGULATION FACTOR 
XGLA DOMAIN; CHAIN: 


COAGULATION FACTOR 
X BINDING PROTEIN; 
CHAIN: B; 

COAGULATION FACTOR 
XGLA DOMAIN; CHAIN: 
G; 


Compound 


HYDROLASE 

TETRATRICOPEPTIDE, TRP; 
HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN 
INTERACTIONS, TPR, 2 SUPER- 
HELIX, X-RAY STRUCTURE 




TRANSFERASE ALPHA-BETA 
STRUCTURE 


ANTIFREEZE PROTEIN 
RECOMBINANT SEA RAVEN 
PROTEIN, SOLUTION BACKBONE 
FOLD, C- 2 TYPE LECTIN, 
ANTIFREEZE PROTEIN 


COAGULATION FACTOR BINDING 
DC/X-BP COAGULATION FACTOR 
BINDING, C-TYPE LECTIN, GLA- 
DOMAIN 2 BINDING, C-TYPE CRD 
MOTIF, LOOP EXCHANGED 
DIMER 


HYDROLASE/HYDROLASE 
INHIBITOR CALCIUM BRIDGING, 
DOMAIN SWAPPING 
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SEQFOL 
D score 


TENASCIN; CHAIN: A, B; 


INTEGRIN BETA-4 
SUBUNIT; CHAIN: A, B; 


FIBRONECTIN; CHAIN: 
NULL; 


FIBRONECTIN; CHAIN: 
NULL; 


FIBRONECTIN; CHAIN: 
NULL; 


INTERLEUKIN-6 
RECEPTOR BETA 
CHAIN; CHAIN: A; 
VIRAL IL-6; CHAIN: B; 


FIBRONECTIN; CHAIN: 
A; 


> 


Compound 


STRUCTURAL PROTEIN 
TENASCIN, FIBRONECTIN TYPE- 
III, HEPARIN, EXTRACELLULAR 2 
MATRIX, ADHESION, FUSION 
PROTEIN, STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN 
INTEGRIN, HEMIDESMOSOME, 
FIBRONECTIN, CARCINOMA, 
STRUCTURAL 2 PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


CYTOKINE GP130; FUNCTIONAL 
INTERLEUKIN-6 HOMOLOG; 
CYTOKINE/RECEPTOR COMPLEX, 
GP130, VIRAL IL-6, CRYSTAL 2 
STRUCTURE 


HEPARIN AND INTEGRIN 
BINDING HEPARIN AND 
INTEGRIN BINDING 


BINDING HEPARIN AND 
INTEGRIN BINDING 
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SEQFOL 
D score 


IGG2A KAPPA 
ANTIBODY CB41 (LIGHT 
CHAIN); CHAIN: A; 
IGG2A KAPPA 
ANTIBODY CB41 
(HEAVY CHAIN); CHAIN: 
B; PEPTIDE 5; CHAIN: C; 


IMMUNOGLOBULIN 
NMC-4 IGGl; CHAIN: L; 
IMMUNOGLOBULIN 
NMC-4 IGGl; CHAIN: H; 
VON WILLEBRAND 
FACTOR; CHAIN: A; 


HYBRIDOMA 
ANTIBODY LA2 (LIGHT 
CHAIN); CHAIN: A, C; 
HYBRIDOMA 
ANTIBODY LA2 (HEAVY 
CHAIN); CHAIN: B, D; 
OUTER SURFACE 
PROTEIN A; CHAIN: E, F; 


5G9; CHAIN: L, H; 


1 IMMUNOGLOBULIN FAB 


(ANTIBODY/ANTIGEN) 
FAB FRAGMENT OF THE 
MONOCLONAL 
ANTIBODY F9. 13.7 
(IGGl) 1FBI3 
COMPLEXED WITH 
LYSOZYME(E.C3.2.1.17) 
1FBI4 


Compound 


COMPLEX (ANTIBODY/PEPTIDE) 
POLYSPECIFICITY, 
CROSSREACTIVITY, FAB- 
FRAGMENT, PEPTIDE, 2 HIV-1 


I IMMUNE SYSTEM VON 
WILLEBRAND FACTOR, 
GLYCOPROTEIN IBA (A:ALPHA) 
BINDING, 2 COMPLEX 
(WILLEBRAND/IMMUNOGLOBULI 
N)> BLOOD COAGULATION TYPE 
3 2B VON WILLEBRAND DISEASE 


IMMUNE SYSTEM LA2 FAB; LA2 
FAB; OSPA; OSPA, LYME 
DISEASE, ANTIBODY FAB 
FRAGMENT, NEUTRALIZING 2 
EPITOPE 


IMMUNOGLOBULIN FAB, FAB 
LIGHT CHAIN, FAB HEAVY 
CHAIN; ANTIBODY, FAB, ANTI-TF, 
MONOCLONAL, MURINE, 
IMMUNOGLOBULIN 
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62.32 


73.51 




124.34 






SEQFOL 
D score 


TRANSCRIPTIONAL 
REPRESSOR TUPI; 


TRANSCRIPTIONAL 
REPRESSOR TUPI; 
CHAIN: A, B, C; 


TRANSCRIPTIONAL 
REPRESSOR TUPI; 
CHAIN: A, B, C; 


TRANSCRIPTIONAL 
REPRESSOR TUPI; 
CHAIN: A, B, C; 


ALPHA SPECTRIN; 
CHAIN: A, B, C; 


ALPHA, BETA T-CELL 
RECEPTOR CHAIN: A, B; 


ALPHA, BETA T-CELL 
RECEPTOR CHAIN: A, B; 


N15 ALPHA-BETA T- 
CELL RECEPTOR; 
CHAIN: A, B, C, D; H57 
FAB; CHAIN: E, F, G, H 


N15 ALPHA-BETA T- 
CELL RECEPTOR; 
CHAIN: A, B, C, D;H57 
FAB; CHAIN: E, F, G, H 


CELL RECEPTOR BETA 
CHAIN; CHAIN: E; 


Compound 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED- COILS, 
STRUCTURAL PROTEIN 


RECEPTOR TCR; T-CELL, 
RECEPTOR, TRANSMEMBRANE, 
GLYCOPROTEIN, SIGNAL 


RECEPTOR TCR; T-CELL, 
RECEPTOR, TRANSMEMBRANE, 
GLYCOPROTEIN, SIGNAL 


COMPLEX 

(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) COMPLEX 
(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) 


COMPLEX 

(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) COMPLEX 
(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) 
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score 


59.18 


149.69 


148.59 
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141.89 




SEQFOL 
D score 


MONOCLONAL 
ANTIBODY D1.3; CHAIN: 


PROSTAGLANDIN H2 
SYNTHASE-1; CHAIN: A, 
B; 


PROSTAGLANDIN H2 
SYNTHASE-1; CHAIN: A, 
B; 


MYELOPEROXIDASE; 
CHAIN: A, B; 
MYELOPEROXIDASE; 
CHAIN: C, D; 


MYELOPEROXIDASE; 
CHAIN: A, B; 
MYELOPEROXIDASE; 
CHAIN: C, D; 


MYELOPEROXIDASE; 
CHAIN: A, B; 
MYELOPEROXIDASE; 
CHAIN: C, D; 


CHAIN: C, D; 


Compound 


COMPLEX 

(IMMUNOGLOBULIN/HYDROLAS 


OXIDOREDUCTASE COX-1; EGF 
DOMAIN, NSAID BINDING, 
IBUPROFEN, MEMBRANE- 
BINDING 2 DOMAIN, 
CYCLOOXYGENASE, 
PEROXIDASE, 
OXIDOREDUCTASE, 3 
DIOXYGENASE. PEROXIDASE 


OXIDOREDUCTASE COX-1; EGF 
DOMAIN, NSAID BINDING, 
IBUPROFEN, MEMBRANE- 
BINDING 2 DOMAIN, 
CYCLOOXYGENASE, 
PEROXIDASE, 
OXIDOREDUCTASE, 3 
DIOXYGENASK PEROXIDASE 


OXIDOREDUCTASE HEME- 
PROTEIN, PEROXIDASE, 
OXIDOREDUCTASE, 
PEROXIDASE- 2 BROMIDE 
COMPLEX 


OXIDOREDUCTASE HEME- 
PROTEIN, PEROXIDASE, 
OXIDOREDUCTASE, 
PEROXIDASE- 2 BROMIDE 
COMPLEX 


OXIDOREDUCTASE HEME- 
PROTEIN, PEROXIDASE, 
OXIDOREDUCTASE, 
PEROXIDASE- 2 BROMIDE 
COMPLEX 


PEROXIDASE- 2 BROMIDE 
COMPLEX 
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59.67 


50.53 


56.08 




SEQFOL 
D score 


FAB B7-15A2; CH/ 
H; 


HLA-A 0201; CHAI 
BETA-2 

MICROGLOBULIN 
CHAIN: B; TAX PE 
CHAIN: C; T CELL 
RECEPTOR AL?¥U 
CHAIN: D; T CELL 
RECEPTOR BETA; 
CHAIN: E; 








IGG4REA; CHAIN 
AN IGM/LAMBDA 
CHAIN: H, L; 


FAB FRAGMENT, 
ANTIBODY A5B7; 
CHAIN: A, B, C, D; 


MONOCLONAL 
ANTIBODY Dl. 3; ( 
L,H; 


MONOCLONAL 
ANTIBODY D1.3;( 
L, H; 


Bl-8; CHAIN: L, H 
N,J; 








A, B; LYSOZYME; 
CHAIN: C; 


Compound 




g 
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PTIDE; 
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DHAIN: 




^HAIN: 






j — i 














IMMUNOGLOBULIN HUMAN FAB, 
ANTI-TETANUS TOXOID, HIGH 


COMPLEX (MHCMRAL 
PEPTIDE/RECEPTOR) HLA-A2 
HEAVY CHAIN; CLASS I MHC, T- 
CELL RECEPTOR, VIRAL PEPTIDE, 
2 COMPLEX (MHC/VIRAL 
PEPTIDE/RECEPTOR 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN), RHEUMATOID FACTOR 2 
AUTO-ANTIBODY COMPLEX 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, FAB 
FRAGMENT 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, VARIANT 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, VARIANT 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN. HAPTRN 


E) COMPLEX 

(IMMUNOGLOBULIN/HYDROLAS 
E), IMMUNOGLOBULIN V 2 
REGION, SIGNAL, HYDROLASE, 
GLYCOSIDASE, BACTERIOLYTIC 
3 ENZYME, EGG WHITE 
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RECEPTOR 


IMMUNE SYSTEM IMMUNE 
SYSTEM 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 


SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 
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Verify 
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-1202.08 
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-1202.08 


PMF 
score 


64.42 


67.36 












SEQFOL 
D score 


METAL CHELATASE 
CATALYTIC ANTIBODY; 
CHAIN: A, C; METAL 


FC GAMMA RIIB; 
CHAIN: A; 


FC GAMMA RIIB; 
CHAIN: A; 


MHC CLASS INK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MUSCLE PROTEIN TITIN 
MODULE M5 
(CONNECTIN) 1TNM3 
(NMR, MINIMIZED 
AVERAGE STRUCTURE) 
1TNM4 1TNM 58 


P58-CL42 KIR; CHAIN: 
NULL; 


TITIN; CHAIN: NULL; 


Compound 

XTT TT T . 


IMMUNE SYSTEM METAL 
CHELATASE, CATALYTIC 
ANTIBODY, FAB FRAGMENT. 


IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 


IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLORT IT JN 


< 
t 
t 


INHIBITORY RECEPTOR KILLER 
CELL INHIBITORY RECEPTOR; 
INHIBITORY RECEPTOR, 
NATURAL KILLER CELLS, 
IMMUNOLOGICAL 2 RECEPTORS, 
IMMUNOGLORT IT .TN FOT n 


UN 1 KA5 1 ERIC REGULATION 
MUSCLE PROTEIN CONNECTIN, 
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN 
FOLD, ALTERNATIVE SPLICING, 
SIGNAL. 3 MTTSr.T.R PBHTetxt 
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Verify 
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114.02 




91.26 




61.11 


65.01 




SEQFOL 
D score 


23S RRNA; CHAIN: 0; 5S 
RRNA; CHAIN: 9; 
RIBOSOMAL PROTEIN 
L2; CHAIN: A; 


PATHOGENESIS- 
RELATED PROTEIN 
P14A; CHAIN: NULL; 


PATHOGENESIS- 
RELATED PROTEIN 
P14A; CHAIN: NULL; 


PATHOGENESIS- 
RELATED PROTEIN 
P14A; CHAIN: NULL; 


TROPOMYOSIN; CHAIN: 
A, B, C, D 


COMPLEX(ANTIBODY- 
ANTIGEN) IG*G1 FAB 
FRAGMENT (HY/HEL$- 
10) AND LYSOZYME 
(E.C.3.2.L17)3HFM4 
COMPLEX 3HFM5 


CHELATASE 
CATALYTIC ANTIBODY; 
CHAIN: B,D; 


Compound 


RIBOSOME 50S RIBOSOMAL 
PROTEIN L2P, HMAL2, HL4; 50S 
RIBOSOMAL PROTEIN L3P, 
HMAL3, HLl; SOS RIBOSOMAL 


PATHOGENESIS-RELATED 
PROTEIN PATHOGENESIS- 
RELATED LEAF PROTEIN 6, 
ETHYLENE PATHOGENESIS- 
RELATED PROTEIN, PR-1 
PROTEINS. 2 PLANT DPFENSP 


PATHOGENESIS-RELATED 
PROTEIN PATHOGENESIS- 
RELATED LEAF PROTEIN 6, ! 
ETHYLENE PATHOGENESIS- 
RELATED PROTEIN, PR-1 
PROTEINS. 2 PLANT np.PRMSP 


PATHOGENESIS-RELATED 
PROTEIN PATHOGENESIS- 
RELATED LEAF PROTEIN 6, 
ETHYLENE PATHOGENESIS- 
RELATED PROTEIN, PR-1 
PROTEINS. 2 PLANT nPPRNcp 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 




IMMUNE 2 SYSTEM 
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Verify 
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PMF 
score 


166.40 


129.28 










SEQFOL 
D score 


H-2 CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, K-B CHAIN: 


H-2 CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, K-B CHAIN: 
A; BETA-2- 
MICROGLOBULIN; 
CHAIN: B; 
NUCLEOCAPSID 
PROTEIN; CHAIN: P; 


HLA-CW3 (HEAVY 
CHAIN); CHAIN: A; 
BETA-2- 

MICROGLOBULIN; 
CHAIN: B; PEPTIDE 
FROMIMPORTIN 
ALPHA-2; CHAIN: C; 
NATURAL KILLER CELL 
RECEPTOR KIR2DL2; 
CHAIN: D, E; 


HLA-DR1 CLASS II 
HISTOCOMPATIBILITY 
PROTEIN; CHAIN: A, B, 
D, E, G, H, J, K; HLA-A2; 
CHAIN: CFJ, L; 


B*0801; CHAIN: A; BETA- 
2 MICROGLOBULIN; 
CHAIN: B; HIV-1 GAG 
PEPTIDE (GGKKKYKL- 
INDEX PEPTIDE); 
CHAIN: C; 




Compound 


IMMUNE SYSTEM SEV9; MAJOR 
HISTOCOMPATIBILITY COMPLEX 
PEPTIDE-MHC 


IMMUNE SYSTEM SEV9; MAJOR 
HISTOCOMPATIBILITY COMPLEX 
PEPTIDE-MHC 


IMMUNE SYSTEM MHC, HLA, 
CLASS I, KIR, NKCELL 
RECEPTOR, IMMUNOGLOBULIN 2 
FOLD, RECEPTOR/MHC COMPLEX 


COMPLEX (MHC 
PROTEIN/ANTIGEN) DRA, DRBl 
01010; COMPLEX (MHC 
PROTEIN/ANTIGEN), 
i HISTOCOMPATIBILITY ANTIGEN 


HISTOCOMPATIBILITY COMPLEX 
B8; B2M; PEPTIDE HLA B8, HIV, 
MHC CLASS I, 

HISTOCOMPATIBILITY COMPLEX 


COMPLEX 

(TRANSMEMBRANE/GLYCOPROT 
EIN) 
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SEQFOL 
D score 


HISTOCOMPATIBILITY 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN HLA-E; 
CHAIN: A, C; BETA-2- 
MICROGLOBULIN; 
CHAIN: B, D; PEPTIDE 
(VMAPRTVLL); CHAIN: 


MHC CLASS II HLA-DQ8; 
CHAIN: A; MHC CLASS II 
HLA-DQ8; CHAIN: B; 
INSULIN B PEPTIDE; 
CHAIN: C; 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, A-2 CHAIN: A; 
BETA-2- 

MICROGLOBULIN; 
CHAIN: B; MELANOMA- 
ASSOCIATED ANTIGEN 
4; CHAIN: C; 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN 
CLASS I 

HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 
4 


A; BETA-2- 
MICROGLOBULIN; 
CHAIN: B; 
NUCLEOCAPSID 
PROTEIN; CHAIN: P; 


Compound 


IMMUNE SYSTEM 


MAJOR HISTOCOMPATIBILITY 
COMPLEX MHC NONCLASSICAL 
CHAIN, MHC-E, HLA-E, MHC 
CLASS HLA-E, HLA E, MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
MHC, HLA, 2 BETA 2 
MICROGLOBULIN, PEPTIDE, 
LEADER PEPTIDE, 3 NON- 
CLASSICAL MHC, CLASS IB MHC 


IMMUNE SYSTEM HLA-DQ8, 
INSULIN B PEPTIDE, TYPE 1 
DIABETES, AUTOIMMUNITY 


IMMUNE SYSTEM MAGE-4 
ANTIGEN; MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
HUMAN LEUKOCYTE ANTIGEN, 2 
MELANOMA-ASSOCIATED 
ANTIGEN 
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SEQFOL 
D score 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


QCRP2 (LIM1); CHAIN: 
NULL; 


QCRP2(LIM1); CHAIN: 
NULL; 




Compound 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


BINDING PROTEIN, ZINC 2 
FINGER 
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SEQFOL 
D score 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 


TRIACYL-GLYCEROL- 
HYDROLASE; CHAIN: D, 
E; 


CHOLESTEROL 
ESTERASE; CHAIN: 
NULL; 


HYDROLASE(CARBOXY 
LIC ESTERASE) LIPASE 
(E.C.3.1.1.3) 
TRIACYLGLYCEROL 
HYDROLASE ITHG 3 


PROLYL 

AMINOPEPTIDASE; 
CHAIN: A; 


HYDROXYNITRILE 
LYASE; CHAIN: A; 




TRIACYLGLYCEROL 
HYDROLASE; CHAIN: D; 
TRIACYLGLYCEROL 
HYDROLASE: CHAIN: E: 


PARA-NITROBENZYL 
ESTERASE; CHAIN: A; 




Compound 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 


LIPASE LIPASE; LIPASE, 
HYDROLASE, 
PSEUDOMONADACEAE, 
COVALENT INTERMEDIATE, 2 
TRIGLYCERIDE ANALOGUE, 
ENANTIOSELECTMTY 


HYDROLASE BILE SALT 
ACTIVATED LIPASE, BILE SALT 
STIMULATED HYDROLASE, 
SERINE ESTERASE, LIPASE 




HYDROLASE ALPHA BETA 
HYDROLASE FOLD, PROLINE, 
PROLYL AMINOPEPTIDASE, 2 
SERRATIA, IMINOPEPTIDASE 


LYASE OXYNITRILE LYASE; 
OXYNITRILASE, CYANOGENESIS, 
CYANHYDRIN FORMATION, 
LYASE 


HYDROLASE 

PSEUDOMONADACEAE, CIS- 
PEPTIDE, CLOSED 
CONFORMATION, 2 HYDROLASE, 
LID 


HYDROLASE PNB ESTERASE; 
ALPHA-BETA HYDROLASE 
DIRECTED EVOLUTION 


GLYCOSYLATED PROTEIN | 
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Psi Blast 
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1 
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-0.41 




Verify 
score 








i — ' 

o 
o 


j 0.51 


i— » 

o 
o 




PMF 
score 


[57.17 


54.94 


77.51 










SEQFOL 
D score 


| QGSR ZINC FINGER 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B, C; 


DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


Compound 


COMPLEX (ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


ZINC FINGER, DNA-BINDING 
PROTEIN 
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to 
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to 




to 
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STAR 
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4.2e-37 


¥ 

OO 


3.4e-55 


1.4e-36 




Psi Blast 


-0.14 




0.01 


-0.47 






Verify 
score 


0.76 




0.90 


0.21 






PMF 
score 




91.96 






90.49 




SEQFOL 
D score 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S^ 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, Q E, F; 


CHAIN: B,C,E,F; 


Compound 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 
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STAR 
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2.8e-30 


1 — k 
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8.4*31 


8.4e-37 




Psi Blast 


-0.19 


-0.45 


! 0.13 






Verify 
score 


0.80 


0.07 


o 






PMF 
score 








68.08 




SEQFOL 
D score 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


CHAIN: B, C,E,F; 


Compound 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


(TRANSCRIPTION 

REGULATION/DNA), RNA ! 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 
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STAR 
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4^ 
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to 
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AA 


END 1 
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tO 
CD 

LO 
On 


3.4e-47 
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4^ 
? 
On 


to 
bo 

? 
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Psi Blast 




-0.33 


-0.24 


-0.42 




score 


Verify 1 PMF 1 




0.62 


0.90 


0.81 




score 


66.22 








89.38 


SEQFOL 
D score 


YYl; CPIAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


Compound 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
| REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 

| (TRANSCRIPTION 

i REGULATION/DNA) 
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Psi Blast 


-0.02 


0.00 


0.13 


0.19 


! 0.16 




0,07 


0.18 




Verify 
score 


0.87 


0.99 


0.99 


0.96 


1.00 
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o 
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0.95 




PMF 
score 












59.85 








SEQFOL 
D score 


ZINC FINGER PROTEIN 
GUI; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTETN/DNA^ 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTF.TN/nNA^i 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTRTN/DNA^ 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTETN/DNA^i 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTETM/TYNA^ 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDTNG PROTFTM/TWA^ 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTETN/DNA^ 


PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
fDNA-BINDING PROTFTN/DNA^ 
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Verify 
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PMF 
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102.41 






51.43 






62.95 




SEQFOL 
D score 


CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


CRPl; CHAIN: A; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


QCRP2 (LIMl); CHAIN: 
NULL; 


ZINC FINGER PROTEIN 
QUI; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GUI; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


LIM DOMAIN CONTAINING 
PROTEINS LIM DOMAIN 
CONTAINING PROTEINS, METAL- 
BINDING PROTEIN, ZINC 2 
FINGER 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTF.TWnTvJA^ 


{UlNA-mNDlJNCj PROTEIN/DNA) 
COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTETN/DNA1 
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Verify 
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PMF 
score 


69.70 
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62.37 


53.17 








50.28 








SEQFOL 
D score 


CYSTEINE RICH 
INTESTINAL PROTEIN; 
CHAIN: NULL; 
LAMININ; CHAIN: NULL; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


CYSTEINE AND 
GLYCINE-RICH PROTEIN 
CRP2; CHAIN: A; 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


AVIAN CYSTEINE RICH 
PROTEIN; ICTL 3 


CRPl; CHAIN: A; 




Compound 


METAL-BINDING PROTEIN CRIP; 
METAL-BINDING PROTEIN, LIM 
DOMAIN PROTEIN 
GLYCOPROTEIN GLYCOPROTEIN 


IMMUNE SYSTEM BETA, MODULE 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PR OTFJM 


SIGNALING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS, 
METAL-BINDING PROTRTN 


SIGNALING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS, 

METAL-RTNnrNJG PP OTPTM 


SIGNALING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS, 
METAL-RTNDTNO PPOTPm 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


METAL-BINDING PROTEIN LIM 
DOMAIN CONTAINING PROTEINS 
ICTL 15 


CONTRACTILE LIM DOMAIN, 
CRP,NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


DIFFERENTIATION, 
CONTRACTTLR 
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Verify 
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PMF 
score 


51.64 




98.02 






86.13 


87.11 






SEQFOL 
D score 


SUBSTRATE BINDING 
DOMAIN OF DNAK; 
CHAIN: A; SUBSTRATE 
PEPTIDE (7 RESIDUES); 
CHAIN: B; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 1 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 




Compound 


COMPLEX (MOLECULAR 
CHAPERONE/PEPTIDE) DNAK, 
HEAT SHOCK PROTEIN 70 KDA 
(HSP70), COMPLEX 2 
(MOLECULAR 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


REGULATION/DNA) | 
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Verify 
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0.19 
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PMF 
score 












50.66 




SEQFOL 
D score 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


AGGLUTININ 
ISOLECTIN 
VI/AGGLUTININ 
ISOLECTIN V; CHAIN: A; 


AGGLUTININ 
ISOLECTIN 
VI/AGGLUTININ 
ISOLECTIN V; CHAIN: A; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: 
H, I; DES-GLA FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L 5 M;(DPN)-PHE- 
ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, 
Y; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 


DNAK; CHAIN: A, B; 
PEPTIDE SUBSTRATE; 
CHAIN: C, D; 




Compound 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN 


SUGAR BINDING PROTEIN UDA; 
LECTIN, HEVEIN DOMAIN, UDA, 
SUPERANTIGEN 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


COMPLEX (MOLECULAR 
CHAPERONE/PEPTIDE) DNAK, ! 
HEAT SHOCK PROTEIN 70 KDA 
(HSP70), COMPLEX 2 
(MOLECULAR 
CHAPERONE/PEPTIDE) 


CHAPERONE/PEPTIDE) 
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Psi Blast 
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Verify 
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PMF 
score 


94.61 


68.14 


64.18 


| 71.49 i 




92.61 


90.73 


93.40 


93.82 


63.01 


SEQFOL 
D score 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 


HUMAN BETA2- 
GLYCOPROTEIN I; 
CHAIN: A; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


| LAMNIN; CHAIN: NULL; | 


| LAMNIN; CHAIN: NULL; | 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


Compound 




MEMBRANE ADHESION SHORT 
CONSENSUS REPEAT, SUSHI, 
COMPLEMENT CONTROL 
PROTEIN, 2 N-GLYCOSYLATION, 
MULTI-DOMAIN, MEMBRANE 
ADHESION 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


1 GLYCOPROTEIN GLYCOPROTEIN 1 


| GLYCOPROTEIN GLYCOPROTEIN 1 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 


I 

I 
t 

j 

I 

1 

( 

1 


2 

SI 
< 

5 

> 
D 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 
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Verify 
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PMF 
score 


95.25 






77.80 


98.08 


269.60 






SEQFOL 
D score 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


TRANSFORMING 
PROTEIN P21/H-RAS-1; 
CHAIN: A; 


CYCLOPHILIN A; 
CHAIN: A; PEPTIDE 
FROM THE HIV-1 
CAPSID PROTEIN; 
CHAIN: B; 


CYCLOPHILIN A; 
CHAIN: A; PEPTIDE 
FROM THE HIV-1 
CAPSID PROTEIN; 
CHAIN: B; 


| (ISOLECTIN2)9WGA3 


Compound 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
1 COMPLEX (ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


SIGNALING PROTEIN G PROTEIN, 
GTP HYDROLYSIS, KINETIC 
CRYSTALLOGRAPHY, 2 
SIGNALING PROTEIN 


COMPLEX (ISOMERASE/PEPTIDE) 
COMPLEX (ISOMERASE/PEPTIDE), 
CYCLOPHILIN A, HIV-1 CAPSID, 2 
PSEUDO-SYMMETRY 


COMPLEX (ISOMERASE/PEPTIDE) 
COMPLEX (ISOMERASE/PEPTIDE), 
CYCLOPHILIN A, HIV-1 CAPSID, 2 
PSEUDO-SYMMETRY 
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Verify 
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-1202.08 
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-1202.08 


PMF 
score 


58.28 






90.19 


89.69 






SEQFOL 
D score 


TRANSCRIPTION 
FACTOR IIIA; CHAIN: A; 
5S RNA GENE; CHAIN: E, 
F; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


Compound 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) TFIIIA; 5S 
GENE; NMR, TFIIIA, PROTEIN, 
DNA, TRANSCRIPTION FACTOR, 
5S RNA 2 GENE, DNA BINDING 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX TZINC FINGER/DNA 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA^ 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA^ 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA^ 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA t 
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Verify 
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PMF 
score 


76.13 


53.64 










SEQFOL 
D score 


ZINC FINGER PROTEIN 
GUI; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 




Compound 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


RECOGNITION, 3 COMPLEX 

(TRANSCRIPTION 

REGULATION/DNA) 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



703 



1035 


1035 


1035 


1035 


1035 


1035 


1035 


1035 


1035 


Z H go 


to 


to 


2gli 


2gli 


2gH 


2gli 


2gli 


to 

ft 


to 


S3 

u 


> 


> 


> 


> 


> 


> 


> 


> 


> 


CHAI 
NID 


vo 


VO 


i — > 
-O 


co 

OO 
VO 


CO 


CO 
CO 

CO 


to 

VO 

-o 


to 

VO 

-o 


>-* 


STAR 
TAA 


to 
to 


Cn 




Ch 
to 
o 


Ch 

g 


& 

t — k 


4^ 
VO 

to 


4^ 
CO 
Q\ 


CO 

4^ 


s 


l.le-26 


1.2e-61 


8.4e-35 


to 
bo 
? 

CO 

cn 


-J 
? 


7e-35 


4.5e-65 


4.5e-65 


to 
bo 
? 

CO 
CO 


Psi Blast 


0.07 


0.16 


p 
>—* 

VO 


p 

CO 


0.04 


0.54 


^ 0.00 






Verify 
score 


-1202.08 


-1202.08 


t 

to 

s 

o 

OO 


-1202.08 


-1202.08 


! -1202.08 


-1202.08 






PMF 
score 
















98.88 


82.37 


SEQFOL 
D score 


ZINC FINGER PROTEIN 
GUI; CHAIN: A; DNA; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
! (DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
i (DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 
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SEQFOL 
D score 


IGGl ANTIBODY 1696 
(LIGHT CHAIN); CHAIN: 
L; IGGl ANTIBODY 1696 
(VARIABLE HEAVY 
CHAIN); CHAIN: H; IGGl 
ANTIBODY 1696 


IG HEAVY CHAIN V 
REGIONS; CHAIN: A; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: B; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: C; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: D; 


IG HEAVY CHAIN V 
REGIONS; CHAIN: A; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: B; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: C; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: D; 


CHIMERIC 
DECARBOXYLASE 
1 ANTIBODY 21D8; 
CHAIN: L; CHIMERIC 
DECARBOXYLASE 
ANTIBODY 21D8; 
CHAIN: H; 


IMMUNOGLOBULIN 
OPG2 FAB, CONSTANT 
DOMAIN; CHAIN: L; 
IMMUNOGLOBULIN 
OPG2 FAB, VARIABLE 
DOMAIN; CHAIN: H; 


Compound 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl; 
IMMUNOGLOBULIN, IGGl FAB 
FRAGMENT, CROSS-REACTIVITY, 
HIVl PROTEASE, ENZYME 2 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, FAB 
COMPLEX, IDIOTOPE, ANTI- 
IDIOTOPE 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, FAB 
COMPLEX, IDIOTOPE, ANTI- 
IDIOTOPE 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, CATALYTIC 
ANTIBODY, CHIMERIC FAB, 2 
DECARBOXYLASE, HAPTEN 
COMPLEX 


IMMUNE SYSTEM 
IMMUNOGLOBULIN 
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SEQFOL 
D score 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1; VP2; CHAIN: 2; VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1; VP2; CHAIN: 2; VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1;VP2; CHAIN:2;VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


AXONIN-1; CHAIN: A; 


CH3 DOMAIN OF MAK33 
ANTIBODY; CHAIN: A, 
B; 


(CONSTANT HEAVY 
CHAIN); CHAIN: I; 


Compound 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLKMRUS- 
RECEPTOR COMPLEX, 
VIRUSMRAL PROTEIN, 
RECEPTOR 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 
RECEPTOR COMPLEX, 
VIRUSMRAL PROTEIN, 
RECEPTOR 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


IMMUNE SYSTEM CONSTANT 
DOMAIN, Cl-SUBSET, 
IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


INHIBITION, IMMUNOGLOBULIN 
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SEQFOL 
D score 


IGG2A MONOCLONAL 
ANTIBODY (LIGHT 
CHAIN); CHAIN: L; 
IGG2A MONOCLONAL 
ANTIBODY (HEAVY 
CHAIN); CHAIN: H; 
FMDV PEPTIDE; CHAIN: 


CATALYTIC ANTIBODY | 
17E8 COMPLEXED WITH 
PHENYL [1-(1-N- 
SUCCINYLAMINO)PENT 
YL]1EAP3 

PHOSPHONATE 1EAP4 


TAB2; CHAIN: L, M; 
TAB2; CHAIN: H, I; 
CYCLIC PEPTIDE; 
CHAIN: P, Q 


TAB2; CHAIN: L, M; 
TAB2; CHAIN: H, I; 
CYCLIC PEPTIDE; 
CHAIN: P, Q 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC RECEPTOR 
CHAIN: C; FC 
FRAGMENT OF HUMAN 
IGGl; CHAIN: A, B; 


IMMUNOGLOBULIN 
LAMBDA HEAVY 
CHAIN; CHAIN: A, B; 
ENGINEERED PEPTIDE; 
CHAIN: E, F; 




Compound 


IMMUNE SYSTEM FMDV, 
ANTIGENIC-ANTIBODY 
INTERACTIONS, RGD MOTIF, G-H 
LOOP2 0FVP1. 




COMPLEX (ANTIBODY/ANTIGEN) 
CROSS-REACTIVITY, PROTEIN- 
PEPTIDE RECOGNITION 


COMPLEX (ANTIBODY/ANTIGEN) 
CROSS-REACTIVITY, PROTEIN- 
PEPTIDE RECOGNITION 


COMPLEX CD16; IGGl-FC 
COMPLEX, FC FRAGMENT, IGG, 
FC, RECEPTOR, CD16, GAMMA 


IMMUNE SYSTEM FC IGG PHAGE 
DISPLAY PEPTIDE 


RECEPTOR COMPLEX, 
VIRUS/VIRAL PROTEIN, 
RECEPTOR 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



708 



o 

ft 



^98 



8" 

to 



j2 



a 



o 



4^ 
oo 



O 



18 



H eg 



o 
o 



s 



p 
J— * 



o 
o 











3 O 



n 

© 

I 

o 

0 

3 



WO 03/054152 



PCT/US02/39555 



709 



1042 


1042 


1042 


1042 


1042 




2 H S3 


B- 


i — » 


i — * 


>—>■ 


p— » 

f 




S3 

w 


\ J 




> 








CHAI 
NID 


LO 

oo 

ON 


o 


i— ' 


4^ 


CO 




STAR 
TAA 










CM 

t— ' 
4^ 






? 
CM 
O 


4^ 
to 

? 


7e-66 


CO 
4^ 

1 


70-51 




Psi Blast 


iro 


0.03 


-0.02 


0.14 


p 

(— * 




Verify 
score 


0.01 


-0.06 


0.06 


0.07 ; 


0.48 




PMF 
score 














SEQFOL 
D score 


MAJOR POLLEN 
ALLERGEN BET Vl-A; 
CHAIN: A, D,G,J; 
IMMUNOGLOBULIN 
KAPPA LIGHT CHAIN; 
CHAIN: B, E, H, K; 
ANTIBODY HEAVY 
CHAIN FAB; CHAIN: C, F, 


IMMUNOGLOBULIN 
IGG2A FAB FRAGMENT 
(FAB17-IA) 
(ORTHORHOMBIC 
CRYSTAL FORM) IFOR 3 


BLUE FLUORESCENT 
ANTIBODY (19G2)- 
HEAVY CHAIN; CHAIN: 
H, A; BLUE 
FLUORESCENT 
ANTIBODY (19G2)- 
LIGHT CHAIN; CHAIN: L, 
B; 


MONOCLONAL 
ANTIBODY MAK33; 
CHAIN: L; 
MONOCLONAL 
ANTIBODY MAK33; 
CHAIN: H; 


IMMUNOGLOBULIN FAB 
5G9; CHAIN: L, H; 


AND FRAGMENT B OF 
PROTEIN A COMPLEX 
1FC24 


Compound 


IMMUNE SYSTEM BET V I-A, 
BETVI ALLERGEN; BV16 FAB- 
FRAGMENT, KAPPA MOPC21 
CODING SEQUENCE; HEAVY 
CHAIN OF THE MONOCLONAL 
ANTIBODY MST2; BET V 1, BV16 
FAB FRAGMENT, ANTIBODY 
ALLERGEN COMPLEX 




\ 

c 
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I 

c 
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E 


IMMUNE SYSTEM 

TA/fA/TTTXT/^r'T rYDT TT T1S.T XICW T\ 


IMMUNE SYSTEM FAB, BIP, 
CRYSTAL STRUCTURE 


IMMUNOGLOBULIN FAB, FAB 
LIGHT CHAIN, FAB HEAVY 
CHAIN; ANTIBODY, FAB, ANTI-TF, 
MONOCLONAL, MURINE, 
IMMUNOGLOBULIN 
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SEQFOL 
D score 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H, K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H,K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IGG2A KAPPA 
ANTIBODY CB41 (LIGHT 
CHAIN); CHAIN: A; 
IGG2A KAPPA 
ANTIBODY CB41 
(HEAVY CHAIN); CHAIN: 
B; PEPTIDE 5; CHAIN: C; 


MAJOR POLLEN 
ALLERGEN BET Vl-A; 
CHAIN: A, D, G, J; 
IMMUNOGLOBULIN 
KAPPA LIGHT CHAIN; 
CHAIN: B, E, H, K; 
ANTIBODY HEAVY 
CHAIN FAB; CHAIN: C,F, 
I, L; 




Compound 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


COMPLEX (ANTIBODY/PEPTIDE) 
POLYSPECIFICITY, 
CROSSREACTIVITY, FAB- 
FRAGMENT, PEPTIDE, 2 HIV-1 


IMMUNE SYSTEM BET V I-A, 
BETVI ALLERGEN; BV16 FAB- 
FRAGMENT, KAPPA MOPC21 
CODING SEQUENCE; HEAVY 
CHAIN OF THE MONOCLONAL 
ANTIBODY MST2; BET V 1, BV16 
FAB FRAGMENT, ANTIBODY 
ALLERGEN COMPLEX 




PDB annotation 



WO 03/054152 



PCT/US02/39555 



711 



to 



OQ 



25 B £2 



93 



td 



td 



td 



td 



td 



H 



4^ 



3, 



p 



O 

o 



o 2 



o 



o 

3 



g 

CD 



l a 

C6 O 











s 

td 

i 
1 



%1 



9 ° 



WO 03/054152 



PCT/US02/39555 



713 



1042 


1042 


1042 


1042 


1042 


1042 




z! a ^ 






if 


org 




lmlb 




S3 

w 


X 


X 


X 


X 




w 




CHAI 
NID 


4^ 
oo 


u> 

Os 




4^. 
OO 


4^ 

^4 


u> 

ON 




STAR 
TAA 


1 


Oi 

OO 


o 




bO 
O 


Ui 
i— ' 
VO 




3 




Lo 


4^ 


4^ 


2.8e 


)— 1 

45- 

? 




Psi] 


to 


cp 
o 


? 

ON 


? 
->4 
U> 




O 




Blast 


0.10 


0.30 


-0.17 


0.17 


0.02 


0.16 




Verify 
score 


-0.11 


0.28 


0.04 


-0.05 


-0.13 


0.07 




PMF 
score 
















xrx g 

n? o 






























tr 1 


HEMAGGLUTININ (HAl 
CHAIN); CHAIN: A; 
HEMAGGLUTININ (HA2 
CHAIN); CHAIN: B; 
IMMUNOGLOBULIN 
IGGl-KAPPA ANTIBODY 
(LIGHT CHAIN); CHAIN: 
L; IMMUNOGLOBULIN 
IGGl-KAPPA ANTIBODY 


HEMAGGLUTININ (HAl 
CHAIN); CHAIN: A; 
HEMAGGLUTININS 
CHAIN); CHAIN: B; 
IMMUNOGLOBULIN 
IGGl-KAPPA ANTIBODY 
(LIGHT CHAIN); CHAIN: 
L; IMMUNOGLOBULIN 
IGGl-KAPPA ANTIBODY 
(HEAVY CHAIN); CHAIN: 
H; 


ANTIBODY; CHAIN: L,H; 


IGG2A=KAPPA=; 1PLG4 
CHAIN: L,H; 1PLG5 


IMMUNOGLOBULIN 
$P/F$C(PRIME) 
FRAGMENT OF AN 
IG*G1 1PFC4 


IMMUNOGLOBULIN FAB 
D44.1 (IGG1,KAPPA) 
(BALB/C MOUSE, 
MONOCLONAL 
ANTIBODY) IMLB 5 


1MCP 4 l| 


Compound 


VIRAL PROTEIN/IMMUNE 
SYSTEM COMPLEX 
(HEMAGGLUTININ/IMMMUNOGL 
OBULIN), HEMAGGLUTININ, 2 
IMMUNOGLOBULIN, VIRAL 
PROTEIN/IMMUNE SYSTEM 


VIRAL PROTEIN/IMMUNE 
SYSTEM COMPLEX 
(HEMAGGLUTININ/IMMMUNOGL 
OBULIN), HEMAGGLUTININ, 2 
IMMUNOGLOBULIN, VIRAL 
PROTEIN/IMMUNE SYSTEM 


IMMUNOGLOBULIN FAB, GD2- 
GANGLIOSIDE, CARBOHYDRATE, 
MELANOMA, IMMUNOGLOBULIN 


IMMUNOGLOBULIN 
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SEQFOL 
D score 


METAL CHELATASE 
CATALYTIC ANTIBODY; 
CHAIN: A, C; METAL 
CHELATASE 


FC GAMMA RIIB; 
CHAIN: A; 


IMMUNOGLOBULIN 
IG*A FAB FRAGMENT 1 
(J539)(GALACTAN- 
BINDING) 2FBJ 3 


IGG5C8; CHAIN: L,H; 


IG ANTIBODY D2.3 
(LIGHT CHAIN); CHAIN: 
L; IG ANTIBODY D2.3 
(HEAVY CHAIN); CHAIN: 
H; 


E8 ANTIBODY; CHAIN: 
L,H; CYTOCHROME C; 
CHAIN: F; 


TR1.9 FAB; CHAIN: L,H; 


(HEAVY CHAIN); CHAIN: 
H; 


Compound 


IMMUNE SYSTEM METAL 
CHELATASE, CATALYTIC 
ANTIBODY, FAB FRAGMENT, 
IMMUNE 2 SYSTEM 


IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 




CATALYTIC ANTIBODY 
CATALYTIC ANTIBODY, FAB, 
RING CLOSURE REACTION 


IMMUNE SYSTEM ABZYME, 
TRANSITION STATE ANALOG, 
IMMUNE SYSTEM 


! COMPLEX 

1 (ANTIBODY/ELECTRON 
1 TRANSPORT) FAB E8; CYT C, 
ANTIGEN; IMMUNOGLOBULIN, 
IGGl KAPPA, FAB FRAGMENT, 
HORSE 2 CYTOCHROME C, 
COMPLEX 

(ANTIBODY/ELECTRON 
TRANSPORT) 


IMMUNOGLOBULIN TR1.9, ANTI- 
THYROID PEROXIDASE, 
AUTOANTIBODY, 2 
IMMUNOGLOBULIN 
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D score 


i CYCLIN-DEPENDENT 


CYCLIN-DEPENDENT 
KINASE 6; CHAIN: A; 
P19INK4D; CHAIN: B; 


P19INK4D CDK4/6 
INHIBITOR; CHAIN: 
NULL; 


GA BINDING PROTEIN 
ALPHA; CHAIN: A; GA 
BINDING PROTEIN BETA 
1; CHAIN: B; DNA; 
CHAIN: D, E; 


TUMOR SUPPRESSOR 
P16INK4A; CHAIN: 
NULL; 


CARDIOTOXIN 
CARDIOTOXIN III (NMR, 
13 STRUCTURES) 2CRS 3 


CYTOTOXIN TOXIN 
GAMMA 

(CARDIOTOXIN) 1TGX 3 


BENCE-JONES PROTEIN 
WIL, A VARIABLE 
DOMAIN FROM CHAIN: 
A,B; 




Compound 


1 HORMONE/GROWTH FACTOR 1 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 
CONTROL, ALPHA/BETA, 
COMPLEX (INHIBITOR 
PROTEIN/KINASE) 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


COMPLEX (TRANSCRIPTION 
! REGULATION/DNA) GABPALPHA; 
GABPBETAl; COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), DNA- 
BINDING, 2 NUCLEAR PROTEIN, 
ETS DOMAIN, ANKYRIN 
REPEATS, TRANSCRIPTION 3 
FACTOR 


ANTI-ONCOGENE CELL CYCLE, 
ANTI-ONCOGENE, REPEAT, ANK 
REPEAT 








IMMUNE SYSTEM 
IMMUNOGLOBULIN, BENCE- 
JONES PROTETN. LAMBDA-6 


ADHESION MOLECULE | 
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SEQFOL 
D score 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN 
CLASS I 

HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 
4 


HISTOCOMPATIBILITY 
ANTIGEN MURINE 
CLASS I MAJOR 
HISTOCOMPATIBILITY 
COMPLEX CONSISTING 
1H0C3 0FH-2D=B= 
B2-MICROGLOBULIN, 
AND A 9-RESIDUE 
PEPTIDE 1HOC4 


H-2 CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, K-B CHAIN: 
A; BETA-2- 
MICROGLOBULIN; 
CHAIN: B; 
NUCLEOCAPSID 
PROTEIN; CHAIN: P; 


MICROGLOBULIN; 
CHAIN: B; PEPTIDE 
FROMIMPORTIN 
ALPHA-2; CHAIN: C; 
NATURAL KILLER CELL 
RECEPTOR KIR2DL2; 
CHAIN: D, E; 


Compound 








IMMUNE SYSTEM SEV9; MAJOR 
HISTOCOMPATIBILITY COMPLEX 
PEPTIDE-MHC 


FOLD, RECEPTOR/MHC COMPLEX 
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72.43 




73.09 
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SEQFOL 
D score 


LEUCOCYTE 
IMMUNOGLOBULIN- 


LEUCOCYTE 
IMMUNOGLOBULIN- 
LIKE RECEPTOR-1; 
CHAIN: A; 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC REGION 
CHAIN: A; 


CHAIN: A; 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC REGION 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC REGION 
CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


CHAIN); CHAIN: L; 
ANTIBODY FAB 
FRAGMENT (HEAVY 
CHAIN); CHAIN: H 


Compound 


IMMUNE SYSTEM LEUKOCYTE 
INHIBITORY RECEPTOR-1; 


IMMUNE SYSTEM LEUKOCYTE 
INHIBITORY RECEPTOR-1; 
LEUKOCYTE IMMUNOGLOBULIN 
FOLD, 3-10 HELIX 


IMMUNE SYSTEM RECEPTOR 
BETA SANDWICH, 
IMMUNOGLOBULIN-LIKE, 
RECEPTOR 


IMMUNE SYSTEM RECEPTOR 
BETA SANDWICH, 
IMMUNOGLOBULIN-LIKE, 
RECEPTOR 


IMMUNE SYSTEM RECEPTOR 
BETA SANDWICH, 
IMMUNOGLOBULIN-LIKE, 
RECEPTOR 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
i CD32 


IMMUNE SYSTEM, MEMBRANE 
' PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
CD32 


CELL ADHESION 
! IMMUNOGLOBULIN 

SUPERFAMILY, BETA-SANDWICH 
| FOLD 


FRAGMENT, INTERLEUKIN- 2 2, 
X-RAY ANALYSIS, CRYSTAL 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



734 




WO 03/054152 



PCT/US02/39555 



735 




WO 03/054152 



PCT/US02/39555 



736 




WO 03/054152 



PCT/US02/39555 



737 



5! a 
o e w 



P3 
era 
p- 



SI 



0*\ 



to 

oo 



o 2 

» 4 s 



to 
o 



o 

3 



I 

CD 



GO g 

3 O 

r 



00 
GO 



o 



-** 3 §5 g £ 3 

C w p > 2: 



" '' S3 ^ w > : 




n m w 





4 
§5 



n 
© 

1 

o 



£ Q 









WO 03/054152 



PCT/US02/39555 



738 




WO 03/054152 



PCT/US02/39555 



739 



1071 


1071 


1071 


1071 


1071 




*2 H w> 


lhsb 


lhsb 


Ihsa 


Ihsa 


lhoc 




93 

w 


> 


> 


> 


> 


> 




CHAI 
NID 


to 

ON 




ON 




s 




STAR 
TAA 


to 
vo 

LO 


to 

VO 


to 
vo 

OO 


vo 


to 
vo 
cn 




s 


O 


o 


O 


o 


o 




Psi Blast 


0.46 




0.49 








Verify 
score 


-1202.08 




-1202.08 








PMF 
score 




166.59 
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SEQFOL 
D score 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN 
CLASS I 

HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 
4 


HISTOCOMPATIBILITY 
ANTIGEN HUMAN 
CLASS I 

HISTOCOMPATIBILITY 
ANTIGEN IHSA 3 /HLA- 
B(ASTERISK)2705$ IHSA 
4 


HISTOCOMPATIBILITY 
ANTIGEN MURINE 
CLASS I MAJOR 
HISTOCOMPATIBILITY 
COMPLEX CONSISTING 
1H0C3 0FH-2D=B= 
B2-MICROGLOBULIN, 
AND A 9-RESIDUE 
PEPTIDE IHOC 4 


PROTEIN; CHAIN: P; | 


Compound 
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SEQFOL 
D score 


INTEGRIN; CHAIN: 
NULL; 


IMMUNOGLOBULIN 
NMC-4 IGGl; CHAIN: L; 
IMMUNOGLOBULIN 
NMC-4 IGGl; CHAIN: H; 
VON WILLEBRAND 
FACTOR; CHAIN: A; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
QD; 


INTEGRIN; CHAIN: A; 
COLLAGEN; CHAIN: B, 
QD; 


INTEGRIN ALPHA-1; 
CHAIN: A, B; 


Al DOMAIN OF VON 
WILLEBRAND FACTOR; 
CHAIN: NULL; 


VON WILLEBRAND 
FACTOR; CHAIN: A, B; 


INTEGRIN ALPHA 2 
BETA; CHAIN: A, B; 


IMMUNOGLOBULIN FAB 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD1 8 
2FGW 3 ANTIBODY 'H52' 
(HUH52-OZ FAB) 2FGW 4 


IMMUNOGLOBULIN FAB 
2FB4 4 


Compound 


CELL ADHESION PROTEIN A- | 
DOMAIN INTEGRIN, CELL \ 


IMMUNE SYSTEM VON 
WILLEBRAND FACTOR, 
GLYCOPROTEIN IBA (A:ALPHA) 
BINDING, 2 COMPLEX 
(WILLEBRAND/IMMUNOGLOBULI 
N), BLOOD COAGULATION TYPE 
3 2B VON WILLEBRAND DISEASE 


INTEGRIN INTEGRIN, COLLAGEN 


INTEGRIN INTEGRIN, COLLAGEN 


STRUCTURAL PROTEIN I- 
DOMAIN, METAL BINDING, 
COLLAGEN, ADHESION 


WILLEBRAND WILLEBRAND, 
BLOOD COAGULATION, 
PLATELET, GLYCOPROTEIN 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 


INTEGRIN INTEGRIN, CELL 
ADHESION, GLYCOPROTEIN 
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Verify 
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score 


PMF 1 


74.70 


90.36 


81.19 
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74.07 


! 79.59 


82.96 








SEQFOL 
D score 


OLIGO-1,6- 

GLUCOSIDASE; CHAIN: 
NULL; 


OLIGO-1,6- 

GLUCOSIDASE; CHAIN: 
NULL; 


ALPHA-AMYLASE; 
CHAIN: A; 


ALPHA AMYLASE (PPA); 
CHAIN: A; 


GLUCAN 1,4-ALPHA- 
MALTOTETRAHYDROL 
ASE; CHAIN: A; 


ALPHA-AMYLASE; 
CHAIN: A; 


CYCLODEXTRIN 
GLYCOSYLTRANSFERA 
SE; ICIU 6 CHAIN: NULL; 
ICIU 7 


ALPHAl BETAl 
INTEGRIN; CHAIN: A; 
ALPHAl BETAl 
INTEGRIN; CHAIN: B; 


CDllA; 1LFA 5 CHAIN: 
A, B; 1LFA6 




Compound 


GLUCOSIDASE GLUCOSIDASE, 
SUGAR DEGRADATION, 
HYDROLASE, TIM-BARREL 2 


GLUCOSIDASE GLUCOSIDASE, 
SUGAR DEGRADATION, 
HYDROLASE, TIM-BARREL 2 
GLYCOSIDASE, HYDROLASE 


HYDROLASE "MALTOGENIC" 
ALPHA AMYLASE; AMYLASE, 
GLYCOSIDE HYDROLASE, 
STARCH DEGRADATION 


HYDROLASE ALPHA-AMYLASE, 
INHIBITOR, CARBOHYDRATE, 
PANCREAS 




HYDROLASE BETA-ALPHA- 
BARREL. BETA SHEET 


HYDROLASE HYDROLASE, 
AMYLASE, FAMILY 13 


GLYCOSIDASE CGTASE; ICIU 8 
THERMOSTABLE ICIU 14 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


CELL ADHESION LFA-1, ALPHA- 1 
L\,BETA-2 INTEGRIN, A-DOMAIN; 
1LFA 8 | 


ADHESION PROTEIN, 
GLYCOPROTEIN, 
EXTRACELLULAR 2 MATRIX, 
CYTOSKELETON 
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SEQFOL 
D score 


1 DNA-BINDINGHIGH 


DNA-BINDINGHIGH 
MOBILITY GROUP 
PROTEIN FRAGMENT-B 
(HMGB) (DNA-BINDING 
IHME 3 HMG-BOX 
DOMAIN B OF RAT 
HMGl) (NMR, 1 
STRUCTURE) IHME 4 


HIGH MOBILITY GROUP 
1 PROTEIN; CHAIN: A; 
DNA(5'-D(*CP*CP*(IDO) 
CHAIN: B; DNA (5'- 
CHAIN: C; 


HIGH MOBILITY GROUP 
1 PROTEIN; CHAIN: A; 
DNA(5*-D(*CP*CP*(IDO) 
CHAIN: B; DNA (5- 
CHAIN: C; 


NON HISTONE PROTEIN 
6 A; CHAIN: A; 


HIGH MOBILITY GROUP 
PROTEIN; lAAB 5 
CHAIN: NULL; lAAB 6 


HIGH MOBILITY GROUP 
PROTEIN; lAAB 5 
CHAIN: NULL; lAAB 6 




Compound 






GENE REGULATION/DNA HMG-1, 
AMPHOTERIN, HEPARIN-BINDING 
PROTEIN P30; HIGH-MOBILITY 
GROUP DOMAIN, BENT DNA, 
PROTEIN-DRUG-DNA 2 COMPLEX, 
GENE REGULATION/DNA 


GENE REGULATION/DNA HMG-1, 
AMPHOTERIN, HEPARIN-BINDING 
PROTEIN P30; HIGH-MOBILITY 
GROUP DOMAIN, BENT DNA, 
PROTEIN-DRUG-DNA 2 COMPLEX, 
GENE REGULATION/DNA 


DNA BINDING PROTEIN HMG 
BOX, DNA BENDING, DNA 
RECOGNITION, CHROMATIN, 
NMR, DNA 2 BINDING PROTEIN 


DNA-BINDING HMGA DNA- 
BINDING HMG-BOX DOMAIN A 
OF RAT HMGl; lAAB 8 HMG-BOX 
i 1 AAB 20 


DNA-BINDING HMGA DNA- 
BINDING HMG-BOX DOMAIN A 
OF RAT HMGl; lAAB 8 HMG-BOX 
1AAB20 


GLYCOSIDASE, HYDROLASE | 
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424.95 
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SEQFOL 
D score 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; ALPHA-l- 
ANTITRYPSIN; CHAIN: 
B; TRYPSIN; CHAIN: C; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; ALPHA-l- 
ANTITRYPSIN; CHAIN: 
B; TRYPSIN; CHAIN: C; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; ALPHA-l- 
ANTITRYPSIN; CHAIN: 
B; TRYPSIN; CHAIN: C; 


Compound 


SERINE PROTEASE INHIBITOR 
ALPHA-l-PROTEINASE 
INHIBITOR, ALPHA- 1- 
ANTIPROTEINASE; SERINE 
PROTEASE INHIBITOR, SERPIN. 


SERINE PROTEASE INHIBITOR 
ALPHA-l-PROTEINASE 
INHIBITOR, ALPHA-1- 
ANTIPROTEINASE; SERINE 
PROTEASE INHIBITOR, SERPIN, 
GLYCOPROTEIN, SIGNAL, 2 
POLYMORPHISM, EMPHYSEMA, 
DISEASE MUTATION, ACUTE 
PHASE 


SERINE PROTEASE INHIBITOR 
ALPHA- 1-PROTEINASE 
INHIBITOR, ALPHA- 1- 
ANTIPROTEINASE; SERINE 
PROTEASE INHIBITOR, SERPIN, 
GLYCOPROTEIN, SIGNAL, 2 
POLYMORPHISM, EMPHYSEMA, 
DISEASE MUTATION, ACUTE 
PHASE 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEASE-INHIBITOR 
COMPLEX, SERPIN, ALPHA-l- 
ANTITRYPSIN, 2 TRYPSIN 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEASE-INHIBITOR 
COMPLEX, SERPIN, ALPHA-l- 
ANTITRYPSIN. 2 TRYPSIN 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEASE-INHIBITOR 
COMPLEX, SERPIN, ALPHA-l- 
ANTITRYPSIN, 2 TRYPSIN 
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score 


132.21 




142.48 












SEQFOL 
D score 


COMPLEMENT FACTOR 
HUMAN COMPLEMENT 
REGULATORY PROTEIN 


T-CELL SURFACE 
GLYCOPROTEIN CD4; 
CHAIN: NULL; 


COMPLEMENT 
REGULATORY PROTEIN 
CD59 (NMR, 20 
STRUCTURES) 1CDQ3 


ACID PHOSPHATASE; 
CHAIN: A; 


BETA TRYPSIN; CHAIN: 
NULL; 


TRYPSIN; CHAIN: E; 
TRYPSIN INHIBITOR; 
CHAIN: I 


SERINE PROTEINASE; 
CHAIN: NULL; 


HYDROLASE (SERINE 
PROTEINASE) TRYPSIN 
(E.C.3.4.21.4) 
COMPLEXED WITH THE 
INHIBITOR 1TRN3 
DIISOPROPYL- 
FLUOROPHOSPHOFLUO 
RIDATE (DFP) ITRN 4 
HUMAN TRYPSIN, DFP 
INHIBITED ITRN 6 


Compound 




T-CELL SURFACE 
GLYCOPROTEIN 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC, 
LIPOPROTEIN, T-CELL SURFACE 
GLYCOPROTEIN 




HYDROLASE ALL ALPHA 


SERINE PROTEASE HYDROLASE, 
SERINE PROTEASE, DIGESTION, 
PANCREAS, 2 ZYMOGEN. SIGNAL 


HYDROLASE/HYDROLASE 
INHIBITOR SERINE PROTEINASE, 
TRYPSIN INHIBITOR 


HYDROLASE HYDROLASE, 
SERINE PROTEASE 
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54.30 




SEQFOL 
D score 


APOLIPOPROTEINCII; 
CHAIN: A; 


APOLIPOPROTEIN CII; 
CHAIN: A; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; 
CHAIN: A, B; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


CD59 (EXTRACELLULAR 
1ERG 3 REGION, 
RESIDUES 1-70) (NMR, 
RESTRAINED 
MINIMIZED 1ERG4 
AVERAGE STRUCTURE) 
1ERG5 


Compound 


LIPID TRANSPORT APOC-II; 
PROTEIN-LIPID INTERACTION, 
AMPHIPATHIC ALPHA HELIX 


LIPID TRANSPORT APOC-II; 
PROTEIN-LIPID INTERACTION, 
AMPHIPATHIC ALPHA HELIX 


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN, POLYMORPHISM 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 
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SEQFOL 
D score 


PYK2-ASS0CIATED 
PROTEIN BETA; CHAIN: 
A; 


CYCLIN-DEPENDENT 
KINASE 6; CHAIN: A; 
P19INK4D; CHAIN: B; 


GA BINDING PROTEIN 
ALPHA; CHAIN: A; GA 
BINDING PROTEIN BETA 
I; CHAIN: B; DNA; 
CHAIN: D, E; 


TRANSLATIONALLY 
CONTROLLED TUMOR 
PROTEIN; CHAIN: A; 


TRANSLATIONALLY 
CONTROLLED TUMOR 
PROTEIN; CHAIN: A; 


APOLIPOPROTEINCII; 
CHAIN: A; 


APOLIPOPROTEINCII; 
CHAIN: A; 


Compound 


METAL BINDING PROTEIN ZINC- 
BINDING MODULE, ANKYRIN 
REPEATS, METAL BINDING 
PROTEIN 


COMPLEX (INHIBITOR 
PROTEIN/KINASE) INHIBITOR 
PROTEIN, CYCLIN-DEPENDENT 
KINASE, CELL CYCLE 2 
CONTROL, ALPHA/BETA, 
COMPLEX (INHIBITOR 
PROTEIN/KINASE) 


COMPLEX (TRANSCRIPTION 
, REGULATION/DNA) GABPALPHA; 
GABPBETAl; COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), DNA- 
BINDING, 2 NUCLEAR PROTEIN, 
ETS DOMAIN, ANKYRIN 
REPEATS, TRANSCRIPTION 3 
FACTOR 


l TUMOR-ASSOCIATED PROTEIN 

TCTP, P23FYP; TUMOR- 
1 ASSOCIATED PROTEIN, 

FUNCTION UNKNOWN i 


TUMOR-ASSOCIATED PROTEIN 
TCTP, P23FYP; TUMOR- 
ASSOCIATED PROTEIN, 
" FUNCTION UNKNOWN 


LIPID TRANSPORT APOC-II; 
PROTEIN-LIPID INTERACTION, 
AMPHIPATfflC ALPHA HELIX 


LIPID TRANSPORT APOC-II; 
PROTEIN-LIPID INTERACTION, 
AMPHIPATHIC ALPHA HELIX 
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118.25 






SEQFOL 
D score 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
(VERTEBRATE) ICLL 3 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
COMPLEXED WITH 
CALMODULIN-BINDING 
DOMAIN OF ICDM 3 
CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


CALCIUM-BINDING 
PROTEIN CALMODULIN 
COMPLEXED WITH 
CALMODULIN-BINDING 
DOMAIN OF ICDM 3 
CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


CALMODULIN- 
DEPENDENT PROTEIN 
KINASE II ICDM 4 


Compound 
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SEQFOL 
D score 


NEURONAL CALCIUM 
SENSOR 1; CHAIN: A, B; 


CALMODULIN; CHAIN: 
A; 


CALCIUM-BINDING 
PROTEIN NCS-1; CHAIN: 
A; 


GRANCALCIN; CHAIN: 
A, B; 


CALMODULIN; CHAIN: 
A; 


CALMODULIN; CHAIN: 
A; 


CALMODULIN; CHAIN: 
A; 


CALMODULIN; CHAIN: 
A; 


CARDIAC TROPONIN C; 
CHAIN: A; 


CARDIAC TROPONIN C; ! 
CHAIN: A; 


CALCIUM-SATURATED 
CIB; CHAIN: A 


CALMODULIN 
(VERTEBRATE); ICMF 6 
CHAIN: NULL; ICMF 7 


Compound 


METAL BINDING PROTEIN 
FREQUENIN; CALCIUM BINDING- 


METAL BINDING PROTEIN EF- 
HAND, HELIX-LOOP-HELIX, 
FRAGMENT, CALCIUM, TR2C, C- 2 
TERMINAL DOMAIN, 
CALMODULIN 


METAL BINDING PROTEIN YEAST 
FREQUENIN EF-HAND, CALCIUM 


METAL TRANSPORT PENTA-EF- 
HAND PROTEIN, CALCIUM 
BINDING PROTEIN 


METAL TRANSPORT 
CALMODULIN, HIGH 
RESOLUTION, DISORDER 


METAL TRANSPORT 
CALMODULIN, HIGH 
RESOLUTION, DISORDER 


METAL TRANSPORT 
CALMODULIN, HIGH 
RESOLUTION, DISORDER 


METAL TRANSPORT 
CALMODULIN, HIGH 
RESOLUTION, DISORDER 


STRUCTURAL PROTEIN HELIX- 
TURN-HELIX 


STRUCTURAL PROTEIN HELIX- 
TURN-HELIX 


BLOOD CLOTTING HELICAL, EF- 
HANDS, BLOOD CLOTTING 


CALCIUM-BINDING PROTEIN 
CALMODULIN APO TR2C- 
DOMAIN; ICMF 9 
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SEQFOL 
D score 


CALMODULIN; CHAIN: 
A; RS20; CHAIN: B; 


CALMODULIN; CHAIN: 
A; RS20; CHAIN: B; 


CALMODULIN; CHAIN: 
A; RS20; CHAIN: B; 


CALMODULIN; CHAIN: 
A; RS20; CHAIN: B; 


MUSCLE PROTEIN 
TROPONIN C (TRIG 
FRAGMENT) (APO 
FORM) (NMR, 1 
STRUCTURE) 1TRF3 


CALCIUM BINDING 
PROTEIN CALMODULIN 
(/TR=2=C$ FRAGMENT 
COMPRISING RESIDUES 
78- 148 1TRC 3 OF THE 
INTACT MOLECULE) 
1TRC4 


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 
1TOP3 


PROTEIN TROPONIN C 
1TOP3 


Compound 


; CALMODULIN, CALCIUM 
BINDING, HELIX-LOOP-HELIX, 


CALMODULIN, CALCIUM 
BINDING, HELIX-LOOP-HELDC, 

, SIGNALLING, 2 

1 COMPLEX(CALCIUM-BINDING 

1 PROTEIN/PEPTIDE) 


CALMODULIN, CALCIUM 
BINDING, HELIX-LOOP-HELDC, 
1 SIGNALLING, 2 
COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 


CALMODULIN, CALCIUM 
BINDING, HELIX-LOOP-HELIX, 
SIGNALLING, 2 

COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 
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163.19 


67.20 




SEQFOL 
D score 


MYOSIN; CHAIN: A, B, C; 


MYOSIN; CHAIN: A, B, C; 


MYOSIN; CHAIN: A, B, C; 


MYOSIN; CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 


SCALLOP MYOSIN; 
CHAIN: A, B, C; 




Compound 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MUSCLE 
PROTEIN, MYOSIN 
SUBFRAGMENT-1, MYOSIN HEAD, 
2 MOTOR PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN 


MUSCLE PROTEIN MYOSIN, 
CALCIUM BINDING PROTEIN, 
MUSCLE PROTEIN ! 


SIGNALLING, 2 

COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 
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Verify 
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PMF 
score 






73.01 








121.44 




SEQFOL 
D score 


AXONIN-1; CHAIN: A; 


IG HEAVY CHAIN V 
REGIONS; CHAIN: A; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: B; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: C; IG 
HEAVY CHAIN V 
REGIONS; CHAIN: D; 


ANTIBODY R24 (LIGHT 
CHAIN); CHAIN: A; 
ANTIBODY R24 (HEAVY 
CHAIN); CHAIN: B; 


COMPLEX 

(ANTIBODY/ANTIGEN) 
HYHEL-5 FAB 
COMPLEXEDWITH 
BOBWHTE QUAIL 
LYSOZYME IBQL 3 
1BQL95 


FAB FRAGMENT; 
CHAIN: L,H,J,K; 
VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR; 
CHAIN: V, W; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 




Compound 


CELL ADHESION NEURAL CELL 
ADHESION 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, FAB 
COMPLEX, IDIOTOPE, ANTI- 
IDIOTOPE 


IMMUNE SYSTEM ANTIBODY 
(FAB FRAGMENT), IMMUNE 
SYSTEM 




COMPLEX (ANTIBODY/ANTIGEN) 
FAB-12; VEGF; COMPLEX 
(ANTIBODY/ANTIGEN), 
ANGIOGENIC FACTOR 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


HOMOPHILIC ADHESION | 
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-0.26 


Verify 
score 
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0.46 
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PMF 
score 






111,26 




75.44 


72.82 




SEQFOL 
D score 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1;VP2; CHAIN: 2; VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


POLIOVIRUS RECEPTOR; 
CHAIN: R;VP1; CHAIN: 
1;VP2; CHAIN: 2; VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1; VP2; CHAIN: 2; VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


IMMUNOGLOBULIN 3D6 
FAB IDFB 3 


IMMUNOGLOBULIN 3D6 
FAB IDFB 3 


CHIMERIC GERMLINE 
PRECURSOR OF OXY- 
COPE CHAIN: L; 
CHIMERIC GERMLINE 
PRECURSOR OF OXY- 
! COPE CHAIN: H; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: C, D; 


Compound 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 
RECEPTOR COMPLEX, 
VIRUSMRAL PROTEIN, 
RECEPTOR 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 
RECEPTOR COMPLEX, 
VIRUSMRAL PROTEIN, 
RECEPTOR 






IMMUNE SYSTEM IMMUNE 
SYSTEM 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 
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SEQFOL 
D score 


TELOKIN; CHAIN: A 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN 
EPSILON RECEPTOR 
CHAIN: A; IG EPSILON 
CHAIN C REGION; 
CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN 
EPSILON RECEPTOR 
CHAIN: A; IG EPSILON 
CHAIN C REGION; 
CHAIN: B, D; 


IMMUNOGLOBULIN 
EPSILON RECEPTOR 
CHAIN: A; 


Compound 


CONTRACTILE PROTEIN 
IMMUNOGLOBULIN FOLD, BETA | 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
CD32 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN 
FOLD, GLYCOPROTEIN, 
RECEPTOR, IGE-BINDING 2 
PROTEIN, IGE ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN 
FOLD, GLYCOPROTEIN, 
RECEPTOR, IGE-BINDING 2 
PROTEIN, IGE ANTIBODY, IGE-FC 


ALPHA; IMMUNOGLOBULIN 
FOLD, GLYCOPROTEIN, 
RECEPTOR, IGE-BINDING 2 
PROTEIN 
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SEQFOL 
D score 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


| LAMININ; CHAIN: NULL; 


LDL RECEPTOR; CHAIN: 
A; 


LDL RECEPTOR; CHAIN: 
A; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I, J, K, L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


Compound 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 


| GLYCOPROTEIN GLYCOPROTEIN | 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


SERINE PROTEINASE 
; COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 
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66.80 


95.22 
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SEQFOL 
D score 


NUMB PROTEIN; CHAIN: 
A; NUMB ASSOCIATE 
KINASE; CHAIN: B; 


Xll; CHAIN: A, B; 
PEPTIDE; CHAIN: C, D 


RHODOPSIN; CHAIN: A, 
B 


RHODOPSIN; CHAIN: A, 
B 


RHODOPSIN; CHAIN: A, 
B 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN2)9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 




Compound 


SIGNALING 

PROTEIN/TRANSFERASE NAK; 
COMPLEX, SIGNAL 
TRANSDUCTION, 
PHOSPHOTYROSINE BINDING 2 


COMPLEX (PEPTIDE BINDING 
MODULE/PEPTIDE), PEPTIDE 
BINDING 2 MODULE, PTB 
DOMAIN 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, 
MEMBRANE PROTEIN, 2 RETINAL 
PROTEIN, VISUAL PIGMENT 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, 
MEMBRANE PROTEIN, 2 RETINAL 
PROTEIN, VISUAL PIGMENT 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, 
MEMBRANE PROTEIN, 2 RETINAL 
PROTEIN, VISUAL PIGMENT 






CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 
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SEQFOL 
D score 


ALPHA-LYTIC 
PROTEASE; CHAIN: 
NULL; 


ALPHA-LYTIC 
PROTEASE; CHAIN: 
NULL; 


ALPHA-LYTIC 
PROTEASE; CHAIN: 
NULL; 


ALPHA-LYTIC 
PROTEASE; CHAIN: A; 


ALKALINE PROTEASE; 
IKAP 4 CHAIN: P; IKAP 5 
TETRAPEPTIDE (GLY 
SERASN SER); IKAP 9 
CHAIN: I; IKAP 10 


CUTINASE; CHAIN: 
NULL; 


RABPHILIN 3-A; CHAIN: 
A; 


RABPHILIN 3-A; CHAIN: 
A; 


PHOSPHOLIPASEA2; 
CHAIN: NULL; 




Compound 


SERINE PROTEASE SERINE 
PROTEASE, LOW TEMPERATURE, 
HYDROLASE, 2 SERINE 


SERINE PROTEASE SERINE 
PROTEASE, LOW TEMPERATURE, 
HYDROLASE, 2 SERINE 
PROTEINASE 


SERINE PROTEASE SERINE 
PROTEASE, LOW TEMPERATURE, 
HYDROLASE, 2 SERINE 
PROTEINASE 


HYDROLASE DOUBLE BETA 
BARREL, BACTERIAL SERINE 
PROTEASE 


ZINC METALLOPROTEASE P. 
AERUGINOSA ALKALINE 
PROTEASE; IKAP 6 CALCIUM 
BINDING PROTEIN IKAP 19 


SERINE ESTERASE HYDROLASE, 
SERINE ESTERASE, 
| GLYCOPROTEIN 


ENDOCYTOSIS/EXOCYTOSIS C2- 
DOMAINS, C2B-DOMAIN, 
RABPHILIN, 

ENDOCYTOSIS/EXOCYTOSIS 


ENDOCYTOSIS/EXOCYTOSIS C2- 
DOMAINS, C2B-DOMAIN, 
RABPHILIN, 

ENDOCYTOSIS/EXOCYTOSIS 


HYDROLASE CALB DOMAIN; 
HYDROLASE, C2 DOMAIN, CALB 
DOMAIN 


DOMAIN | 
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68.15 


61.82 














SEQFOL 
D score 


TRANSCRIPTIONAL 
REPRESSOR TUPl; 


TRANSCRIPTIONAL 
REPRESSOR TUPl; 
CHAIN: A, B, C; 


RHODOPSIN; CHAIN: A, 
B 


RHODOPSIN; CHAIN: A, 
B 


ALPHA-LYTIC 
PROTEASE; CHAIN: 
NULL; 


ALPHA-LYTIC 
PROTEASE; CHAIN: 
NULL; 


ALPHA-LYTIC 
PROTEASE; CHAIN: A; 


SERINE-CARBOXYL 
PROTEINASE; CHAIN: A; 
FRAGMENT OF 
TYROSTATIN; CHAIN: I; 


CUTINASE; CHAIN: 
NULL; 




Compound 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, 
MEMBRANE PROTEIN, 2 RETINAL 
PROTEIN, VISUAL PIGMENT 


SIGNALING PROTEIN 
PHOTORECEPTOR, G PROTEIN- 
COUPLED RECEPTOR, 
MEMBRANE PROTEIN, 2 RETINAL 
PROTEIN, VISUAL PIGMENT 


SERINE PROTEASE SERINE 
PROTEASE, LOW TEMPERATURE, 
HYDROLASE, 2 SERINE 
PROTEINASE 


SERINE PROTEASE SERINE 
PROTEASE, LOW TEMPERATURE, 
HYDROLASE, 2 SERINE 
PROTEINASE 


HYDROLASE DOUBLE BETA 
BARREL, BACTERIAL SERINE 
PROTEASE 


HYDROLASE PSCP, 
PSEUDOMONAPEPSIN, 
PEPSTATIN-INSENSITIVE SERINE- 
CARBOXYL PROTEINASE 


SERINE ESTERASE HYDROLASE, 
SERINE ESTERASE, 
GLYCOPROTEIN 


PROTEINASE | 
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SEQFOL 
D score 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; 
GT-BETA; CHAIN: B; GT- 
GAMMA; CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; 
GT-BETA; CHAIN: B; GT- 
GAMMA; CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; 
GT-BETA; CHAIN: B; GT- 
GAMMA; CHAIN: G; 


GT-ALPHA/GI-ALPHA 
CHIMERA; CHAIN: A; 
GT-BETA; CHAIN: B; GT- 
GAMMA; CHAIN: G; 


TRANSCRIPTIONAL 
REPRESSOR TUPl; 
CHAIN: A, B, C; 


TRANSCRIPTIONAL 
REPRESSOR TUPl; 
CHAIN: A, B, C; 


CHAIN:A,B,C; | 


Compound 


COMPLEX (GTP- 
BINDING/TRANS DUCER) BETAl, 
TRANSDUCIN BETA SUBUNIT; 
GAMMAl, TRANSDUCIN GAMMA 


COMPLEX (GTP- 
BINDING/TRANSDUCER) BETAl, 
TRANSDUCIN BETA SUBUNIT; 
GAMMAl, TRANSDUCIN GAMMA 
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


COMPLEX (GTP- 
BINDING/TRANSDUCER) BETAl, 
TRANSDUCIN BETA SUBUNIT; 
GAMMAl, TRANSDUCIN GAMMA 
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


COMPLEX (GTP- 
BINDING/TRANSDUCER) BETAl, 
TRANSDUCIN BETA SUBUNIT; 
GAMMAl, TRANSDUCIN GAMMA 
SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 


TRANSCRIPTION INHIBITOR 
BETA-PROPELLER 
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55.92 






SEQFOL 
D score 


D 

m 
o 

a 
s 

a 
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PEPTIDE INHIBITOR; 1 
CHAIN: A, B; HIV-1 
PROTEASE 

(RETROPEPSIN); CHAIN: 
QD; 


| PROTEASE; CHAIN: A, B; | 


HIV-1 PROTEASE; 
CHAIN: A, B; 


ROUS SARCOMA VIRUS 
PROTEASE; CHAIN: A, B; 
INHIBITOR; CHAIN: C; 


ROUS SARCOMA VIRUS 
PROTEASE; CHAIN: A, B; 
INHIBITOR; CHAIN: C; 


RETROPEPSIN; CHAIN: 
A,B; 




Compound 


| HYDROLASE DUTPASE, DUTP 


HYDROLASE HIV-1 PROTEASE, 
MUTANT, DIMER, INHIBITOR, 
OCCUPANCY 


| HYDROLASE HYDROLASE | 


HYDROLASE HIV-1 PROTEASE, 
HYDROLASE 


COMPLEX 

(PROTEASE/INHIBITOR) HUMAN 
IMMUNODEFICIENCY VIRUS 
PROTEASE, ROUS SARCOMA 
VIRUS 2 PROTEASE, CRYSTAL 
STRUCTURES, PROTEIN- 
MEDIATED INTERACTION, 3 
VIRAL MATURATION, COMPLEX 
(PROTEASE/INHIBITOR) HEADER 


COMPLEX 

(PROTEASE/INHIBITOR) HUMAN 
IMMUNODEFICIENCY VIRUS 
PROTEASE, ROUS SARCOMA 
VIRUS 2 PROTEASE, CRYSTAL 
STRUCTURES, PROTEIN- 
MEDIATED INTERACTION, 3 
VIRAL MATURATION, COMPLEX 
(PROTEASE/INHIBITOR) HEADER 


HYDROLASE/HYDROLASE 
INHIBITOR HIV-1 PR; COMPLEX 
(ACID PROTEINASE/PEPTIDE) 


SUBUNIT; COMPLEX (GTP- 
BINDING/TRANSDUCER), G 
PROTEIN, HETEROTRIMER 2 
SIGNAL TRANSDUCTION 
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SEQFOL 
D score 


POL POLYPROTEIN; 
CHAIN: A; 


POL POLYPROTEIN; 
CHAIN: A, B; 


POL POLYPROTEIN; 
CHAIN: A, B; 


POL POLYPROTEIN; 
CHAIN: A, B; 


DEOXYURIDINE 5'- 
TRIPHOSPHATE 
NUCLEOTIDOHYDROLA 
SE; CHAIN: A; 


DEOXYURIDINE 5'- 
TRIPHOSPHATE 
NUCLEOTIDOHYDROLA 
SE; CHAIN: A; 


DEOXYURIDINE 5'- 
TRIPHOSPHATE 
NUCLEOTIDOHYDROLA 
SE; CHAIN: A; 


DEOXYURIDINE 5'- 
TRIPHOSPHATE 
NUCLEODITOHYDROLA 
SE; CHAIN: NULL; 


DEOXYURIDINE 5'- 
TRIPHOSPHATE 
NUCLEODITOHYDROLA 
SE; CHAIN: NULL; 


TRIPHOSPHATE 
NUCLEODITOHYDROLA 
SE; CHAIN: NULL; 


Compound 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA BARREL 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA-BARREL 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA-BARREL 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA-BARREL 


HYDROLASE DUTPASE; JELLY 
ROLL, MERCURY DERIVATIVE 


HYDROLASE DUTPASE; JELLY 
ROLL, MERCURY DERIVATIVE 1 


HYDROLASE DUTPASE; JELLY 
ROLL, MERCURY DERIVATIVE 


HYDROLASE DUTPASE, DUTP 
PYROPHOSPHATASE; 
HYDROLASE, DUTPASE, EIAV, 
TRIMERIC ENZYME, ASPARTYL 
PROTEASE 


HYDROLASE DUTPASE, DUTP 
PYROPHOSPHATASE; 
HYDROLASE, DUTPASE, EIAV, 
TRIMERIC ENZYME, ASPARTYL 
PROTEASE 


PYROPHOSPHATASE; 
HYDROLASE, DUTPASE, EIAV, 
TRIMERIC ENZYME, ASPARTYL 
PROTEASE 
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71.47 






SEQFOL 
D score 


HYDROLASE(ACID 
PROTEINASE) HUMAN 
IMMUNODEFICIENCY 


HYDROLASE(ACID 
PROTEASE) HIV-1 
PROTEASE (TETHERED 
DIMER LINKED BY 
IHVC 3 GLY-GLY-SER- 
SER-GLY) COMPLEXED 
WITHA-76928 IHVC 4 


HYDROLASE(ACID 
PROTEASE) HIV-1 
PROTEASE (TETHERED 
DIMER LINKED BY 
IHVC 3 GLY-GLY-SER- 
SER-GLY) COMPLEXED 
WITHA-76928 IHVC 4 


HIV-1 PROTEASE; 
CHAIN: A; 


HIV-1 PROTEASE; 
CHAIN: A; 


EIAV PROTEASE; 
CHAIN: NULL; 


POL POLYPROTEIN; 
CHAIN: A; 




POL POLYPROTEIN; 
CHAIN- A: 




Compound 








HYDROLASE HYDROLASE 


HYDROLASE HYDROLASE 


HYDROLASE (ACID PROTEINASE) 
HYDROLASE (ACID 
PROTEINASE), RNA-DIRECTED 
DNA POLYMERASE, 2 ASPARTYL 
PROTEASE, ENDONUCLEASE, 
POLYPROTEIN 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA BARREL 
PROTEIN 


VIRUSMRAL PROTEIN EIGHT 
STRANDED BETA BARREL 
PROTEIN 


PROTEIN 1 
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SEQFOL 
D score 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B 5 C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B,C,E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 




Compound 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


ZINC FINGER PROTEIN I 
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SEQFOL 
D score 


YYl; CHAIN: C; ADENO- 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


i YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 




Compound 


COMPLEX (TRANSCRIPTION 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


(TRANSCRIPTION 
REGULATION/DNA) 
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| GLUTATfflONE S- 


GLUTATHIONE-^- 
TRANSFERASE; CHAIN: 
A; 


GLUTATHIONE S- 
TRANSFERASE I; CHAIN: 
A, B; 

LACTOYLGLUTATHION 
E; CHAIN: C, D 


GLUTATHIONE S- 
TRANSFERASE III; 
CHAIN: NULL; 


GLUTATHIONE S- 
TRANSFERASE; CHAIN: 
A, B; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


TRANSFERASE GLUTATHIONES- 1 


TRANSFERASE GST, 
GLUTATHIONE CONJUGATING, 
PUTATIVE OXIDOREDUCTASE 


COMPLEX 

(TRANSFERASE/LIGAND) I 
COMPLEX 

(TRANSFERASE/LIGAND), 
TRANSFERASE, HERBICIDE 2 
DETOXIFICATION HEADER 


TRANSFERASE TRANSFERASE, 
HERBICIDE DETOXIFICATION 1 


TRANSFERASE GST, 
GLUTATHIONE TRANSFERASE; 
TRANSFERASE, GLUTAHIONE 
CONJUGATION, 
DETOXIFICATION, 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 
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SEQFOL 
D score 


MONOCLONAL 
ANTIBODY MRK-16 
(LIGHT CHAIN); CHAIN: 


MONOCLONAL 
ANTIBODY MRK-16 
(LIGHT CHAIN); CHAIN: 

A, C; MONOCLONAL 
ANTIBODY MRK-16 
(HEAVY CHAIN); CHAIN: 

B, D; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


HEMOLIN; CHAIN: A, B; 


CAMPATH-IG 
ANTIBODY; CHAIN: A, B, 
C,D,E,F,G,H; 


IMMUNOGLOBULIN FAB 
FRAGMENT OF MURINE 
MONOCLONAL 
ANTIBODY AN02 
COMPLEX IBAF 3 WITH 
ITS HAPTEN (2,2,6,6- 
TETRAMETHYL- 1- 
PIPERIDINYLOXY- IBAF 
4 DINITROPHENYL) 
IBAF 5 


409.5.3 (IGG2A) FAB; 
CHAIN: A, B, L, H 


Compound 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


IMMUNE SYSTEM 
IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPfflLIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPfflLIC ADHESION 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPfflLIC ADHESION 


ANTIBODY ANTIBODY, FAB, 
CAMPATH-IG, CD52 




IMMUNOGLOBULIN, C REGION, V 
REGION 
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SEQFOL 
D score 


COMPLEX 

(ANTIBIDY/IMMUNOSUP 


IGGl INTACT 
ANTIBODY MAB61.1.3; 
CHAIN: A, B, C, D 


IGG2A INTACT 
ANTIBODY -MAB231; 
CHAIN: A, B, C, D 


IGG2A INTACT 
ANTIBODY -MAB231; 
CHAIN: A, B,C,D 


COMPLEX 

(ANTIBODY/BINDING 
PROTEIN) IGGl FAB 
FRAGMENT 
COMPLEXEDWITH 
PROTEIN G (DOMAIN III) 
IIGC 5 PROTEIN G, 
STREPTOCOCCUS IIGC 
15 


COMPLEX 

(ANTIBODY/BINDING 
PROTEIN) IGGl FAB 
FRAGMENT 
COMPLEXEDWITH 
PROTEIN G (DOMAIN III) 
IIGC 5 PROTEIN G, 
STREPTOCOCCUS IIGC 
15 


IDIOTYPIC FAB 730.1.4 
(IGGl) OF VIRUS IIAI 5 
CHAIN: L, H; IIAI 7 
ANTI-IDIOTYPICFAB 
409.5.3 (IGG2A); IIAI 9 
CHAIN: M, I IIAI 10 


Compound 




IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN, V REGION, C 
REGION, HINGE REGION 


IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN V REGION C 
REGION, IMMUNOGLOBULIN 


IMMUNOGLOBULIN INTACT 
IMMUNOGLOBULIN V REGION C 
REGION, IMMUNOGLOBULIN 






COMPLEX (IMMUNOGLOBULIN 
IGG1/IGG2A) 
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SEQFOL 
D score 


| MFE-23 RECOMBINANT 


ANTIBODY; CHAIN: H, L; 
PROTEIN G-PRIME; 
CHAIN: A; MAJOR 
OUTER MEMBRANE 
PROTEIN PI. 16; CHAIN: 
P; 


ANTIBODY; CHAIN: L, H; 


SINGLE-CHAIN 
ANTIBODY FRAGMENT; 
CHAIN: A, C; 


N1G9 (IGG1=LAMBDA=); 
CHAIN: L, H; 


N15 ALPHA-BETA T- j 
CELL RECEPTOR; 
CHAIN: A, B, C, D; H57 
FAB; CHAIN: E, F, G, H 


HYDROLASE(0- 
GLYCOSYL) N9 
NEURAMINIDASE-NC41 
(E.C.3.2.1.18) COMPLEX 
WITH FAB INCA 3 


IMFA 4 D-MANNOSE 
(Pl-OME) (PART OF THE 
CELL-SURFACE 
CARBOHYDRATE IMFA 
5 OF PATHOGENIC 
SALMONELLA) IMFA 6 


Compound 


| IMMUNOGLOBULIN 


IMMUNE SYSTEM FAB, PORA, 
NEISSERIA MENINGITIDIS, PORIN 


IMMUNOGLOBULIN FAB, GD2- 
GANGLIOSIDE, CARBOHYDRATE, 
MELANOMA, IMMUNOGLOBULIN 


IMMUNOGLOBULIN VARIABLE 
HEAVY (VH) DOMAIN, VARIABLE 
LIGHT (VL) ANTIBODY 
FRAGMENT, MULTIVALENT 
ANTIBODY, DIABODY, DOMAIN 2 
SWAPPING, IMMUNOGLOBULIN 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, 


COMPLEX 

(IMMUNORECEPTOMMMUNOGL 
OBULIN) COMPLEX 
(IMMUNORECEPTOMMMUNOGL 
OBULIN) 
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SEQFOL 
D score 


IMMUNOGLOBULIN 
IGGl MONOCLONAL 
FAB FRAGMENT (TE33) 


SM3 ANTIBODY; CHAIN: 
L, H; PEPTIDE EPITOPE; 
CHAIN: P; 


SM3 ANTIBODY; CHAIN: 
L, H; PEPTIDE EPITOPE; 
CHAIN: P; 


MONOCLONAL 
ANTIBODY 3A2; CHAIN: 
H, L; 


MONOCLONAL 
ANTIBODY 3A2; CHAIN: 
H, L; 


MONOCLONAL 
ANTIBODY 3A2; CHAIN: 
H,L; 


IGG3-KAPPA ANTIBODY 
(LIGHT CHAIN); CHAIN: 
A, C; IGG3-KAPPA 
ANTIBODY (HEAVY 
CHAIN); CHAIN: B, D; 


MFE-23 RECOMBINANT 
ANTIBODY FRAGMENT; 
CHAIN: A; 


ANTIBODY FRAGMENT; 
CHAIN: A; 


Compound 




COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR 
ANTIBODY, 2 COMPLEX 
(ANTIBODY/PEPTIDE EPITOPE) 


COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR 
ANTIBODY, 2 COMPLEX 
(ANTIBODY/PEPTIDE EPITOPE) 


MONOCLONAL ANTIBODY 
MONOCLONAL ANTIBODY, FAB- 
FRAGMENT, REPRODUCTION 


MONOCLONAL ANTIBODY 
MONOCLONAL ANTIBODY, FAB- 
FRAGMENT, REPRODUCTION 


MONOCLONAL ANTIBODY 
MONOCLONAL ANTIBODY, FAB- 
FRAGMENT, REPRODUCTION 


IMMUNE SYSTEM PRELIMINARY, 
IMMUNE SYSTEM 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, SINGLE- 
CHAIN FV, ANTI- 

CARCINOEMBRYONIC 2 ANTIGEN 


IMMUNOGLOBULIN, SINGLE- 
CHAIN FV, ANTI- 

CARCINOEMBRYONIC 2 ANTIGEN 
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SEQFOL 
D score 


ANTIBODY (LIGHT 
CHAIN); CHAIN: L; 
ANTIBODY (HEAVY 


IGG4 REA; CHAIN: A; RF- 
AN IGM/LAMBDA; 
CHAIN: H, L; 


IGG4 REA; CHAIN: A; RF- 
AN IGM/LAMBDA; 
CHAIN: H, L; 


FAB FRAGMENT, 
ANTIBODY A5B7; 
CHAIN: A, B, C, D; 


IMMUNOGLOBULIN, 
DIELS ALDER 
CATALYTIC ANTIBODY; 
CHAIN: L, H, A, B; 


2E8 (IGG1=KAPPA=) 
ANTIBODY; CHAIN: L, H, 
M,P; 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB 3 


PHENYLARSONATE 
6FAB 3 ANTIBODY 36-71, 
FAB 36-71 6FAB4 


Compound 


IMMUNE SYSTEM 
IMMUNOGLOBULIN; 
IMMUNOGLOBULIN ANTIBODY 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN), RHEUMATOID FACTOR 2 
AUTO-ANTIBODY COMPLEX 


COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN) COMPLEX 

(IMMUNOGLOBULIN/AUTOANTIG 
EN), RHEUMATOID FACTOR 2 
AUTO-ANTIBODY COMPLEX 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, FAB 
FRAGMENT 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, ANTIBODY, 
CATALYTIC ANTIBODY, DIELS 
ALDER, 2 GERMLINE 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN 
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D score 


CAMP ATH- 1H:LIGHT 
CHAIN; CHAIN: L; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; 
CHAIN: NULL; 


ANTIBODY R24 (LIGHT 
CHAIN); CHAIN: A; 
ANTIBODY R24 (HEAVY 
CHAIN); CHAIN: B; 


ANTIBODY (CB 4-1); 
CHAIN: A, B; PEPTIDE; 
CHAIN: C; 


FAB FRAGMENT; 
CHAIN:L,H,J,K; 
VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR; 
CHAIN: V, W; 


FAB FRAGMENT; 
CHAIN: L, H, J, K; 
VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR; 
CHAIN: V, W; 


HEMOLIN; CHAIN: A, B; 




Compound 


ANTIBODY THERAPEUTIC, 
ANTIBODY, CD52 


T-CELL SURFACE 
GLYCOPROTEIN 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC, 
LIPOPROTEIN, T-CELL SURFACE 
GLYCOPROTEIN 


IMMUNE SYSTEM ANTIBODY 
(FAB FRAGMENT), IMMUNE 
SYSTEM 


COMPLEX (ANTIBODY/PEPTIDE) 
POLYSPECIFICITY, CROSS 
REACTIVITY, FAB-FRAGMENT, 
PEPTIDE, 2 HIV-1, COMPLEX 
(ANTIBODY/PEPTIDE) 


COMPLEX (ANTIBODY/ANTIGEN) 
FAB-12; VEGF; COMPLEX 
(ANTIBODY/ANTIGEN), 
ANGIOGENIC FACTOR 


COMPLEX (ANTIBODY/ANTIGEN) 
FAB-12; VEGF; COMPLEX 
(ANTIBODY/ANTIGEN), 
ANGIOGENIC FACTOR 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


HOMOPHILIC ADHESION | 
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SEQFOL 
D score 


IMMUNOGLOBULIN FAB 
FRAGMENT OF A 
HUMANIZED VERSION 


FC GAMMA RIIB; 
CHAIN: A; 


FC GAMMA RIIB; 
CHAIN: A; 


MUSCLE PROTEIN TITIN 
MODULE M5 
(CONNECTIN) 1TNM3 
(NMR, MINIMIZED 
AVERAGE STRUCTURE) 
1TNM4 1TNM58 


FAB 184.1; CHAIN: L, H; 
OUTER SURFACE 
PROTEIN A; CHAIN: 0; 


! N15 ALPHA-BETA T- 
CELL RECEPTOR; 
CHAIN: A, B, C, D; H57 
FAB; CHAIN: E, F, G, H 


TITIN; CHAIN: NULL; 


Compound 




IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 


IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 




STRAIN B31 


COMPLEX 

(IMMUNOGLOBULIN/LIPOPROTEI 
N) OSPA; COMPLEX 
(IMMUNOGLOBULIN/LIPOPROTEI 
N), OUTER SURFACE 2 PROTEIN A 
COMPLEXED WITH FAB184.1, 
BORRKLIA BURGDORFERI 3 


COMPLEX 

(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) COMPLEX 
(IMMUNORECEPTOR/IMMUNOGL 
OBULIN) 


MUSCLE PROTEIN CONNECTIN, 
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN 
FOLD, ALTERNATIVE SPLICING, 
SIGNAL, 3 MUSCLE PROTEIN 
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SEQFOL 
D score 


| RIBONUCLEASE; CHAIN: 


RIBONUCLEASE; CHAIN: 
A, B; 


HYDROLASE 
ANGIOGENIN; CHAIN: A; 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB3 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB3 


METAL CHELATASE 
CATALYTIC ANTIBODY; 
CHAIN: A, C; METAL 
CHELATASE 
CATALYTIC ANTIBODY; 
CHAIN: B, D; 


NEURAL CELL 
ADHESION MOLECULE; 
CHAIN: NULL; 


OF THE ANTI-CD18 
2FGW 3 ANTIBODY 'H52' 
(HUH52-OZ FAB) 2FGW4 


Compound 


| MOLECULAR EVOLUTION | 


MOLECULAR EVOLUTION 
MOLECULAR EVOLUTION, 
RIBONUCLEASE 


HYDROLASE HYDROLASE 
(VASCULARIZATION) 






IMMUNE SYSTEM METAL 
CHELATASE, CATALYTIC 
ANTIBODY, FAB FRAGMENT, 
IMMUNE 2 SYSTEM 


CELL ADHESION NCAM DOMAIN 
1; CELL ADHESION, 
GLYCOPROTEIN, HEPARIN- 
BINDING, GPI-ANCHOR, 2 
NEURAL ADHESION MOLECULE, 
IMMUNOGLOBULIN FOLD, 
SIGNAL 
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SEQFOL 
D score 


TITIN; CHAIN: NULL; 


HEMOLIN CHAIN: A, B; ! 


U2RN^ 
CHAIN 
CHAIN 
CHAIN: 


U2RW 
CHAIN: 
CHAIN: 
CHAIN 


U2RM 
CHAIN 
CHAIN: 
CHAIN: 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


Compound 


a n ^ S 


o o ?o e 


l HAIRPIN IV; | 
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CONNECTIN A71, CONNECTIN; 
TITIN, CONNECTIN, FIBRONECTIN 
TYPE III 


INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
i (NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX 

(INfflBITOR/NUCLEASE) 
COMPLEX 

(INfflBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 
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SEQFOL 
D score 


| MUSCLE PROTEIN TITIN 


CELL ADHESION 
PROTEIN TENASCIN 
(THIRD FIBRONECTIN 
TYPE III REPEAT) ITEN 3 


TENASCIN; CHAIN: A, B; 


TITIN; CHAIN: NULL; 


FIBRONECTIN; CHAIN: 
NULL; 


FIBRONECTIN; CHAIN: 
NULL; 


SKP2; CHAIN: A, C; SKPl; 
CHAIN: B, D; 




Compound 






STRUCTURAL PROTEIN 
TENASCIN, FIBRONECTIN TYPE- 
III, HEPARIN, EXTRACELLULAR 2 
MATRIX, ADHESION, FUSION 
PROTEIN, STRUCTURAL PROTEIN 


MUSCLE PROTEIN CONNECTIN, 
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN 
FOLD, ALTERNATIVE SPLICING, 
SIGNAL, 3 MUSCLE PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN, RGD, 
EXTRACELLULAR MATRIX, 2 
HEPARIN-BINDING, 
GLYCOPROTEIN 


LIGASE CYCLIN A/CDK2- 
ASSOCIATED P45; CYCLIN 
A/CDK2-ASSOCIATED P19; SKPl, 
SKP2, F-BOX, LRRS, LEUCINE- 
RICH REPEATS, SCF, 2 UBIQUITIN, 
E3, UBIQUITIN PROTEIN LIGASE 


PROTEIN LIGASE 
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| LECTIN (AGGLUTININ) 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


BLOOD COAGULATION 
FACTOR XA; CHAIN: L, 
C; 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I IVMO 3 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I IVMO 3 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I IVMO 3 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I IVMO 3 


MEMBRANE PROTEIN 
VITELLINE MEMBRANE 
OUTER LAYER PROTEIN 
I IVMO 3 


COAGULATION FACTOR 
VIIA (HEAVY CHAIN); 
CHAIN: H; TRIPEPTIDYL 
INHIBITOR; CHAIN: C; 


Compound 






BLOOD COAGULATION rACTUK 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 
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U2 RNA HAIRPIN IV; 
CHAIN: Q,R;U2A»; 
CHAIN: A, C; U2 B M ; 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


Compound 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 


COMPLEX 

(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 
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SEQFOL 
D score 


INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1; CHAIN: A; 


TROPOMYOSIN; CHAIN: 
A, B, C, D 


BETA 1,4 

GALACTOSYLTRANSFE 
RASE; CHAIN: A, B; 


BETA 1,4 

GALACTOSYLTRANSFE 
RASE; CHAIN: A, B; 


MMLV REVERSE 
TRANSCRIPTASE; IMML 
4 CHAIN: NULL; IMML 5 


PROMYELOCYTIC 
LEUKEMIA ZINC 
FINGER PROTEIN PLZF; 
CHAIN: A; 


RAB5C; CHAIN: A; 


RHOA; CHAIN: A; 


CHAIN: A; 


Compound 


SIGNALING PROTEIN GUANINE 
NUCLEOTIDE- BINDING PROTEIN 
1; GBP, GTP HYDROLYSIS, GDP, 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


TRANSFERASE BETA4GALT1; 
NUCLEOTIDE BINDING PROTEIN, 
ALPHA BETA ALPHA FOLD 


TRANSFERASE BETA4GALT1; 
NUCLEOTIDE BINDING PROTEIN, 
ALPHA BETA ALPHA FOLD 


REVERSE TRANSCRIPTASE 


GENE REGULATION POZ 
DOMAIN; PROTEIN-PROTEIN 
INTERACTION DOMAIN, 
TRANSCRIPTIONAL 2 
REPRESSOR, ZINC-FINGER 
PROTEIN, X-RAY 
CRYSTALLOGRAPHY, 3 PROTEIN 
STRUCTURE, PROMYELOCYTIC 
LEUKEMIA, GENE REGULATION 


PROTEIN TRANSPORT G- 
PROTEIN, GTP HYDROLYSIS, 
ENDOCYTOSIS, RAB PROTEIN, 2 
MEMBRANE TRAFFICKING 


GENE REGULATION/SIGNALING 
PROTEIN PROTEIN RHOA-GDP 
COMPLEX 


CRYSTALLOGRAPHY, 2 
SIGNALING PROTEIN 
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SEQFOL 
D score 


HULYS 11; CHAIN: A, B, 
D, E; LYSOZYME; 
CHAIN: C, F; 


FAB FRAGMENT; 
CHAIN: L, H, J,K; 
VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR; 
CHAIN: V, W; 


IMMUNOGLOBULIN; 
CHAIN: A, B; 


BENCE-JONES KAPPA I 
PROTEIN BRE; CHAIN: 
A, B, C; 


BENCE-JONES KAPPA I 
PROTEIN BRE; CHAIN: 
A, B, C; 


HYPOTHETICAL 23.7 
KDA PROTEIN IN ICC- 
TOLC CHAIN: A, B; 


NEUROLYSIN; CHAIN: P; 


INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1; CHAIN: A; 




Compound 


COMPLEX (HUMANIZED 
ANTIBODY/HYDROLASE) 
MURAMIDASE; HUMANIZED 


COMPLEX (ANTIBODY/ANTIGEN) 
FAB-12; VEGF; COMPLEX 
(ANTIBODY/ANTIGEN), 
ANGIOGENIC FACTOR 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN, KAPPA 
LIGHT-CHAIN DIMER HEADER 


IMMUNE SYSTEM BENCE-JONES; 
IMMUNOGLOBULIN, AMYLOID, 
IMMUNE SYSTEM 


IMMUNE SYSTEM BENCE-JONES; 
IMMUNOGLOBULIN, AMYLOID, 
IMMUNE SYSTEM 


HYDROLASE ADP-RIBOSE 
PYROPHOSPHATASE; NUDIX 
FOLD 


HYDROLASE NEUROPEPTIDASE, 
ZINC METALLOPEPTIDASE, 
ENDOPEPTIDASE 


SIGNALING PROTEIN GBP, GTP 
HYDROLYSIS, GDP, GMP, 
INTERFERON INDUCED, 
DYNAMIN 2 RELATED, LARGE 
GTPASE FAMILY. GMPPNP, 
GPPNHP. 


GMP, INTERFERON INDUCED, 
DYNAMIN 2 RELATED, LARGE 
GTPASE FAMILY, SIGNALING 
PROTEIN 
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121.32 
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111.65 




SEQFOL 
D score 


| IMMUNOGLOBULIN FV 


IMMUNOGLOBULIN FV 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5,VERSION81FVC3 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AA FV) IFGV 4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AA FV) IFGV 4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AA FV) IFGV 4 


IGMMEZ 

IMMUNOGLOBULIN; 
CHAIN: L; IGMMEZ 
IMMUNOGLOBULIN; 
CHAIN: H; 


IGMMEZ 

IMMUNOGLOBULIN; 
CHAIN: L; IGMMEZ 
IMMUNOGLOBULIN; 
CHAIN: H; 


CHAIN: G, H; | 


Compound 












IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD, 
ANTIBODY, IGM FV 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD, 
ANTIBODY, IGM, FV 


FAB VH3 3 SPECIFICITY | 
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SEQFOL 
D score 


IMMUNOGLOBULIN 
WAT, A VARIABLE 
DOMAIN FROM 
IMMUNOGLOBULIN 


IMMUNOGLOBULIN 
WAT, A VARIABLE 
DOMAIN FROM 
IMMUNOGLOBULIN 
LIGHT-CHAIN IWTL 3 
(BENCE-JONES 
PROTEIN) IWTL 4 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN FAB 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 


IMMUNOGLOBULIN FV 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 8 IFVC 3 


FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 8 IFVC 3 


Compound 
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SEQFOL 
D score 


3-ALPHA- 

HYDROXYSTEROID 
DEHYDROGENASE; 
CHAIN: A, B; 


OXIDOREDUCTASE 
ALDOSE REDUCTASE 
(E.C.Ll.1.21) COMPLEX 
WITHNADPH IADS 3 


OXIDOREDUCTASE 
ALDOSE REDUCTASE 
(E.C.Ll.1.21) COMPLEX 
WITHNADPH IADS 3 


2,5-DIKETO-D- 
GLUCONICACID 
REDUCTASE A; CHAIN: 
NULL; 


2,5-DIKETO-D- 
GLUCONICACID 
REDUCTASE A; CHAIN: 
NULL; 


IMMUNOGLOBULIN FAB 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 
2FGW 3 ANTIBODY 'H52' 
(HUH52-OZFAB)2FGW4 


LIGHT-CHAIN 1WTL3 
(BENCE-JONES 
PROTEIN) 1WTL4 


Compound 


OXIDOREDUCTASE 3-ALPHA- 
HSD; OXIDOREDUCTASE, NAD 






OXIDOREDUCTASE 2,5-DKG 
REDUCTASE A; 
OXIDOREDUCTASE, 
ALPHA8/BETA8 BARREL, 2,5- 
DIKETO-D-GLUCONIC 2 ACID, 
COMMERCIAL VITAMIN C 
SYNTHESIS 


OXIDOREDUCTASE 2,5-DKG 
REDUCTASE A; 
OXIDOREDUCTASE, 
ALPHA8/BETA8 BARREL, 2,5- 
DIKETO-D-GLUCONIC 2 ACID, 
COMMERCIAL VITAMIN C 
SYNTHESIS 
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SEQFOL 
D score 


| FR-1 PROTEIN; CHAIN: 


FR-1 PROTEIN; CHAIN: 
NULL; 


ALDOSE REDUCTASE; 
CHAIN: A; 


ALDOSE REDUCTASE; 
CHAIN: A; 


ALDEHYDE 
REDUCTASE; CHAIN: 
NULL; 


ALDEHYDE 
REDUCTASE; CHAIN: 
NULL; 


CHO REDUCTASE; 
CHAIN: A; 


CHO REDUCTASE; 
CHAIN: A; 


ALDOSE REDUCTASE; 
CHAIN: NULL; 


ALDOSE REDUCTASE; 
CHAIN: NULL; 


3-ALPHA- 

HYDROXYSTEROID 
DEHYDROGENASE; 
CHAIN: A, B; 


Compound 


3 
o 

i 

I 


OXIDOREDUCTASE (NADP) 
ALDO-KETO OXIDOREDUCTASE 
(NADP), TIM BARREL 


OXIDOREDUCTASE ALDOSE 
REDUCTASE, INHIBITION, 
DIABETES 


OXIDOREDUCTASE ALDOSE 
REDUCTASE, INHIBITION, 
DIABETES 


OXIDOREDUCTASE ALRl; TIM- 
BARREL, OXIDOREDUCTASE, 
NADP 


OXIDOREDUCTASE ALRl; TIM- 
BARREL, OXIDOREDUCTASE, 
NADP 


OXIDOREDUCTASE ALPHA/BETA 
TIM BARREL, PROTEIN-NADP+ 
COMPLEX 


OXIDOREDUCTASE ALPHA/BETA 
TIM BARREL, PROTEIN-NADP+ 
COMPLEX 


OXIDOREDUCTASE 
OXIDOREDUCTASE, ALDOSE 
REDUCTASE, INHIBITION, 
DIABETES 


OXIDOREDUCTASE 
OXIDOREDUCTASE, ALDOSE 
REDUCTASE, INHIBITION, 
DIABETES 


OXIDOREDUCTASE 3-ALPHA- 
HSD; OXIDOREDUCTASE, NAD 
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67.97 






SEQFOL 
D score 


| OUTER SURFACE 


OUTER SURFACE 
PROTEIN C; CHAIN: A; 


OUTER SURFACE 
PROTEIN C; CHAIN: A; 


OUTER SURFACE 
PROTEIN C; CHAIN: A; 


OUTER SURFACE 
PROTEIN C; CHAIN: A; 


OUTER SURFACE 
PROTEIN C; CHAIN: A, B, 
C,D; 


OUTER SURFACE 
PROTEIN C; CHAIN: A, B, 
C,D; 


OUTER SURFACE 
PROTEIN C; CHAIN: A, B, 
C,D; 


ALDEHYDE 
REDUCTASE; CHAIN: 
NULL; 


ALDEHYDE 
REDUCTASE; CHAIN: 
NULL; 


2,5-DIKETO-D- 
GLUC0N1CACID 
REDUCTASE; CHAIN: A; 


2,5-DIKETO-D- 
GLUCONICACID 
REDUCTASE; CHAIN: A; 


NULL; 


Compound 


| IMMUNE SYSTEM SURFACE 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


IMMUNE SYSTEM OSPC; LYME 
DISEASE, OSPC, HB19, HELICAL 
BUNDLE, DIMER 


IMMUNE SYSTEM OSPC; LYME 
DISEASE, OSPC, HB19, HELICAL 
BUNDLE, DIMER 


IMMUNE SYSTEM OSPC; LYME 
DISEASE, OSPC, HB19, HELICAL 
BUNDLE, DIMER 


OXIDOREDUCTASE ALRl; 
OXIDOREDUCTASE, TIM-BARREL 


OXIDOREDUCTASE ALRl; 
OXIDOREDUCTASE, TIM-BARREL 


OXIDOREDUCTASE APO-2,5- 
DIKETO-D-GLUCONATE 
REDUCTASE; ALDO-KETO 
REDUCTASE, TIM BARREL 


OXIDOREDUCTASE APO-2,5- 
DIKETO-D-GLUCONATE 
REDUCTASE; ALDO-KETO 
REDUCTASE, TIM BARREL 


ALDO-KETO OXIDOREDUCTASE 
(NADP), TIM BARREL 
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SEQFOL 
D score 


| YYl; CHAIN: C; ADENO- 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 




Compound 


| COMPLEX (TRANSCRIPTION 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


POLYMERASE 111,2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 
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131.75 








63.97 






SEQFOL 
D score 


| HLA CLASS I 


BETA=2=~ 
MICROGLOBULIN; 
IBMG 5 CHAIN: NULL 
IBMG 6 


PS2 PROTEIN; CHAIN: A, 
B; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


Compound 


| IMMUNE SYSTEM MAGE-4 1 


HISTOCOMPATIBILITY ANTIGEN 
LACTOLLIN, MHC-I 
HISTOCOMPATIBILITY ANTIGEN, 
LIGHT CHAIN IBMG 11 


GROWTH FACTOR PNR- 
2,PS2,TFF1,BREAST CANCER 
ESTROGEN INDUCIBLE GROWTH 
FACTOR, CELL MOTILITY, 
TUMOR SUPPRESSOR, TREFOIL 2 
DOMAIN, SIGNAL 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 
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SEQFOL 
D score 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 


ACUTOLYSIN A; CHAIN: 
A; 


TUMOR NECROSIS 
FACTOR-ALPHA- 
CONVERTING ENZYME; 
CHAIN: A, C, E,I; 


TUMOR NECROSIS 
FACTOR-ALPHA- 
CONVERTING ENZYME; 
CHAIN: A, C, E, I; 


ATROLYSINC; 1ATL4 
CHAIN: A, B, C, D; 1ATL 


ATROLYSIN C; 1ATL 4 
CHAIN: A, B, C, D; 1 ATL 
5 


GLUTACONATE 
COENZYME A- 
TRANSFERASE; CHAIN: 
A,B,C,D; 


GLUTACONATE 
COENZYME A- 
TRANSFERASE; CHAIN: 
A, B, C, D; 


GLUTACONATE 
COENZYME A- 
TRANSFERASE; CHAIN: 
A, B, C, D; 


TRANSFERASE; CHAIN: 
A,B, C, D; 


Compound 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 


TOXIN HEMORRHAGIN I, IAAH-I; 
METALLOPROTEINASE, SNAKE 
VENOM, MMP, TOXIN 


ZN-ENDOPEPTIDASE TACE; ZN- 
ENDOPEPTIDASE, HYDROLASE, 
TNF-ALPHA 


ZN-ENDOPEPTIDASE TACE; ZN- 
ENDOPEPTIDASE, HYDROLASE, 
TNF-ALPHA 


METALLOENDOPEPTIDASE 
HEMORRHAGIC TOXIN C } FORM 
D; 1ATL6 


METALLOENDOPEPTIDASE 
HEMORRHAGIC TOXIN C, FORM 
D; 1ATL6 


TRANSFERASE TRANSFERASE, 
COA, GLUTAMATE, PROTEIN 
FERMENTATION 


TRANSFERASE TRANSFERASE, 
COA, GLUTAMATE, PROTEIN 
FERMENTATION 


TRANSFERASE TRANSFERASE, 
COA, GLUTAMATE, PROTEIN 
FERMENTATION 


FERMENTATION 
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SEQFOL 
D score 


AGGREGATION 
INHIBITOR, GP 


LAMININ; 


LAMININ; 


LAMININ; 


INSULIN-] 
FACTOR I 
CHAIN: A 


METALLOPROTEASE 
ADAMALYSINII 
(PROTEINASE II) 
(E.C.3.4.24.46) HAG 3 


METALLOPROTEASE 
ADAMALYSINII 
(PROTEINASE II) 
(E.C.3.4.24.46) HAG 3 


FLAVORIDIN; IFVL 4 
CHAIN: NULL IFVL 5 


FLAVORIDIN; IFVL 4 
CHAIN: NULL IFVL 5 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L; 
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 
SOLUBLE TISSUE 
FACTOR; CHAIN: T; 
5L15; CHAIN: I; 




Compound 


CHAIN; 


. CHAIN: 


. CHAIN: 


LIKE GROWTH 
IECEPT0R 1; 


: NULL; 


: NULL 


:NULL: 










GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


1 GLYCOPROTEIN GLYCOPROTEIN 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 






BLOOD COAGULATION 
INHIBITOR GPIIB/IIIA 
ANTAGONIST IFVL 9 


BLOOD COAGULATION 
INHIBITOR GPIIB/IIIA 
ANTAGONIST IFVL 9 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, RECEPTOR ENZYME, 3 
INHIBITOR, GLA, EGF, COMPLEX 
(SERINE 4 

PROTEASE/COFACTOR/LIGAND), 
BLOOD CLOTTING 


DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 
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SEQFOL 
D score 


| BLOOD COAGULATION 


HUMAN BETA2- 
GLYCOPROTEIN I; 
CHAIN: A; 


ACUTOLYSIN-C; CHAIN: | 
A; 


ACUTOLYSIN-C; CHAIN: 
A; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


AGGREGATION 
INHIBITOR, GP 
ANTAGONIST KISTRIN 
(NMR, 8 STRUCTURES) 
1KST3 


ANTAGONIST KISTRIN 
(NMR, 8 STRUCTURES) 
1KST3 


Compound 




MEMBRANE ADHESION SHORT 
CONSENSUS REPEAT, SUSHI, 
COMPLEMENT CONTROL 
PROTEIN, 2 N-GLYCOSYLATION, 
MULTI-DOMAIN, MEMBRANE 
ADHESION 


METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 


-a 

| 
2 


TOXIN HEMORRHAGIN 111 
METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 


j COMPLEX (BLOOD 

| COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 

1 BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 
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D score 


| ATROLYSINC; 1ATL4 


QGSRZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B, C; 


QGSRZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSRZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


INHIBITOR ECHISTATIN 
(NMR, 8 STRUCTURES) 
2ECH3 


Compound 


1 METALLOENDOPEPTIDASE 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 
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SEQFOL 
D score 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E,F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B,C,E,F; 


TFIIIA; CHAIN: A, D; 5S 
RIBOSOMAL RNA GENE; 
CHAIN: B, C, E, F; 


RECEPTOR PROTEIN 
TYROSINE 
PHOSPHATASE MU; 
CHAIN: A, B; 


> 


n 

© 
o 

1 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


1 COMPLEX (TRANSCRIPTION 
REGULATION/DNA) COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), RNA 
POLYMERASE III, 2 
TRANSCRIPTION INITIATION, 
ZINC FINGER PROTEIN 


RECEPTOR Dl; RECEPTOR, 
PHOSPHATASE, SIGNAL 
TRANSDUCTION, ADHESION, 2 
HYDROLASE 


METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 
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SEQFOL 
D score 


SERINE/THREONINE 
PROTEIN PHOSPHATASE 
5; CHAIN: NULL; 


SERINE/THREONINE 
PROTEIN PHOSPHATASE 
5; CHAIN: NULL; 


SERINE/THREONINE 
PROTEIN PHOSPHATASE 
5; CHAIN: NULL; 


GDP-MANNOSE 4,6- 
DEHYDRATASE; CHAIN: 
A; 


CALMODULIN; CHAIN: 
A; RS20; CHAIN: B; 


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 
1TOP3 


CONTRACTILE SYSTEM 
PROTEIN TROPONIN C 
1TOP3 


TROPONIN C; ITNX 4 
CHAIN: NULL; ITNX 5 


Compound 


HYDROLASE 

TETRATRICOPEPTIDE, TRP; 
HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN 
INTERACTIONS, TPR, 2 SUPER- 
HELIX, X-RAY STRUCTURE 


HYDROLASE 

TETRATRICOPEPTIDE, TRP; 
HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN 
INTERACTIONS, TPR, 2 SUPER- 
HELIX, X-RAY STRUCTURE 


HYDROLASE 

TETRATRICOPEPTIDE, TRP; 
HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN 
INTERACTIONS, TPR, 2 SUPER- 
HELIX, X-RAY STRUCTURE 


LYASE DEHYDRATASE, NADP, 
GDP-MANNOSE, GDP-FUCOSE 


CALMODULIN, CALCIUM 
BINDING, HELIX-LOOP-HELK, 
SIGNALLING, 2 

COMPLEX(CALCIUM-BINDING 
PROTEIN/PEPTIDE) 






CALCIUM-BINDING PROTEIN EF- 
, HAND ITNX 14 
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SEQFOL 
D score 


CYCLOPHILIN 40; 
CHAIN: A; 


CYCLOPHILIN 40; 
CHAIN: A; 


PEROXISOME 
TARGETING SIGNAL 1 
RECEPTOR PEX5; 
CHAIN: A; 


NEUTROPHIL CYTOSOL 
FACTOR 2; CHAIN: A; 


PEROXISOMAL 
TARGETING SIGNAL 1 
RECEPTOR; CHAIN: A, B; 
PTS1-CONTAINING 
PEPTIDE; CHAIN: C, D; 


PEROXISOMAL 
TARGETING SIGNAL 1 
RECEPTOR; CHAIN: A, B; 
PTS1-CONTAINING 
PEPTIDE; CHAIN: C, D; 


TPRl-DOMAIN OF HOP; 
CHAIN: A, B; HSC70- 
PEPTIDE; CHAIN: C, D; 


PEPTIDE; CHAIN: C, D; 


Compound 


ISOMERASE 40 KDA PEPTIDYL- 
PROLYL CIS-TRANS ISOMERASE; 
PPIASEIMMUNOPHILIN 
TETRATRICOPEPTIDE 


ISOMERASE 40 KDA PEPTIDYL- 
PROLYL CIS-TRANS ISOMERASE; 
PPIASE IMMUNOPHILIN 
TETRATRICOPEPTIDE 


TRANSPORT PROTEIN PEX5; 
ALPHA HELICAL 


PHAGOCYTE OXIDASE FACTOR 
P67PHOX, NCF-2; PHAGOCYTE 
OXIDASE FACTOR, SH3 DOMAIN, 
REPEAT, TPR REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, 
PTSl-BP, PEROXIN-5, PTSl 
PROTEIN-PEPTIDE COMPLEX, 
TETRATRICOPEPTIDE REPEAT, 
TPR, 2 HELICAL REPEAT 


SIGNALING PROTEIN 
PEROXISMORE RECEPTOR 1, 
PTSl-BP, PEROXIN-5, PTSl 
PROTEIN-PEPTIDE COMPLEX, 
TETRATRICOPEPTIDE REPEAT, 
TPR, 2 HELICAL REPEAT 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSC70, 2 HSP70, 
PROTEIN BINDING 


REPEAT, HSC70,2HSP70, 
PROTEIN BINDING 
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SEQFOL 
D score 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T, U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 
CHAIN: C; 


PHOSPHOLIPASE A2; 
CHAIN: NULL; 


FACTOR VII; CHAIN: 
NULL; 




ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI: CHAIN: P: 


COAGULATION FACTOR 
EGF-LIKE MODULE OF 
BLOOD COAGULATION 
FACTOR X(N- 
TERMINAL, 1AP03APO 
FORM) (NMR, 13 
STRUCTURES) 1AP04 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 




Compound 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


HYDROLASE HYDROLASE, 
PHOSPHOLIPASE A2, PLATELET 
AGGREGATION INHIBITOR, 2 
PBPB HEADER MODRES 


BLOOD COAGULATION BLOOD 
COAGULATION, EGF, 
HYDROLASE, SERINE PROTEASE 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 




COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


REGULATION/DNA) | 
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SEQFOL 
D score 


FACTOR IX; CHAIN: B, C; 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 
CHAIN; CHAIN: M,N, 0, 
P; THROMBOMODULIN; 
CHAIN: I,J,K,L; 
THROMBIN INHIBITOR 
L-GLU-L-GLY-L-ARM; 
CHAIN: E, F, G, H; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: 
H,I; DES-GLA FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L, M; (DPN)-PHE- 
ARG; CHAIN: C, D; 
PEPTIDE E-76; CHAIN: X, 
Y; 


THROMBOMODULIN; 
CHAIN: A; 


THROMBOMODULIN; 
CHAIN: A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T, U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 
CHAIN: C; 


Compound 


COAGULATION FACTOR 
CRYSTAL STRUCTURE, 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 
FETOMODULIN, TM, CD141 
ANTIGEN; EGR-CMK SERINE 
PROTEINASE, EGF-LIKE 
DOMAINS, ANTICOAGULANT 
COMPLEX, 2 ANTIFIBRINOLYTIC 
COMPLEX 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 


MEMBRANE PROTEIN NMR, 
THROMBIN, EGF MODULE, 
ANTICOAGULANT, 
GLYCOSYLATION 


MEMBRANE PROTEIN NMR, 
THROMBIN, EGF MODULE, 
ANTICOAGULANT, 
GLYCOSYLATION 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 
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PMF 1 
















SEQFOL 
D score 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L; 
BLOOD COAGULATION 
FACTOR VIIA; CHAIN: H; 


BLOOD COAGULATION 
FACTOR VII; CHAIN: A; 


TUMOR NECROSIS 
FACTOR RECEPTOR; 
CHAIN: A, B; 


PROSTAGLANDIN H2 
SYNTHASE-1; CHAIN: A, 
B; 


FIBRILLIN; CHAIN: 
NULL; 


GROWTH FACTOR 
EPIDERMAL GROWTH 
FACTOR (EGF) (NMR, 16 
STRUCTURES) 1EGF 3 




Compound 


BLOOD CLOTTING 
COMPLEX(SERINE 
PROTEASE/COFACTOR/LIGAND), 
BLOOD COAGULATION, 2 SERINE | 


BLOOD CLOTTING FACTOR VII, 
BLOOD COAGULATION, EGF-LIKE 
DOMAIN, BLOOD 2 CLOTTING 


SIGNALLING PROTEIN BINDING 
PROTEIN, CYTOKINE, 
SIGNALLING PROTEIN 


OXIDOREDUCTASE COX-1; EGF 
DOMAIN, NSAID BINDING, 
IBUPROFEN, MEMBRANE- 
BINDING 2 DOMAIN, 
CYCLOOXYGENASE, 
PEROXIDASE, 
OXIDOREDUCTASE, 3 
DIOXYGENASE, PEROXIDASE 


MATRIX PROTEIN 
EXTRACELLULAR MATRIX, 
CALCIUM-BINDING, 
GLYCOPROTEIN, 2 REPEAT, 
SIGNAL, MULTIGENE FAMILY, 
DISEASE MUTATION, 3 EGF-LIKE 
DOMAIN, HUMAN FIBRILLIN-1 
FRAGMENT, MATRIX PROTEIN 




EPIDERMAL GROWTH FACTOR, 
EGF, 2 CALCIUM-BINDING, EGF- 
LIKE DOMAIN, STRUCTURE AND 
FUNCTION, 3 HUMAN FACTOR IX, 
COAGULATION FACTOR 
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PMF 
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SEQFOL 
D score 


COAGULATION FACTOR 
X; CHAIN: NULL; 


PHOSPHOLIPASE A2; 
CHAIN: A, B; 


T-PLASMNOGEN 
ACTIVATOR Fl-G; ITPG 
7 CHAIN: NULL; ITPG 8 


COAGULATION FACTOR 
VIIA (LIGHT CHAIN); 
CHAIN: L; 

COAGULATION FACTOR 
VIIA (HEAVY CHAIN); 
CHAIN: H; TRIPEPTIDYL 
INHIBITOR; CHAIN: C; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


LAMININ; CHAIN: NULL; | 


CHAIN: A; | 


Compound 


GLYCOPROTEIN GLYCOPROTEIN, 
HYDROLASE, SERINE PROTEASE, 
PLASMA, BLOOD 2 


LIPID DEGRADATION 
PHOSPHOLIPASE A2 5 LIPID 
DEGRADATION, HYDROLASE 


PLASMINOGEN ACTIVATION 


SERINE PROTEASE FVIIA; FVIIA; 
BLOOD COAGULATION, SERINE 
PROTEASE 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN | 


FAMILY | 
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131.11 
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128.96 


114.49 






SEQFOL 
D score 


IMMUNOGLOBULIN FV 
FRAGMENT OF 
HUMANIZED ANTIBODY 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AAFV)1FGV4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52» 
(HUH52-AAFV) IFGV 4 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY 'H52' 
(HUH52-AA FV) IFGV 4 


IGMMEZ 

IMMUNOGLOBULIN; 
CHAIN: L; IGM MEZ 
IMMUNOGLOBULIN; 
CHAIN: H; 


IGMMEZ 

IMMUNOGLOBULIN; 
CHAIN: L; IGMMEZ 
IMMUNOGLOBULIN; 
CHAIN: H; 


FAB IDFB 3 


IMMUNOGLOBULIN 3D6 


CHAIN: G, H; | 


Compound 










IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD, 
ANTIBODY, IGM, FV 


IMMUNE SYSTEM 
IMMUNOGLOBULIN FOLD, 
ANTIBODY, IGM, FV 




FAB VH3 3 SPECIFICITY | 
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134.67 




SEQFOL 
D score 


| IMMUNOGLOBULIN 


TR1.9 FAB; CHAIN: L, H; 


N9 NEURAMINIDASE; 
INMB 4 CHAIN: N; INMB 
5 FAB NCIO; INMB 9 
CHAIN: L, H; INMB 10 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN M 
(IG-M) FV FRAGMENT 
1IGM3 


IMMUNOGLOBULIN FAB 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 


IMMUNOGLOBULIN FAB 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 


IMMUNOGLOBULIN FV 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5,VERSION81FVC3 


4D5,VERSION81FVC3 | 


Compound 




IMMUNOGLOBULIN TR1.9, ANTI- 
THYROID PEROXIDASE, 
AUTOANTIBODY, 2 
IMMUNOGLOBULIN 


COMPLEX 

(HYDROLASE/IMMUNOGLOBULI 

N) 
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131.61 




SEQFOL 
D score 


| RIBONUCLEASE 


HUMAN SKELETAL 
MUSCLE ALPHA- 
ACTINIA; CHAIN: A; 


ALPHA- ACTININ 2; 
CHAIN: A, B; 


ALPHA SPECTRIN; 
CHAIN: A, B, C; 


IMMUNOGLOBULIN FAB 
FRAGMENTOFA j 
HUMANIZED VERSION 
OF THE ANTI-CD18 
2FGW 3 ANTIBODY 'H52' 
(HUH52-OZ FAB) 2FGW4 


IMMUNOGLOBULIN 
WAT, A VARIABLE 
DOMAIN FROM 
IMMUNOGLOBULIN 
LIGHT-CHAIN IWTL 3 
(BENCE-JONES 
PROTEIN) IWTL 4 


WAT, A VARIABLE 
DOMAIN FROM 
IMMUNOGLOBULIN 
LIGHT-CHAIN IWTL 3 
(BENCE-JONES 
PROTEIN) IWTL 4 


Compound 


COMPLEX 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


TRIPLE-HELIX COILED COIL 
ALPHA ACTININ SKELETAL 
MUSCLE ISOFORM 2, TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN, 
MUSCLE, 2 Z-LINE, ACTIN- 
BINDING PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED- COILS, 
STRUCTURAL PROTEIN 
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SEQFOL 
D score 


U2 RNA HAIRPIN IV; 
CHAIN: Q, R; U2A'; 
CHAIN: A, C; U2 B"; 


U2RW 
CHAIN; 
CHAIN 
CHAIN 


U2RNi> 
CHAIN; 
CHAIN; 
CHAIN; 


U2RN/ 
CHAIN; 
CHAIN; 
CHAIN: 


U2RN* 
CHAIN: 
CHAIN: 
CHAIN: 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


U2RM 
CHAIN: 
CHAIN: 
CHAIN: 


INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


Compound 


W > O ffi 

tO tO Q 


W > p ffl 
to to a 


W > p ffi 

a o 2 

to to a 
cd > ^ 


l HAIRPIN IV; 
:Q,R;U2A'; 

A, C;U2B"; 

B, D; 


l HAIRPIN IV; 
Q,R;U2A'; 

A, C;U2B"; 

B, D; 


W > p ffi 

o o * 2 

to to a 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNPjRIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 
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SEQFOL 
D score 


RAB 

GERANYLGERANYLTRA 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RN/ 
CHAIN; 
CHAIN: 
CHAIN: 


U2RM 
CHAIN: 
CHAIN: 
CHAIN: 


U2KNW 
CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: B,D; | 


Compound 


o o * cS 
to to a 

CO > ^ 


l HAIRPIN IV; 
:Q,R;U2A'; 
: A, C; U2 B"; 
:B,D; 


u o p c 

to to a 


D o W > 
to to a 

CO > ^ 


TRANSFERASE CRYSTAL 
STRUCTURE, RAB 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


SNRNP,RIBONUCLEOPROTEIN | 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



892 



to 
o 



4^ 



to 

CD 



0\ 



O 

a 



g 



CD 



to 



to 

oo 



p 

t— » 

oo 



to 



52 2 S 





K 22 
O b & 




SEP 

w ffl o 
^ ^ 2 

I> Q H 

> § 

^ S Q 
n ffi w 

i j W 







WO 03/054152 



PCT/US02/39555 



893 



0\ 



*Z H 



3£ 



H CO 

g 



o 



o 2 



o 

O 



o 



o 

3 



Oh 



O GO 

g o 
f 





o w 



e 

pa 



O 



53 

O ca 




n 

o 

I 

o 









WO 03/054152 



PCT/US02/39555 



894 




WO 03/054152 



PCT/US02/39555 



895 



125 Q ™ 



9r 



o 



o 



08 



2 O 

85 



H 3 



IS 



O 

o 
o 



3 4 



r? 
o 



g 



GO 



rt> O 



- Q 

p 

> Q 




own 

5> £2 O 

> ffi H 

W ^ J?d 
mOO 

Oh 



M W O 

> °° 

M W 

2 a ^ 



o 



n 

o 

I 

o 
cs 



W GO 



1 

00 

oo 
W 



o 
o 

r 



00 GO 

td td 




w Q 




H0 N> 





WO 03/054152 



PCT/US02/39555 



896 



1551 


1551 


1551 


1551 


1551 


1551 


1551 






lqfm 




lmaa 


I—* 


ljkm 


I—* 

to 


1 




w ^ 
W 


> 


> 


> 




> 


> 


> 




CHAI 
NID 


LO 




lo 
o\ 


to 


LO 
O 
» 


LO 
4^ 
vo 






STAR 
TAA 


L/i 

vo 


Lh 
VO 
-J 


L/i 
VO 
to 


Lh 
VO 
L/l 


Lh 

51 


Lh 


oo 
o 






5.6e-81 


5.6e-81 


2.4e-34 


1.4e-54 


to 
bo 

CD 

to 
o\ 


4.8e-32 


4.2e-64 
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Verify 
score 
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0.00 
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0.16 




PMF 
score 




90.26 














SEQFOL 
D score 


PROLYL 

OLIGOPEPTIDASE; 


PROLYL 

OLIGOPEPTIDASE; 
CHAIN: A; 




ACETYLCHOLINESTERA 
SE: CHAIN: A. B. C. D: 


HYDROLASE LIPASE 
(E.C.3.1.1.3) 
(TRIACYLGLYCEROL 
LIPASE) COMPLEXED 
WITH ILPP 3 
HEXADECANESULFONA 
TE ILPP 4 ILPP 71 


BREFELDIN A 
ESTERASE; CHAIN: A, B; 


ACYL PROTEIN 
THIOESTERASE 1; 
CHAIN: A, B; 


BILE SALT ACTIVATED 
LIPASE; CHAIN: A; 




Compound 


HYDROLASE PROLYL 
ENDOPEPTIDASE, POST-PROLINE 


HYDROLASE PROLYL 
ENDOPEPTIDASE, POST-PROLINE 
CLEAVING PROLYL 
OLIGOPEPTIDASE, AMNESIA, 
ALPHA/BETA-HYDROLASE, 
BETA- 2 PROPELLER 


HYDROLASE MACHE; 
HYDROLASE, SERINE ESTERASE, 
ACETYLCHOLINESTERASE, 
TETRAMER, 2 HYDROLASE FOLD, 
GLYCOSYLATED PROTEIN 




SERINE HYDROLASE SERINE 
HYDROLASE, DEGRADATION OF 
BREFELDIN A, ALPHA/BETA 2 
HYDROLASE FAMILY 


HYDROLASE ALPHA/BETA 
HYDROLASE, SERINE 
HYDROLASE, SAD, ANOMALOUS 
2 DIFFRACTION 


HYDROLASE BILE SALT 
ACTIVATED LIPASE, ESTERASE, 
CATALYTIC DOMAIN 


TRANSFERASE; 30KDA 
SECRETORY PROTEIN; ANTIGEN 
85B 
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SEQFOL 
D score 


OXIDOREDUCTASES(AC 
TING ON CH-NH2 
DONOR) H-PROTEIN 


OXIDOREDUCTASES(AC 
TING ON CH-NH2 
DONOR) H-PROTEIN 
(E.C.1.4A2) COMPLEXED 
WITH LIPOIC ACID 
CHARGED IHTP 3 IN 
METHYLAMINE IHTP 4 
IHTP 77 


RNA POLYMERASE 
PRIMARY SIGMA 
FACTOR; CHAIN: NULL; 


LIGASE(SYNTHETASE) 
SERYL-TRNA 
SYNTHETASE 
(E.C.6.1.1.11) (SERINE- 
TRNA LIGASE) ISES 3 
COMPLEXED WITH 
SERYL- 

HYDROXAMATE-AMP 
ISES 4 


HUMAN SKELETAL 
MUSCLE ALPHA- 
ACTININ 2; CHAIN: A; 


ACUTOLYSIN-C; CHAIN: 
A; 


(PROTEINASE II) 
(E.C.3.4.24.46) HAG 3 


Compound 






TRANSCRIPTION REGULATION 
SIGMA70; RNA POLYMERASE 
SIGMA FACTOR, TRANSCRIPTION 
REGULATION 




CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


TOXIN HEMORRHAGIN ffi 
METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 
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SEQFOL 
D score 


POLYDENYLATE 
BINDING PROTEIN 1; 
CHAIN: A, B, C, D, E, F, 
G, H; RNA (5- 


POLYDENYLATE 
BINDING PROTEIN 1; 
CHAIN: A, B, C, D, E, F, 
G, H; RNA (5- 
R(*AP*AP*AP*AP*AP*A 
P*AP*AP*AP*AP*A)-3'); 
CHAIN: M,N, 0, P, Q,R, 
S,T; 


SXL-LETHAL PROTEIN; 
CHAIN: A, B; RNA (5 f - 
R(P*GP*UP*UP*GP*UP* 
UP*UP*UP*UP*UP*UP*U 
)- CHAIN: P, Q; 


LEUKOTRIENEA-4 
HYDROLASE; CHAIN: A; 


LEUKOTRIENEA-4 
HYDROLASE; CHAIN: A; 


LEUKOTRIENEA-4 
HYDROLASE; CHAIN: A; 


GELATION FACTOR; 
CHAIN: A, B; 


GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


Compound 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; 
RRM, PROTEIN-RNA COMPLEX, 
GENE REGULATION/RNA 


GENE REGULATION/RNA POLY(A) 
BINDING PROTEIN 1, PABP 1; 
RRM, PROTEIN-RNA COMPLEX, 
GENE REGULATION/RNA 


RNA-BINDING PROTEIN/RNA TRA 
PRE-MRNA; SPLICING 
REGULATION, RNP DOMAIN, RNA 
COMPLEX 


HYDROLASE PROTEIN-INHIBITOR 
COMPLEX, ALPHA-BETA 
PROTEIN 


HYDROLASE PROTEIN-INHIBITOR 
COMPLEX, ALPHA-BETA 
PROTEIN 


HYDROLASE PROTEIN-INHIBITOR 
COMPLEX, ALPHA-BETA 
PROTEIN 


ACTIN BINDING PROTEIN ACTIN 
BINDING PROTEIN 120; ACTIN 
BINDING PROTEIN, 
IMMUNOGLOBULIN, GELATION 
FACTOR, ABP-2 120 


PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 
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Verify 
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PMF 
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111.65 




110.88 






136.83 


50.63 








SEQFOL 
D score 


HYDROLASE 
INHIBITOR(SERINE 


HYDROLASE 
INHIBITOR(SERINE 
PROTEINASE) HORSE 
LEUKOCYTE ELASTASE 
INHIBITOR (HLEI)1HLE 
3 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; ALPHA-l- 
ANTITRYPSIN; CHAIN: 
B; TRYPSIN; CHAIN: C; 


ALPHA-l-ANTITRYPSIN; 
CHAIN: A; ALPHA-l- 
ANTITRYPSIN; CHAIN: 
B; TRYPSIN; CHAIN: C; 


ANTITHROMBIN-III; 
CHAIN: I; 

ANTITHROMBIN-III; 
CHAIN: L; 


ANTITHROMBIN-III; 
CHAIN: I; 

ANTITHROMBIN-III; 
CHAIN: L; 


PLASMINOGEN 
ACTIVATOR INHIBITOR- 
1; CHAIN: A, B; 


PLASMINOGEN 
ACTIVATOR INHIBITOR- 
1; CHAIN: A, B; 


PLASMINOGEN 
ACTIVATOR INHIBITOR- 
1; CHAIN: A, B; 


2; CHAIN: A; j 


Compound 






HYDROLASE/HYDROLASE 
INHIBITOR PROTEASE-INHIBITOR 
COMPLEX, SERPIN, ALPHA-l- 
ANTITRYPSIN, 2 TRYPSIN 


HYDROLASE/HYDROLASE 
INHIBITOR PROTEASE-INHIBITOR 
COMPLEX, SERPIN, ALPHA-l- 
ANTITRYPSIN, 2 TRYPSIN ; 


SERPIN SERPIN 


SERPIN SERPIN 


HYDROLASE INHIBITOR NATIVE 
SERPIN, HYDROLASE INHIBITOR 


HYDROLASE INHIBITOR NATIVE 
SERPIN, HYDROLASE INHIBITOR 


HYDROLASE INHIBITOR NATIVE 
SERPIN, HYDROLASE INHIBITOR 
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SEQFOL 
D score 


U2 RNA HAIRPIN IV; 
CHAIN: Q, R; U2 A'; 


U2RM 
CHAIN 
CHAIN 
CHAIN 


U2RW 
CHAIN 
CHAIN 
CHAIN 


U2 RW 
CHAIN 
CHAIN 
CHAIN 


CHAIN 
CHAIN 
CHAIN 


U2RM 
CHAIN: 
CHAIN: 
CHAIN: 


:NIVH0 
: NIVH0 
: NIVH0 


U2RN^ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RN^ 
CHAIN: 
CHAIN: 
CHAIN: 


Compound 


^HAIRPIN IV; 
:Q,R; U2 A'; 
: A, C; U2 B"; 
:B, D; 


^ HAIRPIN IV; 
:Q,R;U2A>; 
: A, C; U2 B"; 
:B,D; 


go > p a 
to to a 

CO > z 


co > p a 

to to « 
CO > ^ 


i HAIRPIN IV; 
:Q,R;U2A'; 
:A, C; U2B M ; 
:B, D; 


w > p a 

tO to fcj 

00 > ^ 


oo > p ffl 

a n ^ S 

tO tO M 

00 > 25 


.HAIRPIN IV; 
Q,R;U2A'; 

A, C; U2B H ; 

B, D; 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNPjRIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 
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SEQFOL 
D score 


TPR2A-D0MAIN OF HOP; 
CHAIN: A; HSP90- 
PEPTIDE MEEVD; 


KARYOPHERIN ALPHA; 
CHAIN: A, B; MYC 
PROTO-ONCOGENE 
PROTEIN; CHAIN: C, D, 
E,F; 


KARYOPHERIN ALPHA; 
CHAIN: A, B; MYC 
PROTO-ONCOGENE 
PROTEIN; CHAIN: C, D, 
E,F; 


TROPOMYOSIN; CHAIN: 
A, B, C, D 


PROTEIN PHOSPHATASE 
PP2A; CHAIN: A, B; 


PROTEIN PHOSPHATASE 
PP2A; CHAIN: A, B; 


SERINE/THREONINE 
PROTEIN PHOSPHATASE 
5; CHAIN: NULL; 


FACTOR PML; CHAIN: 
| NULL; | 

i ! 


Compound 


CHAPERONE HOP, TPR-DOMAIN, 
PEPTIDE-COMPLEX, HELICAL 
REPEAT, HSP90, 2 PROTEIN 


TRANSPORT PROTEIN SERINE- 
RICH RNA POLYMERASE I 
SUPPRESSOR PROTEIN; ARM 
REPEAT 


TRANSPORT PROTEIN SERINE- 
RICH RNA POLYMERASE I 
SUPPRESSOR PROTEIN; ARM 
REPEAT 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


SCAFFOLD PROTEIN SCAFFOLD 
PROTEIN, PP2A, 
PHOSPHORYLATION, HEAT 
REPEAT 


SCAFFOLD PROTEIN SCAFFOLD 
PROTEIN, PP2A, 
PHOSPHORYLATION, HEAT 
REPEAT 


HYDROLASE 

TETRATRICOPEPTIDE, TRP; 
HYDROLASE, PHOSPHATASE, 
PROTEIN-PROTEIN 
INTERACTIONS, TPR, 2 SUPER- 
HELIX, X-RAY STRUCTURE 


PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION 
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SEQFOL 
D score 


i CUTINASE; CHAIN: 


CUTINASE; CHAIN: 
NULL; 


CUTINASE; CHAIN: 
NULL; 


CUTINASE; CHAIN: 
NULL; 


CYTOCHROME BCl 
COMPLEX; CHAIN: A, B, 
C,D,E,F, G,H,I,J,K; 


CRPl; CHAIN: A; 


AMINOPEPTIDASE; 
CHAIN: A, B, C, D, E, F; 


INOSINE 

MONOPHOSPHATE 
DEHYDROGENASE 2; 
CHAIN: A, B; 


INOSINE 

MONOPHOSPHATE 
DEHYDROGENASE 2; 
CHAIN: A, B; 




Compound 


| SERINE ESTERASE HYDROLASE, 


SERINE ESTERASE HYDROLASE, 
SERINE ESTERASE, 
GLYCOPROTEIN 


SERINE ESTERASE HYDROLASE, 
SERINE ESTERASE, 
GLYCOPROTEIN 


SERINE ESTERASE HYDROLASE, 
SERINE ESTERASE, 
GLYCOPROTEIN 


ELECTRON TRANSPORT 
UBIQUINOL CYTOCHROME C 
OXIDOREDUCTASE, COMPLEX 
ELECTRON TRANSPORT, 
CYTOCHROME, MEMBRANE 
PROTEIN 


CONTRACTILE LIM DOMAIN, 
CRP, NMR, MUSCLE 
DIFFERENTIATION, 
CONTRACTILE 


HYDROLASE DMPA; 
HYDROLASE, PEPTIDE 
DEGRADATION, NTN 
HYDROLASE 


DEHYDROGENASE MU, 1MPJJH; 
DEHYDROGENASE, IMPD, IMPDH, 
GUANINE NUCLEOTIDE 
SYNTHESIS 


DEHYDROGENASE IMPD, IMPDH; 
DEHYDROGENASE, IMPD, IMPDH, 
GUANINE NUCLEOTIDE 
SYNTHESIS 


OXIDOREDUCTASE, TETRAMER, 
C4-TETRAMER 
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SEQFOL 
D score 


| PHENOL 


PHENOL 

HYDROXYLASE; CHAIN: 
A, B, C, D; 


SACCHAROPINE 
REDUCTASE; CHAIN: A; 


OXIDOREDUCTASE (CH- 
OH(D> 

CYTOCHROME(A)) 
FLAVOCYTOCHROME 
$B=2=(E.C.1.1.2.3) IFCB 


OXIDOREDUCTASE (CH- 
OH(D)- 

CYTOCHROME(A)) 
FLAVOCYTOCHROME 
$B-2=(E.C.1.1.2.3) IFCB 


KDPG ALDOLASE; 
CHAIN: A, B, C; 


KDPG ALDOLASE; 
CHAIN: A, B, C; 


KDPG ALDOLASE; 
CHAIN: A, B, C; 


INOSINE 5'- 
MONOPHOSPHATE 
DEHYDROGENASE; 
CHAIN: A, B; 


DECARBOXYLASE; 
CHAIN: A, B, C, D; 


Compound 


FLAVIN FLAVIN, PHENOL 


FLAVIN FLAVIN, PHENOL 
HYDROXYLASE, 
MONOOXYGENASE, 
OXIDOREDUCTASE 


OXIDOREDUCTASE LYSINE 
BIOSYNTHESIS, ALPHA- 
AMINOADIPATE PATHWAY, 2 
SACCHAROPINE REDUCTASE, 
DEHYDROGENASE 






LYASE BETA BARREL, TRIMER, 
CARBINOLAMINE 


LYASE BETA BARREL, TRIMER, 
CARBINOLAMINE 


LYASE BETA BARREL, TRIMER, 
CARBINOLAMINE 


OXIDOREDUCTASE ALPHA-BETA 
BARREL, TIM BARREL, IMPDH, 
IMP DEHYDROGENASE, 2 LOOP-6, 
PURINE BIOSYNTHESIS 
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SEQFOL 
D score 


ALKALINE PROTEASE; 
1 KAP 4 CHAIN: P; IKAP 5 


CYSTEINE RICH 
INTESTINAL PROTEIN; ! 
CHAIN: NULL; 


TRYPTOPHAN 
SYNTHASE ALPHA- 
SUBUNIT; CHAIN: A, B; 


SERINE-CARBOXYL 
PROTEINASE; CHAIN: A; 
FRAGMENT OF 
TYROSTATIN; CHAIN: I; 


SERINE-CARBOXYL 
PROTEINASE; CHAIN: A; 
FRAGMENT OF 
TYROSTATIN; CHAIN: I; 


SERINE-CARBOXYL 
PROTEINASE; CHAIN: A; 
FRAGMENT OF 
TYROSTATIN; CHAIN: I; 


SERINE-CARBOXYL 
PROTEINASE; CHAIN: A; 
FRAGMENT OF 
TYROSTATIN; CHAIN: I; 


PHENOL 

HYDROXYLASE; CHAIN: 
A, B,C,D; 


HYDROXYLASE; CHAIN: 
A, B, C, D; 


Compound 


ZINC METALLOPROTEASE P. 
AERUGINOSA ALKALINE 


METAL-BINDING PROTEIN CRIP; 
METAL-BINDING PROTEIN, LIM 
DOMAIN PROTEIN 


LYASE TRYPTOPHAN SYNTHASE 
ALPHA-SUBUNIT, 
HYPERTHERMOPHILE, 2 
PYROCOCCUS FURIOSUS, X-RAY 
ANALYSIS, STABILITY, 
CALORIMETRY 


HYDROLASE PSCP, 
PSEUDOMONAPEPSIN, 
PEPSTATIN-INSENSITIVE SERINE- 
CARBOXYL PROTEINASE 


HYDROLASE PSCP, 
PSEUDOMONAPEPSIN, 
PEPSTATIN-INSENSITIVE SERINE- 
CARBOXYL PROTEINASE 


HYDROLASE PSCP, 
PSEUDOMONAPEPSIN, 
PEPSTATIN-INSENSITIVE SERINE- 
CARBOXYL PROTEINASE 


HYDROLASE PSCP, 
PSEUDOMONAPEPSIN, 
PEPSTATIN-INSENSITIVE SERINE- 
CARBOXYL PROTEINASE 


FLAVIN FLAVIN, PHENOL 
HYDROXYLASE, 
, MONOOXYGENASE, 
OXIDOREDUCTASE 


HYDROXYLASE, 

MONOOXYGENASE, 

OXIDOREDUCTASE 
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SEQFOL 
D score 


LOW-DENSITY 
LIPOPROTEIN 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR; CHAIN: A; 


SERUM ALBUMIN; 
CHAIN: A, B; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, 
H; SOLUBLE TISSUE 
FACTOR; CHAIN: T,U; D- 
PHE-PHE-ARG- 
CHLOROMETHYLKETO 
NE (DFFRCMK) WITH 
CHAIN: C; 


TROPOMYOSIN; CHAIN: 
A,B,C,D 


TROPOMYOSIN; CHAIN: 
A, B, C,D 


TROPOMYOSIN; CHAIN: 
A,B,C,D 


TOLL-LIKE RECEPTOR 2; 
CHAIN: A; 


TOLL-LIKE RECEPTOR 1; 
CHAIN: A; 


RECEPTOR; CHAIN: A; 


Compound 


LIPID BINDING PROTEIN LDL 
RECEPTOR; BETA HAIRPIN, 3-10 


LIPID BINDING PROTEIN LUL 
RECEPTOR; BETA HAIRPIN, 3-10 
HELIX, CALCIUM BINDING 


CARRIER PROTEIN CAKKlbK 
PROTEIN, ALBUMIN, GENERAL 
ANESTHETIC, PROPOFOL 


BLOOD COAGULATION, SEK1NL 
PROTEASE, COMPLEX, CO- 
FACTOR, 2 RECEPTOR ENZYME, 
INHIBITOR, GLA, EGF, 3 
COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


CONTRACTILE PRO TE1N 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


SIGNALING PROTEIN BETA- 
ALPHA-BETAFOLD 


SIGNALING PROTEIN BETA- 
ALPHA-BETA FOLD PARALLEL 
BETA SHEET 


HELIX, CALCIUM BINDING | 
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68.47 






SEQFOL 
D score 


| TRYPSIN II, ANIONIC; 


DUODENASE; CHAIN: A; 


HYDROLASE (SERINE 
PROTEASE) PORCINE E- 
TRYPSIN (E.C.3A21.4) 
1EPT3 


COMPLEMENT CIS 
COMPONENT; CHAIN: A; 


ELASTASE; 1ELT 4 
CHAIN: NULL; 1ELT5 


DELTA- 

CHYMOTRYPSIN; 
CHAIN: A, C; DELTA- 
CHYMOTRYPSIN; 
CHAIN: B, D; 


PLASMINOGEN; CHAIN: 
A,B,C,D; 


THROMBIN; CHAIN: A; 
DECAPEPTIDE 
INHIBITOR; CHAIN: I; 


HEPARIN BINDING 
PROTEIN; CHAIN: NULL; 


TRYPSIN; CHAIN: A, B, 
C, D; 




Compound 


| HYDROLASE/HYDROLASE 


HYDROLASE BOVINE 
DUODENASE, SERINE PROTEASE, 
DUAL SPECIFICITY, 2 CRYSTAL 
STRUCTURE 




HYDROLASE TRYPSIN-LIKE 
SERIN PROTEASE, CCP (OR SUSHI 
ORSCR)MODULE 


SERINE PROTEINASE 


HYDROLASE DELTA- i 
CHYMOTRYPSIN, PEPTIDIC 
INHIBIOR, CHLOROMETHYL 
KETONE 


BLOOD CLOTTING 
PLASMINOGEN, CATALYTIC 
DOMAIN 


HYDROLASE/HYDROLASE 
INHIBITOR HYDROLASE, 
THROMBIN, THROMBIN 
INHIBITOR 


SERINE PROTEASE HOMOLOG 
CAP37, AZUROCIDIN; SERINE 
PROTEASE HOMOLOG, 
ENDOTOXIN BINDING, HEPARIN 


SERINE PROTEASE SERINE 
PROTEINASE, TRYPSIN, 
HYDROLASE 


PHOSPHORYLASE 2 (PNPASE) | 
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SEQFOL 
D score 


FACTOR IXA; CHAIN: C, 
L,; D-PHE-PRO-ARG; 
CHAIN: I; 


GRANZYME B; CHAIN: 
A; ACETYL-ISOLEUCYL- 
GLUTAMYL-PROLYL- 
ASPARTYL- CHAIN: B; 


| HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H7.0) 
1GCT3 


PR3; CHAIN: A, B, C, D; 


TRYPSIN; CHAIN: A; 


NATURAL KILLER CELL 
PROTEASE 1; CHAIN: A, 
B; ECOTIN; CHAIN: C, E; 
ECOTIN; CHAIN: D, F; 


CHAIN: A; PANCREATIC 
TRYPSIN INHIBITOR; 
CHAIN: I; 


Compound 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
CHRISTMAS FACTOR; COMPLEX, 
INHIBITOR, HEMOPHILIA/EGF, 
BLOOD COAGULATION, 2 
PLASMA, SERINE PROTEASE, 
CALCIUM-BINDING, HYDROLASE, 
3 GLYCOPROTEIN 


HYDROLASE ACE-ILE-GLU-PRO- 
ASP-CHO; HYDROLASE 




HYDROLASE (SERINE PROTEASE) 
MYELOBLASTIN; HYDROLASE, 
SERINE PROTEASE, 
GLYCOPROTEIN, ZYMOGEN, 
SIGNAL 


HYDROLASE SERINE PROTEASE, 
HYDROLASE 


HYDROLASE/HYDROLASE 
INHIBITOR GRANZYME B; 
COMPLEX (SERINE 
PROTEASE/INHIBITOR), 
PROTEASE SUBSTRATE 2 
INTERACTIONS, BETA STRAND 
STRUCTURE, CHYMOTRYPSIN 
FOLD, 3 GRANZYME B. ECOTIN 


INHIBITOR BPTI SERINE 
PROTEASE, TRYPSIN PRECURSOR 
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SEQFOL 
D score- 


ALPHA-ACTININ 2; 
CHAIN: A, B; 


TROPOMYOSIN; CHAIN: 
A, B, C, D 


EXTRACELLULAR 
RIBONUCLEASE LE; 
CHAIN: A; 


EXTRACELLULAR 
RIBONUCLEASE LE; 
CHAIN: A; 


RIBONUCLEASE MCI; 
CHAIN: A; 


RIBONUCLEASE MCI; 
CHAIN: A; 


ASPARTATE 
CARBAMOYLTRANSFER 
ASE; CHAIN: A, B, C; 


ASPARTATE 
CARBAMOYLTRANSFER 
ASE; CHAIN: A, B, C; 


ASPARTATE 
CARBAMOYLTRANSFER 
ASE (ASPARTATE 
TRANSCARBAMYLASE) 
2ATC4(E.C.2.1.3.2)2ATC 


Compound 


TRIPLE-HELIX COILED COIL 
ALPHA ACTININ SKELETAL 
MUSCLE ISOFORM 2, TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN, 
MUSCLE, 2 Z-LINE, ACTIN- 


CONTRACTILE PROTEIN 
TROPOMYOSIN COILED-COIL 
ALPHA-HELICAL, CONTRACTILE 
PROTEIN 


HYDROLASE ALPHA PLUS BETA 


HYDROLASE ALPHA PLUS BETA 


HYDROLASE RNASE MCI; 
HYDROLASE (NUCLEIC ACID, 
RNA) 


HYDROLASE RNASE MCI; 
HYDROLASE (NUCLEIC ACID, 
RNA) 


TRANSFERASE (CARBAMOYL-P, 
ASPARTATE) TRANSFERASE 
(CARBAMOYL-P, ASPARTATE) 


TRANSFERASE (CARBAMOYL-P, 
ASPARTATE) TRANSFERASE 
(CARBAMOYL-P, ASPARTATE) 
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SEQFOL 
D score 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


METALLOTHIONEIN 
METALLOTHIONEIN 
ISOFORMII4MT2 3 


ANTISTASIN; CHAIN: 
NULL; 


LAMININ; 


LAMININ; 


LAMININ; 


LAMININ; 


INSULIN-] 
FACTOR I 
CHAIN: A: 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B, C; 


COMPLEMENT 
CONTROL PROTEIN; 
CHAIN: A, B,C; 


Compound 


; CHAIN: 


1 CHAIN: 


, CHAIN: 


CHAIN: 


LIKE GROWTH 
IECEPT0R 1; 


: NULL: 


: NULL: 


;NULL: 


NULL: 


















SERINE PROTEASE INHIBITOR 
FACTOR XA INHIBITOR; 
ANTISTASIN, CRYSTAL 
STRUCTURE, FACTOR XA 
INHIBITOR, 2 SERINE PROTEASE 
INHIBITOR, THROMBOSIS 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


GLYCOPROTEIN GLYCOPROTEIN 


HORMONE RECEPTOR HORMONE 
RECEPTOR, INSULIN RECEPTOR 
FAMILY 


IMMUNE SYSTEM BETA, MODULE 


IMMUNE SYSTEM BETA, MODULE 
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SEQFOL 
D score 


VONWILLEBRAND 
FACTOR; CHAIN: A, B; 


VON WILLEBRAND 
FACTOR; CHAIN: A, B; 


VONWILLEBRAND 
FACTOR; CHAIN: A, B; 


INTEGRIN ALPHA 2 
BETA; CHAIN: A, B; 


INTEGRIN ALPHA 2 
BETA; CHAIN: A, B; 


INTEGRIN ALPHA 2 
BETA; CHAIN: A, B; 


INTEGRIN ALPHA-1; 
CHAIN: A, B; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA 3 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


Compound 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 


COLLAGEN-BINDING COLLAGEN- 
BINDING, HEMOSTASIS, 
DINUCLEOTIDE BINDING FOLD 


INTEGRIN INTEGRIN, CELL 
ADHESION, GLYCOPROTEIN 


INTEGRIN INTEGRIN, CELL 
ADHESION, GLYCOPROTEIN 


INTEGRIN INTEGRIN, CELL 
ADHESION, GLYCOPROTEIN 


STRUCTURAL PROTEIN I- 
DOMAIN, METAL BINDING, 
COLLAGEN, ADHESION 
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SEQFOL 
D score 


FAB HGR-2 F6; CHAIN: L; 
FAB HGR-2 F6; CHAIN: 
H; 


IGM-KAPPACOLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPA COLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


IGM-KAPPACOLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPACOLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


IGM-KAPPACOLD 
AGGLUTININ (LIGHT 
CHAIN); CHAIN: A, C; 
IGM-KAPPACOLD 
AGGLUTININ (HEAVY 
CHAIN); CHAIN: B, D; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


V 


CHAIN: L. H: S 


Compound 


IMMUNE SYSTEM GLUCAGON 
RECEPTOR, MONOCLONAL 
ANTIBODY, FAB, RECEPTOR 2 
ANTAGONIST, TYPICAL 
IMMUNOGLOBULIN FOLD, LIGHT 
CHAIN, HEAVY 3 CHAIN, 
ANTIGEN BINDING SITE, 
COMPLEMENTARITY- 
DETERMINING 4 REGIONS 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


IMMUNE SYSTEM FAB, IGM, 
ANTIBODY, COLD AGGLUTININ, 
HUMAN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


(IMMUNOGLOBULIN); 
IMMUNOGLOBULIN C REGION, 
GLYCOPROTEIN, 
TRANSMEMBRANE 
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SEQFOL 
D score 


COMPLEX(ANTIBODY- 
ANTIGEN) FV 
FRAGMENT (IGGl, 
KAPPA) (LIGHT AND 
HEAVY VARIABLE 


LYSOZYME BINDING IG 
KAPPA CHAIN; CHAIN: 
L; IGGl FAB CHAIN H; 
CHAIN: H; LYSOZYME C; 
CHAIN: Y; 


IMMUNOGLOBULIN IGG 
FAB (IGG2B, KAPPA) 
FRAGMENT (40-50 FAB) 
COMPLEXED WITH IIBG 
3 OUABAIN 1IBG4 


IGG2A KAPPA 
ANTIBODY CB41 (LIGHT 
CHAIN); CHAIN: A; 
IGG2A KAPPA 
ANTIBODY CB41 
(HEAVY CHAIN); CHAIN: 
B; PEPTIDE 5: CHAIN: C: 


MUTANT AL2 6E7S9G; 
CHAIN: A; 


THIOREDOXINM; 
CHAIN: A, B; 


ANTIBODY FAB 
FRAGMENT (LIGHT 
CHAIN); CHAIN: L; 
ANTIBODY FAB 
FRAGMENT (HEAVY 
CHAIN); CHAIN: H 


FRAGMENT (HEAVY 
CHAIN); CHAIN: H 


Compound 




PROTEIN BINDING/HYDROLASE 
ANTIGEN-ANTIBODY COMPLEX, 
HYHEL-10, ANTI-HEN EGG WHITE 
2 LYSOZYME ANTIBODY 




COMPLEX (ANTIBODY/PEPTIDE) 
POLYSPECIFICITY, 
CROSSREACTIVITY, FAB- 
FRAGMENT, PEPTIDE, 2 HIV-1 


ANTIBODY ANTIBODY, 
FRAMEWORK 


ELECTRON TRANSPORT ] 

RT.F.CTROInJ TT? AMSPHPT 


IMMUNE SYSTEM MONOCLONAL 
ANTIBODY, ANTIGEN-BINDING 
FRAGMENT, INTERLEUKIN- 2 2, 
X-RAY ANALYSIS, CRYSTAL 
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IMMUNOGLOBULIN 
IMMUNOGLOBULIN 
FAB' NEW (LAMBDA 
LIGHT CHAIN) 7FAB 3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN 
FAB' NEW (LAMBDA 
LIGHT CHAIN) 7FAB3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN 
FAB' NEW (LAMBDA 
LIGHT CHAIN) 7FAB 3 


IGGl ANTIBODY 32C2; 
CHAIN: A; IGGl 
ANTIBODY 32C2; 
CHAIN: B; 


ELECTRON TRANSPORT 
THIOREDOXIN 2TRXA 2 
2TRXA3 


SM3 ANTIBODY; CHAIN: 
L,H; PEPTIDE EPITOPE; 
CHAIN: P; 


ANTIBODY; CHAIN: H, L; 
PROTEIN G-PRIME; 
CHAIN: A; MAJOR 
OUTER MEMBRANE 
PROTEIN PI. 16; CHAIN: 

P; 




Compound 








IMMUNE SYSTEM FAB, 
ANTIBODY, AROMATASE, P450 




COMPLEX (ANTIBODY/PEPTIDE 
EPITOPE) ANTIBODY, PEPTIDE 
ANTIGEN, ANTITUMOR 
ANTIBODY, 2 COMPLEX 
(ANTIBODY/PEPTIDE EPITOPE) 


IMMUNE SYSTEM FAB, PORA, 
NEISSERIA MENINGITIDIS, PORIN 


STRAIN B31 


N), OUTER SURFACE 2 PROTEIN A 
COMPLEXED WITH FAB184.1, 
BORRRLTA BURGDORFERI 3 
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D score 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN F; 
CHAIN: A; 


THIOREDOXIN F; 
CHAIN: A, B; 


THIOREDOXIN; CHAIN: 
NULL; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


CHLOROPLAST 
THIOREDOXIN MCH2; 
CHAIN: A; 


THIOREDOXIN; CHAIN: 
A; REF-1 PEPTIDE; 
CHAIN: B; 


CHAIN: B; | 


Compound 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


OXIDOREDUCTASE DIMER, 
THIOREDOXIN, X-RAY 
CRYSTALLOGRAPHY, 
OXIDOREDUCTASE 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
| THIOREDOXIN 


OXIDOREDUCTASE 
1 THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


OXIDOREDUCTASE 
THIOREDOXIN M, THIOREDOXIN 
CH2, CHLOROPLASTIC 
THIOREDOXIN 


COMPLEX (ELECTRON 
! TRANSPORT/PEPTIDE) COMPLEX, 
I ELECTRON TRANSPORT/PEPTIDE 


ELECTRON TRANSPORT/PEPTIDE I 
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SEQFOL 
D score 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


PROTEIN DISULFIDE 
ISOMERASE; CHAIN: 
NULL; 


THIOREDOXINM; 
CHAIN: A, B; 


THIOREDOXINM; 
CHAIN: A, B; 


THIOREDOXINM; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


THIOREDOXIN M; 
CHAIN: A, B; 


Compound 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REE 
ACTIVE CENTER, ISOMERAS! 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REE 
ACTIVE CENTER, ISOMERASI 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, REE 
ACTIVE CENTER, ISOMERASI 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, RED 
ACTIVE CENTER, ISOMERASI 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT, RED' 
ACTIVE CENTER, ISOMERASE 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT^ 
ELECTRON TRANSPORT, RED* 
ACTIVE CENTER, ISOMERASE 
ENDOPLASMIC RETICULUM 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 


ELECTRON TRANSPORT 
ELECTRON TRANSPORT 
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D score 


PROTEIN KINASE C 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB 3 


FC GAMMA RIIB; 
CHAIN: A; 


FC GAMMA RIIB; 
CHAIN: A; 


MHC CLASS INK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS INK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


MHC CLASS INK CELL 
RECEPTOR PRECURSOR; 
CHAIN: A; 


T-CELL SURFACE 
GLYCOPROTEIN CD4; 
CHAIN: A, B; 




Compound 


CALCIUM-BINDING PROTEIN 




IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 


IMMUNE SYSTEM CD32; 
RECEPTOR, FC, CD32, IMMUNE 
SYSTEM 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
i IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


IMMUNE SYSTEM P58 NATURAL 
KILLER CELL RECEPTOR; KIR, 
NATURAL KILLER RECEPTOR, 
INHIBITORY RECEPTOR, 2 
IMMUNOGLOBULIN 


GLYCOPROTEIN CD4; 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, T-CELL, 2 MHC 
LIPOPROTEIN. POLYMORPHISM 


LIPOPROTEIN. POLYMORPHISM 1 
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D score 


COMPLEMENT FACTOR 
HUMAN COMPLEMENT 
REGULATORY PROTEIN 
CD59 (EXTRACELLULAR 
1ERG 3 REGION, 


17-BETA- 

HYDROXYSTEROID- 
DEHYDROGENASE; 
CHAIN: NULL; 


17-BETA- 

HYDROXYSTEROID- 
DEHYDROGENASE; 
CHAIN: NULL; 


BETA-KETO ACYL 
CARRIER PROTEIN 
REDUCTASE; CHAIN: A; 


BETA-KETO ACYL 
CARRIER PROTEIN 
REDUCTASE; CHAIN: A; 


IMMUNOGLOBULIN FV 
FRAGMENT OF A 
HUMANIZED VERSION 
OF THE ANTI-CD18 IFGV 
3 ANTIBODY H52' 
(HUH52-AAFV)1FGV4 


IGMRF2A2; CHAIN: A, 
C, E; IGMRF2A2; 
CHAIN: B, D, F; 
IMMUNOGLOBULIN G 
BINDING PROTEIN A; 
CHAIN: G, H; 


CHAIN: D, H; 
CONALBUMIN PEPTIDE; 
CHAIN: P, Q; 


Compound 




DEHYDROGENASE 
DEHYDROGENASE, 17-BETA- 
HYDROXYSTEROID 


DEHYDROGENASE 
DEHYDROGENASE, 17-BETA- 
HYDROXYSTEROID 


OXIDOREDUCTASE NUCLEOTIDE 
FOLD, ROSSMANN FOLD 


OXIDOREDUCTASE NUCLEOTIDE 
FOLD, ROSSMANN FOLD 




IMMUNE SYSTEM FAB-IBP 
COMPLEX CRYSTAL STRUCTURE 
2.7A RESOLUTION BINDING 2 
OUTSIDE THE ANTIGEN 
COMBINING SITE SUPERANTIGEN 
FAB VH3 3 SPECIFICITY 
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Verify 1 
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PMF I 


| 82.54 














SEQFOL 
D score 


1 PHOSPHOTRANSFERASE | TRANSFERASE PROTO- | 


TYROSINE-PROTEIN 
KINASE SRC; CHAIN: 
NULL; 


TYROSINE-PROTEIN 
KINASE SRC; CHAIN: 
NULL; 


VASCULAR ! 
ENDOTHELIAL 
GROWTH FACTOR 
RECEPTOR CHAIN: A; 


HAEMATOPOETIC CELL 
KINASE (HCK); CHAIN: 
A; 


PHOSPHORYLASE 
KINASE; CHAIN: NULL; 


SUBSTRATE; CHAIN: B; 


Compound 


PHOSPHOTRANSFERASE C-SRC, 
P60-SRC; SRC, TYROSINE KINASE, 
PHOSPHORYLATION, SH2, SH3, 2 
PHOSPHOTYROSINE, PROTO- 
ONCOGENE, 

PHOSPHOTRANSFERASE 


PHOSPHOTRANSFERASE C-SRC, 
P60-SRC; SRC, TYROSINE KINASE, 
PHOSPHORYLATION, SH2, SH3, 2 
PHOSPHOTYROSINE, PROTO- 
ONCOGENE, 

PHOSPHOTRANSFERASE 


TRANSFERASE KDR; TYROSINE 
KINASE 


TYROSINE KINASE TYROSINE 
KINASE-INHIBITOR COMPLEX, 
DOWN-REGULATED KINASE, 2 
ORDERED ACTIVATION LOOP ! 


KINASE RABBIT MUSCLE 
PHOSPHORYLASE KINASE; 
GLYCOGEN METABOLISM, 
TRANSFERASE, 

SERINE/THREONINE-PROTEIN, 2 
KINASE, ATP-BINDING, 
CALMODULIN-BINDING 


TYROSINE KINASE, SIGNAL 
TRANSDUCTION, 
PHOSPHOTRANSFERASE, 2 
COMPLEX (KINASE/PEPTIDE 
SUBSTRATE/ATP ANALOG), 
ENZYME, 3 COMPLEX 
(TRANSFERASE/SUBSTRATE) 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



992 




WO 03/054152 



PCT/US02/39555 



993 



oo 
co 




to 
o 
-J 



o 
to 

oo 




o\ 
oo 
oo 



to 







£1 



00 
oo 





00. 
oo 



o 





r< L - 



r 1 

o 
to 



WO 03/054152 



PCT/US02/39555 



994 



1693 


1693 


1693 


1693 


1693 


1693 


1693 


1688 




2 H ES 


la9n 


la9n 


la9n 


vo 
P 


la9n 


VO 

P 


la4y 


ltud 




& 


-a 

o 

3d 


O 


O 


> 


> 


> 


> 


> 






NID 




LA 
i— » 


1 


OS 
0\ 

vo 


OS 
vo 


L/i 


to 


O 
OS 


LO 
-J 




STAR 
TAA 


s 


os 


-J 

On 
O 


1 




L/i 

OO 

^1 


5? 

t— » 


OO 

oo 






to 

'f 

I— 1 

VO 


3.5e-17 


3e-09 


to 

CD 
i 

OS 


to 

CD 
i 

to 
to 


1.2e-14 


Lh 

VO 
CD 
LO 
O 


to 

z 




Psi Blast 


| 0.10 


0.32 


0.17 


0.32 


0.07 


0.15 


0.19 


0.37 




score 


Verify i 


o 
to 
o 


-0.20 


-0.20 


<b 
to 
o 


<b 
to 
o 


-0.20 


-0.20 


-0.20 




score 


PMF 1 




















SEQFOL 
D score 


| U2 RNA HAIRPIN IV; 


U2RN7 
CHAIN 
CHAIN 
CHAIN 


mm/ 

CHAIN 
CHAIN 
CHAIN 


U2RN* 
CHAIN; 
CHAIN 
CHAIN 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RN4 
CHAIN: 
CHAIN: 
CHAIN: 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


ALPHA-SPECTRIN; 
CHAIN: NULL; 


(SH3 DOMAIN) ISHF 4 | 


Compound 


>l HAIRPIN IV; 
:Q,R; U2 A'; 
: A, C; U2 B"; 
:B, D; 


l HAIRPIN IV; 
:Q,R;U2A' ; 
: A, C; U2 B"; 


i HAIRPIN IV; 
:Q,R;U2A'; 
: A, C; U2 B"; 
;B, D; 


l HAIRPIN IV; 

:Q, R; U2 A'; 
: A, C; U2 B"; 
:B, D; 


.HAIRPIN IV; 
Q, R; U2 A'; 

A, C; U2 B"; 

B, D; 


COMPLEX (NUCLEAR j 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX 

(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 


CYTOSKELETON CAPPING 
PROTEIN, CALCIUM-BINDING, 
DUPLICATION, REPEAT, 2 SH3 
DOMAIN, CYTOSKELETON 
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SEQFOL 
D score 


GRB2; CHAIN: A; SOS-1; 
CHAIN: B; 


PROTO-ONCOGENE 
TYROSINE-PROTEIN 
KINASE FYN; CHAIN: A, 
B; 


PHOSPHOTRANSFERASE 
FYN; CHAIN: A; 3BP-2; 
CHAIN: B; 


FYN TYROSINE KINASE; 
CHAIN: A, C; HIV-INEF 
PROTEIN; CHAIN: B, D; 


DOMAIN) (C-CRKSH3-N) 
COMPLEXEDWITH ! 
ICKA 3 C3G PEPTIDE 
(PRO-PRO-PRO-ALA- 
LEU-PRO-PRO-LYS-LYS- 
ARG) ICKA 4 


Compound 


COMPLEX (SIGNAL 
TRANSDUCTION/PEPTIDE) 
COMPLEX (SIGNAL 


TRANSFERASE P59-FYN; BETA 
BARREL, ANTIPARALLEL BETA 
SHEET, ALPHA HELIX, 3-10 2 
HELIX 


TRANSFERASE PROTO- 
ONCOGENE TYROSINE KINASE; 
PROTO-ONCOGENE, 
TRANSFERASE, TYROSINE- 
PROTEIN KINASE, 2 
PHOSPHORYLATION, ATP- 
BINDING, MYRISTYLATION, SH3 
DOMAIN, 3 COMPLEX 
(PHOSPHOTRANSFERASE/PEPTID 
E) 


COMPLEX (SH3 DOMAINMRAL 
ENHANCER) SRC-HOMOLOGY 3 
DOMAIN; COMPLEX (SH3 
DOMAIN/VIRAL ENHANCER), 
PROTO-ONCOGENE, 2 
TRANSFERASE, TYROSINE- 
PROTEIN KINASE, 
PHOSPHORYLATION, 3 AIDS, 
MYRISTYLATION, GTP-BINDING, 
ATP-BINDING, SH3 DOMAIN, 4 
SH2 DOMAIN, PPII HELIX, PXXP 
MOTIF 
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SEQFOL 
D score 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B, C; 


RIBONUCLEASE 
INHIBITOR; CHAIN: 
NULL; 




RIBONUCLEASE 
INHIBITOR; CHAIN: 
NULL; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


NF-KAPPA-B P65; 
CHAIN: A, C; NF-KAPPA- 
BP50; CHAIN: B, D; I- 
KAPPA-B-ALPHA; 
CHAIN: E, F; 


NF-KAPPA-B P65; 
CHAIN: A, C; NF-KAPPA- 
B P50; CHAIN: B, D; I- 
KAPPA-B-ALPHA; 
CHAIN: E, F; 


Compound 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


ACETYLATIONRNASE 
INHIBITOR, 

RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


ACETYLATIONRNASE 
INHIBITOR, 

RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 


COMPLEX (TRANSCRIPTION 
REG/ANK REPEAT) COMPLEX 
(TRANSCRIPTION 
REGULATION/ANK REPEAT), 
ANKYRIN 2 REPEAT HELIX 
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score 
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53.15 
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74.99 




SEQFOL 
D score 


14.3.DT CELL ANTIGEN 
RECEPTOR; IBEC 5 


ANTIBODY (LIGHT 
CHAIN); CHAIN: L; 
ANTIBODY (HEAVY 
CHAIN); CHAIN: H; 


ZINC FINGER PROTEIN 
GLIl; CHAIN: A; DNA; 
CHAIN: C, D; 


ADRl; CHAIN: NULL; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT 
DNA; CHAIN: A, B; 




Compound 


RECEPTOR T CELL RECEPTOR 
IBEC 14 


IMMUNE SYSTEM 
IMMUNOGLOBULIN; 
IMMUNOGLOBULIN ANTIBODY 
ENGINEERING, HUMANIZED AND 
CHIMERIC ANTIBODY, FAB, 2 X- 
RAY STRUCTURE, THREE- 
DIMENSIONAL STRYCTURE, 
GAMMA- 3 INTERFERON, 
IMMUNE SYSTEM 


COMPLEX (DNA-BINDING 
PROTEIN/DNA) FIVE-FINGER GLI; 
GLI, ZINC FINGER, COMPLEX 
(DNA-BINDING PROTEIN/DNA) 


TRANSCRIPTION REGULATION 
TRANSCRIPTION REGULATION, 
ADRl, ZINC FINGER, NMR 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 
1; TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (ZINC FINGER/DNA) | 
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score 




56.22 










SEQFOL 
D score 


FATTY ACID SYNTHESIS 
PROTEIN ACYL 
CARRIER PROTEIN 


FATTY ACID SYNTHESIS 
PROTEIN ACYL 
CARRIER PROTEIN 
(NMR, 2 STRUCTURES) 
1ACP3 


ALPHA-AMYLASE; 
CHAIN: A; 


HISTOCOMPATIBILITY 
ANTIGEN MURINE 
CLASS I MAJOR 
HISTOCOMPATIBILITY 
COMPLEX CONSISTING 
1H0C3 0FH-2I>=B=, 
B2-MICROGLOBULIN, 
AND A 9-RESIDUE 
PEPTIDE 1HOC4 


HLA CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN, DR CHAIN: A; 
HLA CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN, DR-1 CHAIN: 
B; HEMAGGLUTININ 
HAl PEPTIDE CHAIN; 
CHAIN: C; T-CELL 
RECEPTOR ALPHA 
CHAIN; CHAIN: D; T- 
CELL RECEPTOR BETA 
CHAIN; CHAIN: E; 


CHAIN: NULL; 1BEC 6 | 


Compound 






HYDROLASE "MALTOGENIC" 
ALPHA AMYLASE; AMYLASE, 
GLYCOSIDE HYDROLASE, 
STARCH DEGRADATION 




IMMUNE SYSTEM HLA-DRl, DRA; 
HLA-DRl, DRBl 0101; TCRHA1.7 
ALPHA CHAIN; TCR HA1.7 BETA 
CHAIN; PROTEIN-PROTEIN 
COMPLEX, IMMUNOGLOBULIN 
FOLD 
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SEQFOL 
D score 


HIV-l REVERSE 
TRANSCRIPTASE 
(CHAIN A); CHAIN: A; 
HIV-l REVERSE 
TRANSCRIPTASE 
(CHAIN B); CHAIN: B; 
ANTIBODY (LIGHT 


HIV-l REVERSE 
TRANSCRIPTASE (A- 
CHAIN); CHAIN: A; HIV-l 
REVERSE 

TRANSCRIPTASE (B- 
CHAIN); CHAIN: B; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN 2) 9WGA3 


METALLOTHIONEIN CD- 
7 METALLOTHIONEIN-2 
(ALPHA DOMAIN) 
(/NMR$) 1MHUA 2 


CATHEPSIN L: HEAVY 
CHAIN; CHAIN: A, C; 
CATHEPSIN L: LIGHT 
CHAIN; CHAIN: B, D; 
INVARIANT CHAIN; 
CHAIN: I, J; 


INSULIN-LIKE GROWTH 
FACTOR-BINDING 
PROTEIN-5 CHAIN: A; 


(NMR, 2 STRUCTURES) 
1ACP3 


Compound 


TRANSFERASE/IMMUNE 
SYSTEM/DNA HIV-l RT; HIV-l RT; 
HIV, REVERSE TRANSCRIPTASE, 
MET184ILE, 3TC, PROTEIN-DNA 2 
COMPLEX, DRUG RESISTANCE, 
Ml 841, TRANSFERASE/IMMUNE 3 
SYSTEM/DNA 


TRANSFERASE HIV-l REVERSE 
TRANSCRIPTASE, AIDS, NON- 
NUCLEOSIDE INHIBITOR, 2 DRUG 
DESIGN 






HYDROLASE II FRAGMENT, CD74 
FRAGMENT CYSTEINE 
PROTEINASE, CATHEPSIN, MHC 
CLASS II, INVARIANT 2 CHAIN, 
THYROGLOBULIN TYPE-1 
DOMAIN 


HORMONE/GROWTH FACTOR 
MINI-IGFBP-5 MINI-IGFBP-5, 
IGFBP-5, IGF, INSULIN-LIKE 
GROWTH FACTOR 2 BINDING 
PROTEIN, NMR, 
HORMONE/GROWTH FACTOR 
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EPITHELIAL CADHERIN; 


EPITHELIAL CADHERIN; 
CHAIN: NULL; 


EPITHELIAL CADHERIN; 
CHAIN: NULL; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; ! 


N-CADHERIN; INCI 3 


N-CADHERIN; INCI 3 


N-CADHERIN; INCG 3 


N-CADHERIN; INCG 3 


N-CADHERIN; INCG 3 


OUTER SURFACE 
PROTEIN C; CHAIN: A; 


OUTER SURFACE | 
PROTEIN C; CHAIN: A; 


TS; CHAIN: B, D; 


Compound 


| CELL ADHESION UVOMORULIN; 


CELL ADHESION UVOMORULIN; 
CADHERIN, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION UVOMORULIN; 
CADHERIN, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


IMMUNE SYSTEM SURFACE 
PROTEIN, ALPHA HELIX PROTEIN 


UNSTABLE, ELONGATION 
FACTOR FOR TRANSFER, HEAT 
STABLE, ELONGATION FACTOR, 
COMPLEX (TWO ELONGATION 
FACTORS) 
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54.74 
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SEQFOL 
D score 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN FC 


H-2 CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN; CHAIN: A, D; 
MHC CLASS II NOD; 
CHAIN: B, E; LYSOZYME 
C; CHAIN: P, Q; 


H-2 CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN; CHAIN: A; 65 
KD GLUTAMIC ACID 
DECARBOXYLASE+H-2 
CLASS II CHAIN: B; 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC RECEPTOR 
CHAIN: C; FC 
FRAGMENT OF HUMAN 
IGGl; CHAIN: A, B; 


IMMUNOGLOBULIN 
LAMBDA HEAVY 
CHAIN; CHAIN: A, B; 
ENGINEERED PEPTIDE; 
CHAIN: E, F; 


CH3 DOMAIN OF MAK33 
ANTIBODY; CHAIN: A, 
B; 


CH3 DOMAIN OF MAK33 
ANTIBODY; CHAIN: A, 
B; 




Compound 




IMMUNE SYSTEM 
HISTOCOMPATIBILITY ANTIGEN, 
MHC, PEPTIDE COMPLEX 


IMMUNE SYSTEM 
HISTOCOMPATIBILITY ANTIGEN, 
CLASS IIMHCI-A(G7) 


COMPLEX CD16; IGGl-FC 
COMPLEX, FC FRAGMENT, IGG, 
FC, RECEPTOR, CD16, GAMMA 


IMMUNE SYSTEM FC IGG PHAGE 
DISPLAY PEPTIDE 


IMMUNE SYSTEM CONSTANT 
DOMAIN, Cl-SUBSET, 
IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


IMMUNE SYSTEM CONSTANT 
DOMAIN, Cl-SUBSET, 
IMMUNOGLOBULIN, IMMUNE 
SYSTEM 


DOMAIN, B-TREFOIL 
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SEQFOL 
D score 


| HEMOCHROMATOSIS 


MHC CLASS II I-AD; 
CHAIN: A, B; 


IMMUNOGLOBULIN 
$P/F$C(PRIME) 
FRAGMENT OF AN 
IG*G1 1PFC4 


IGGAMMA-2A CHAIN C 
REGION; CHAIN: A, B; 


CLASS II 

HISTOCOMPATIBILITY 
ANTIGEN, M ALPHA 
CHAIN: A; CLASS II 
HISTOCOMPATIBILITY 
ANTIGEN, M BETA 
CHAIN: B; 


MAJOR 

HISTOCOMPATIBILITY 
COMPLEX ALPHA 
CHAIN; CHAIN: A, D; 
MAJOR 

HISTOCOMPATIBILITY 
COMPLEX BETA CHAIN; 
CHAIN: B, E; MYELIN 
BASIC PROTEIN; CHAIN: 
QF; 


MHC CLASS III-EK, 
ALPHA CHAIN; CHAIN: 
A, C; MHC CLASS III-EK, 
BETA CHAIN; CHAIN: B, 
D; 


AND FRAGMENT B OF 
PROTEIN A COMPLEX 
1FC2 4 


Compound 


| METAL TRANSPORT 


MHC II MHC II, CLASS II MHC I- 
AD 




IMMUNE SYSTEM IGG2A; IGG, FC 


IMMUNE SYSTEM RING6, HLA- 
DMA; RING7, HLA-DMB; 
HISTOCOMPATIBILITY PROTEIN, 
IMMUNE SYSTEM 


IMMUNE SYSTEM MHC CLASS II 
DR2A 


IMMUNE SYSTEM 
HISTOCOMPATIBILITY ANTIGEN, 
MHC, PEPTIDE, IMMUNE SYSTEM 
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IMMUNOGLOBULIN FAB 
(IGG2A, KAPPA) 
FRAGMENT (26-10) 


IDIOTYPIC FAB 730.1.4 
(IGGl) OF VIRUS IIAI 5 
CHAIN: L 5 H; IIAI 7 
ANTI-IDIOTYPIC FAB 
409.5.3 (IGG2A); IIAI 9 
CHAIN: M, I IIAI 10 


IDIOTYPIC FAB 730.1.4 
(IGGl) OF VIRUS IIAI 5 
CHAIN: L, H; IIAI 7 
ANTI-IDIOTYPIC FAB 
409.5.3 (IGG2A); IIAI 9 
CHAIN: M } I IIAI 10 


ANTIBODY M41; CHAIN: 
L,H,M,I; 


IMMUNOGLOBULIN FAB 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 


IMMUNOGLOBULIN FAB 
FRAGMENT OF 
HUMANIZED ANTIBODY 
4D5, VERSION 4 IFVD 3 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN Gl 
(KAPPA LIGHT CHAIN) 
FAB' FRAGMENT IFIG 3 


Compound 




COMPLEX (IMMUNOGLOBULIN 
IGG1/IGG2A) 


COMPLEX (IMMUNOGLOBULIN 
IGG1/IGG2A) 


IMMUNOGLOBULIN PROTEIN 
ENGINEERING, ANTIBODY 
DESIGN, IMMUNOGLOBULIN 2 
STRUCTURE, ANTIGEN-BINDING 
SITE, CANONICAL 
CONFORMATION, 3 
COMPLEMENTARITY- 
DETERMINING REGION 
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PMF 
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88.75 




SEQFOL 
D score 


NITROGEN FIXATION 
REGULATORY PROTEIN 
FIXL; CHAIN: A; 


NITROGEN FIXATION 
REGULATORY PROTEIN 
FIXL; CHAIN: A; 


NITROGEN FIXATION 
REGULATORY PROTEIN 
FIXL; CHAIN: A; 


NITROGEN FIXATION 
REGULATORY PROTEIN 
FIXL; CHAIN: A; 


HUMAN ERG 
POTASSIUM CHANNEL; 
CHAIN: A; 


INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1; CHAIN: A; 


INTERFERON-INDUCED 
GUANYLATE-BINDING 
PROTEIN 1; CHAIN: A; 




Compound 


SIGNALING PROTEIN OXYGEN 
SENSOR, HISTIDINE KINASE, PAS, 
HIGH-RESOLUTION, TWO- 2 
COMPONENT SYSTEM 


SIGNALING PROTEIN OXYGEN 
SENSOR, HISTIDINE KINASE, PAS, 
HIGH-RESOLUTION, TWO- 2 
COMPONENT SYSTEM 


SIGNALING PROTEIN OXYGEN 
SENSOR, HISTIDINE KINASE, PAS, 
HIGH-RESOLUTION, TWO- 2 
COMPONENT SYSTEM 


SIGNALING PROTEIN OXYGEN 
SENSOR, HISTIDINE KINASE, PAS, 
HIGH-RESOLUTION, TWO- 2 
COMPONENT SYSTEM 


MEMBRANE PROTEIN PAS 
DOMAIN, POTASSIUM CHANNEL 
DOMAIN, MEMBRANE PROTEIN 


SIGNALING PROTEIN GBP, GTP 
HYDROLYSIS, GDP, GMP, 
INTERFERON INDUCED, 
DYNAMIN 2 RELATED, LARGE 
GTPASE FAMILY. GMPPNP, 
GPPNHP. 


SIGNALING PROTEIN GBP, GTP 
HYDROLYSIS, GDP, GMP, 
INTERFERON INDUCED, 
DYNAMIN 2 RELATED, LARGE 
GTPASE FAMILY. GMPPNP, 
GPPNHP. 


DYNAMIN 2 RELATED, LARGE 
GTPASE FAMILY, SIGNALING 
PROTEIN 
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PMF 
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55.65 












SEQFOL 
D score 


THROMBIN LIGHT 
CHAIN; CHAIN: A, B, C, 
D; THROMBIN HEAVY 


IMMUNOGLOBULIN 
LAMBDA HEAVY 
CHAIN; CHAIN: A, B; 
ENGINEERED PEPTIDE; 
CHAIN: E, F; 


POLIOVIRUS RECEPTOR; 
CHAIN: R; VPl; CHAIN: 
1;VP2; CHAIN: 2;VP3; 
CHAIN: 3; VP4; CHAIN: 4; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: C, D; 


FIBROBLAST GROWTH 
FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: C, D; 


FACTOR 2; CHAIN: A, B; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 1; 
CHAIN: C, D; 


Compound 


SERINE PROTEINASE 
COAGULATION FACTOR II; 
COAGULATION FACTOR II; 


IMMUNE SYSTEM FC IGG PHAGE 
DISPLAY PEPTIDE 


VIRUSMRAL PROTEIN, 
RECEPTOR CD155, PVR, HUMAN 
POLIOVIRUS, ELECTRON 
MICROSCOPY, 2 POLIOVIRUS- 
RECEPTOR COMPLEX, 
VIRUSMRAL PROTEIN, 
RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 


FACTOR RECEPTOR FGF, FGFR, 
IMMUNOGLOBULIN-LIKE, 
SIGNAL TRANSDUCTION, 2 
DIMERIZATION, GROWTH 
FACTOR/GROWTH FACTOR 
RECEPTOR 
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SEQFOL 
D score 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


IMMUNOGLOBULIN 
IMMUNOGLOBULIN FC 
AND FRAGMENT B OF 
PROTEIN A COMPLEX 
1FC2 4 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


IMMUNOGLOBULIN 
EPSILON RECEPTOR 
CHAIN: A; IG EPSILON 
CHAIN C REGION; 
CHAIN: B, D; 


Compound 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, ! 
CD32 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
CD32 




CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN 
FOLD, GLYCOPROTEIN, 
RECEPTOR, IGE-BINDING 2 
PROTEIN, IGE ANTIBODY, IGE-FC 
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-0.11 


t 

o 
o 
oo 
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PMF 
score 












51.87 






SEQFOL 
D score 


NEURAL CELL 
ADHESION MOLECULE; 


INTERLEUKIN-6 
RECEPTOR BETA 
CHAIN; CHAIN: A; 
VIRAL IL-6; CHAIN: B; 


IGGAMMA-2A CHAIN C 
REGION; CHAIN: A, B; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H,K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


IMMUNOGLOBULIN 
HEAVY CHAIN; CHAIN: 
H,K; 

IMMUNOGLOBULIN 
LIGHT CHAIN; CHAIN: L, 
M; 


LDL RECEPTOR; CHAIN: 
A; 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC REGION 
CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; 
CHAIN: A; 


Compound 


CELL ADHESION N-CAM; 
INTERMEDIATE 


CYTOKINE GP130; FUNCTIONAL 
INTERLEUKIN-6 HOMOLOG; 
CYTOKINE/RECEPTOR COMPLEX, 
GP130, VIRAL IL-6, CRYSTAL 2 
STRUCTURE 


IMMUNE SYSTEM IGG2A; IGG, FC j 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


IMMUNE SYSTEM IGG; 
IMMUNOGLOBULIN, ANTIBODY, 
B12 


CELL-SURFACE RECEPTOR CELL- 
SURFACE RECEPTOR, CALCIUM- 
BINDING, EGF-LIKE DOMAIN, 2 
MODULE, APO-E, APO-B, LDL, 
VLDL 


IMMUNE SYSTEM RECEPTOR 
BETA SANDWICH, 
IMMUNOGLOBULIN-LIKE, 
RECEPTOR 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
IMMUNOGLOULIN, LEUKOCYTE, 
CD32 
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0.06 
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PMF 
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143.57 


109.20 










SEQFOL 
D score 


PHOSPHATIDYLINOSITO 
L PHOSPHATE 
PHOSPHATASE; CHAIN: 


PHOSPHATIDYLINOSITO 
L PHOSPHATE 
PHOSPHATASE; CHAIN: 
A; 


PHOSPHATIDYLINOSITO 
L PHOSPHATE 
PHOSPHATASE; CHAIN: 
A; 


PHOSPHATIDYLINOSITO 
L PHOSPHATE 
PHOSPHATASE; CHAIN: 
A; 


DNA-(APURINIC OR 
APYRIMIDINICSITE) 
LYASE; CHAIN: A; 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB3 


IMMUNOGLOBULIN FAB 
FRAGMENT FROM 
HUMAN 

IMMUNOGLOBULIN 
IGGl (LAMBDA, HIL) 
8FAB3 




Compound 


HYDROLASE SYNAPTOJANIN; 
SPSYNAPTOJANIN, IPP5C, IP3, IP2, 


HYDROLASE SYNAPTOJANIN; 
SPSYNAPTOJANIN, IPP5C, IP3, IP2, 


HYDROLASE SYNAPTOJANIN; 
SPSYNAPTOJANIN, IPP5C, IP3, IP2, 


HYDROLASE SYNAPTOJANIN; 
SPSYNAPTOJANIN, IPP5C, IP3, IP2, 


DNA REPAIR AP ENDONUCLEASE 
1, HAPl, REFl, APEl; DNA REPAIR, 
ENDONUCLEASE, APEl, HAPl, 
REF-1 






IMMUNOGLOBULIN FOLD, 
HOMOPHILIC 3 BINDING, CELL 
ADHESION PROTEIN 
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SEQFOL 
D score 


| FIBROBLAST GROWTH 


AXONIN-1; CHAIN: A; 


U2RM 
CHAIN 
CHAIN 
CHAIN 


U2RW 
CHAIN 
CHAIN 
CHAIN 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


U2RNJ* 
CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: 
CHAIN: 
CHAIN: 


Compound 


i HAIRPIN IV; 
:Q,R;U2A'; 
: A, C; U2 B"; 


^ HAIRPIN IV; 
:Q,R;U2 A'; 
: A, C; U2 B"; 


^ HAIRPIN IV; 
:Q, R; U2 A'; 
:A,C;U2B"; 
:B, D; 


l HAIRPIN IV; 
: Q, R; U2 A'; 
: A, C; U2 B"; 
:B, D; 


, HAIRPIN IV; 
: Q, R; U2 A'; 
: A, C; U2 B"; 
:B,D; 


l HAIRPIN IV; 

Q,R;U2A'; 
: A, C; U2 B"; 

B,D; 


to >p ffi 
„0 o ?a C 

to to a 
w > ^ 


w >p 
a p # 

to to 


GROWTH FACTOR/GROWTH 


CELL ADHESION NEURAL CELL 
ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNPjRIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


! PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 
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SEQFOL 
D score 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


NUCLEAR RNA EXPORT 
FACTOR 1; CHAIN: A, B; 


LOW AFFINITY 
IMMUNOGLOBULIN 
GAMMA FC REGION 
CHAIN: A; 


JUNCTION ADHESION 
MOLECULE; CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN 
EPSILON RECEPTOR 
CHAIN: A; IG EPSILON 
CHAIN C REGION; 
CHAIN: B, D; 


Compound 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


RNA BINDING PROTEIN TAP 
(NFXl); RIBONUCLEOPROTEIN 
(RNP,RBD OR RRM) AND 
LEUCINE-RICH-REPEAT 2 (LRR) 


IMMUNE SYSTEM RECEPTOR 
BETA SANDWICH, 
IMMUNOGLOBULIN-LIKE, 
RECEPTOR 


CELL ADHESION 
IMMUNOGLOBULIN 
SUPERFAMILY, BETA-SANDWICH 
FOLD 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC(EPSILON) 
IGE-FC; IMMUNOGLOBULIN 
FOLD, GLYCOPROTEIN, 
RECEPTOR, IGE-BINDING 2 
PROTEIN, IGE ANTIBODY, IGE-FC 


PDB annotation 



WO 03/054152 



PCT/US02/39555 



1053 



ON 



2 B X 



03 



as n 
B 5 



6 

o 



o 



On 



Ol 

I — » 
OO 



OO 
CD 



p 
o 



o 
o 



o s 



o 

3 



I 

CD 
Oh 



© CO 
«> Q 




M S ^ 



CO 








o « 

O to 

s 

I 



Q > E 
•S K B 



89 

OO 

^ ^ 

w Q 

9* 



> 

CO 
OO 

O 

n 

a q 

oi 





a c r 
o to 2 

CO 

9 



oo « 

GO P 



2SS 



I 



do ffi 



> 

00 

O 

O 
i— < 

go 



53 Q 



WO 03/054152 



PCT/US02/39555 



1054 



1763 


1763 


1763 


1763 


1763 




S3 K G« 


ltnm 




lnct 


1 


Ei 




w 








to 


O 




CHAI 
NID 


O 






O 






STAR 
TAA 


Lh 
O 




o 

1 — ' 




OO 






LO 
? 




? 


Lie-: 


? 


4^ 

bo 
? 




Psi Bl 






OO 




t— ' 


0! 




en 


o 






0.58 


p 


o 




8 | 










o 




* if 


o 
o 




0.82 


O 


0.27 




PMF 
score 
















SEQFOL 
D score 
















MUSCLE PROTEIN TITIN 
MODULE M5 
(CONNECTIN) ITNM 3 
(NMR, MINIMIZED 
AVERAGE STRUCTURE) 


MUSCLE PROTEIN TITIN 
MODULE M5 
(CONNECTIN) ITNM 3 
(NMR, MINIMIZED 
AVERAGE STRUCTURE) 
ITNM 4 ITNM 58 


TITIN; CHAIN: NULL; 


INTERLEUKIN-1 BETA; 
CHAIN: A; TYPE 1 
INTERLEUKIN-1 
RECEPTOR; CHAIN: B; 


HEPARIN-BINDING 
GROWTH FACTOR 2; 
CHAIN: A, B, C, D; 
FIBROBLAST GROWTH 
FACTOR RECEPTOR 2; 
CHAIN: E, F, G, H; 




Compound 






MUSCLE PROTEIN CONNECTIN, 
NEXTM5; CELL ADHESION, 
GLYCOPROTEIN, 
TRANSMEMBRANE, REPEAT, 
BRAIN, 2 IMMUNOGLOBULIN 
FOLD, ALTERNATIVE SPLICING, 
SIGNAL, 3 MUSCLE PROTEIN 


COMPLEX 

(IMMUNOGLOBULIN/RECEPTOR) 
IMMUNOGLOBULIN FOLD, 
TRANSMEMBRANE, 
GLYCOPROTEIN, RECEPTOR, 2 
SIGNAL, COMPLEX 
(IMMUNOGLOBULIN/RECEPTOR) 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF2, HBGF- 
2, BASIC FIBROBLAST GROWTH 
FACTOR, FGFR2, KERATINOCYTE 
GROWTH FACTOR RECEPTOR; 
IMMUNOGLOBULIN LIKE 
DOMAIN, B-TREFOIL 


LIGAND, 2 GLYCOPROTEIN, LFA-1 
LIGAND, IMMUNOGLOBULIN 
FOLD, 3 TRANSMEMBRANE 
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SEQFOL 
D score 


RIBONUCLEASE 
INHIBITOR; CHAIN: 


RIBONUCLEASE 
INHIBITOR; CHAIN: 
NULL; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


1TNM 4 1TNM 58 [ 


Compound 


ACETYLATION RNASE 
INHIBITOR, 


ACETYLATION RNASE 
INHIBITOR, 

RIBONUCLEASE/ANGIOGENIN 
INHIBITOR ACETYLATION, 
LEUCINE-RICH REPEATS 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 




PDB annotation 



WO 03/054152 



PCT/US02/39555 



1056 




WO 03/054152 



PCT/US02/39555 



1057 




WO 03/054152 



PCT/US02/39555 



1058 



1768 


1768 


1768 


1768 


1764 


1764 


1764 


5! H & 


21iv 


2ibp 


lewk 


lewk 


9wga 


lqua 


i— * 


H * 

8 S 






> 


> 


> 


> 


> 


CHAI 
NID 


oo 


OO 




O 


O 
O 


I—* 
o\ 


Cm 

to 


STAR 
TAA 




s 


to 


bo 
vo 


OO 


-o 


u> 

Oh 




0.00084 


9.6e-08 


9.6e-20 


7e-42 


8.4e-15 


9.6e-32 


5.6e-28 


Psi Blast 


0.08 


-0.06 


0.39 


t 

o 
'to 
o 


0.00 


0.77 


0.87 


Verify 
score 


0.09 


0.06 


o 
o 


0.10 


p 

vo 


»— ' 

o 
o 


1.00 


PMF 
score 
















SEQFOL 
D score 


PERIPLASMIC BINDING 
PROTEIN 

LEUCINE(SLASH)*ISOLE 
UCINE(SLASH) SALINE- 
BINDING PROTEIN 2LIV 
4 (/LIVBPS) 2LIV 5 


PERIPLASMIC BINDING 
PROTEIN LEUCINE- 
BINDING PROTEIN 
(/LBP$) 2LBP4 


METABOTROPIC 
GLUTAMATE RECEPTOR 
SUBTYPE 1; CHAIN: A, B; 


METABOTROPIC 
GLUTAMATE RECEPTOR 
SUBTYPE 1; CHAIN: A, B; 


LECTIN (AGGLUTININ) 
WHEAT GERM 
AGGLUTININ 
(ISOLECTIN2)9WGA3 


ACUTOLYSIN-C; CHAIN: 
A; 


ACUTOLYSIN-C; CHAIN: 
A; 


Compound 






SIGNALING PROTEIN MGLURl; 
SIGNAL TRANSDUCTION, 
NEUROTRANSMITTER, CNS, 
NEURON 


SIGNALING PROTEIN MGLURl ; 
SIGNAL TRANSDUCTION, 
NEUROTRANSMITTER, CNS, 
NEURON 




TOXIN HEMORRHAGIN III 
METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 


TOXIN HEMORRHAGIN III 
METALLOPROTEASE, 
HEMORRHAGIC TOXIN, SNAKE 
VENOM PROTEINASE, 2 CRYSTAL 
STRUCTURE, AGKISTRODON 
ACUTUS 
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D score 


RABPHILIN 3-A; CHAIN: 
A; 


RABPHILIN 3-A; CHAIN: 
A; 


RABPHILIN 3-A; CHAIN: 
A; 


CALCIUM/PHOSPHOLIPI 
D BINDING PROTEIN 
SYNAPTOTAGMINI 
(FIRST C2 DOMAIN) 
(CALB) IRSY 3 


CALCIUM/PHOSPHOLIPI 
D BINDING PROTEIN 
SYNAPTOTAGMIN I 
(FIRST C2 DOMAIN) 
(CALB) IRSY 3 


CALCIUM/PHOSPHOLIPI 
D BINDING PROTEIN 
SYNAPTOTAGMINI 
(FIRST C2 DOMAIN) 
(CALB) IRSY 3 


CALCIUM/PHOSPHOLIPI 
D BINDING PROTEIN 
SYNAPTOTAGMINI 
(FIRST C2 DOMAIN) 
(CALB) IRSY 3 


D BINDING PROTEIN 
SYNAPTOTAGMINI 
(FIRST C2 DOMAIN) 
(CALB) IRSY 3 


Compound 


ENDOCYTOSIS/EXOCYTOSIS C2- 
DOMAINS, C2B-D0MAIN, 


ENDOCYTOSIS/EXOCYTOSIS C2- 
DOMAINS, C2B-DOMAIN, 
RABPHILIN, 

ENDOCYTOSIS/EXOCYTOSIS 


ENDOCYTOSIS/EXOCYTOSIS C2- 
DOMAINS, C2B-D0MAIN, 
RABPHILIN, 

ENDOCYTOSIS/EXOCYTOSIS 
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SEQFOL 
D score 


| N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


? 

| 

> 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; CHAIN: A; 


N-CADHERIN; INCI 3 


N-CADHERIN; INCI 3 


N-CADHERIN; INCI 3 


N-CADHERIN; INCI 3 


N-CADHERIN; INCI 3 


N-CADHERIN; INCG 3 


N-CADHERIN; INCG 3 


N-CADHERIN; INCG 3 


N-CADHERIN; INCG 3 




Compound 


| CELL ADHESION PROTEIN CELL 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL 
ADHESION PROTEIN 


CELL ADHESION PROTEIN CELL [ 
ADHESION PROTEIN 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCI 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CELL ADHESION PROTEIN 
CADHERIN INCG 13 


CADHERIN INCG 13 J 
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PMF 
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324.98 




129.54 


299.60 


118.41 




SEQFOL 
D score 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, A-2 CHAIN: A; 
BETA-2- 

MICROGLOBULIN; 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, A-2 CHAIN: A; 
BETA-2- 

MICROGLOBULIN; 
CHAIN: B; MELANOMA- 
ASSOCIATED ANTIGEN 
4; CHAIN: C; 


HLA CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN, A-2 CHAIN: A; 
BETA-2- 

MICROGLOBULIN; 
CHAIN: B; MELANOMA- 
ASSOCIATED ANTIGEN 
4; CHAIN: C; 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


HISTOCOMPATIBILITY 
ANTIGEN CLASS I 
HISTOCOMPATIBILITY 
ANTIGEN AW68.1 
(LEUCOCYTE IHSB 3 
ANTIGEN) IHSB 4 


B(ASTERISK)2705$ IHSA 
4 


Compound 


IMMUNE SYSTEM MAGE-4 
ANTIGEN; MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
HUMAN LEUKOCYTE ANTIGEN, 2 
MELANOMA-ASSOCIATED 


IMMUNE SYSTEM MAGE-4 
ANTIGEN; MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
HUMAN LEUKOCYTE ANTIGEN, 2 
MELANOMA-ASSOCIATED 
ANTIGEN 


IMMUNE SYSTEM MAGE-4 
ANTIGEN; MAJOR 
HISTOCOMPATIBILITY COMPLEX, 
HUMAN LEUKOCYTE ANTIGEN, 2 
MELANOMA-ASSOCIATED 
ANTIGEN 
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Psi Blast 
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0.47 
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0.52 
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Verify 
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0.10 


1.00 
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SEQFOL 
D score 


1 CAG-ALPHA; CHAIN: A, 


HIGH-AFFINITY 
BRANCHED-CHAIN 
AMINO ACID CHAIN: A 


HIGH-AFFINITY 
BRANCHED-CHAIN 
AMINO ACID CHAIN: A 


MALTOSE TRANSPORT 
PROTEIN MALK; CHAIN: 
1,2; 


YEAST INITIATION 
FACTOR 4A; CHAIN: A, 
B; 


HYPOTHETICAL ABC 
TRANSPORTER ATP- 
BINDING PROTEIN 
CHAIN: A; 


HYPOTHETICAL ABC 
TRANSPORTER ATP- 
BINDING PROTEIN 
CHAIN: A; 


GUANYLATE KINASE; 
CHAIN: A; 


CHROMOSOME 
SEGREGATION SMC 
PROTEIN; CHAIN: A, B, 
C, D, E, F; 




Compound 


1 HYDROLASE TRAFFIC ATPASE; 


TRANSPORT PROTEIN BETA- 
CORE DOMAIN, ABC SPECIFIC- 
BETA-STRAND DOMAIN ALPHA- 2 
HELIX DOMAIN 


TRANSPORT PROTEIN BETA- 
CORE DOMAIN, ABC SPECIFIC- 
BETA-STRAND DOMAIN ALPHA- 2 
HELIX DOMAIN 


SUGAR BINDING PROTEIN MALK; 
ATPASE, ACTIVE TRANSPORT, 
MALTOSE UPTAKE AND 
REGULATION 


TRANSLATION EUKARYOTIC 
INITIATION FACTOR 4A; IF4A, 
HELICASE, DEAD-BOX PROTEIN 


STRUCTURAL GENOMICS 
TRANSPORTER 


STRUCTURAL GENOMICS 
TRANSPORTER 


TRANSFERASE GUANYLATE 
KINASE, SUBSTRATE-INDUCED 
FIT, DOMAIN MOVEMENT, 2 GMP, 
ATP, SUBSTRATE SPECIFICITY, X- 
RAY DIFFRACTION 


CHROMOSOME SEGREGATION 
SMC, STRUCTURAL 
MAINTENANCE OF 
CHROMOSOMES, COILED COIL 


2 KINASE, TRANSFERASE 
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SEQFOL 
D score 


U2 RNA HAIRPIN IV; 
CHAIN: Q, R; U2 A'; 
CHAIN: A, C; U2 B"; 


U2RN/ 
CHAIN: 
CHAIN: 
CHAIN 


U2 RN/ 
CHAIN: 
CHAIN: 
CHAIN: 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


RIBONUCLEASE 
INHIBITOR; CHAIN: A, D; 
ANGIOGENIN; CHAIN: B, 
E; 


Compound 


« > P a 




to to a 

03 > 5 


to to a 

w > 2 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, \ 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 1 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNPjRIBONUCLEOPROTEIN 


COMPLEX 

(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 


COMPLEX 

(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 


COMPLEX 

(INHIBITOR/NUCLEASE) 
COMPLEX 

(INHIBITOR/NUCLEASE), 
COMPLEX (RI-ANG), HYDROLASE 
2 MOLECULAR RECOGNITION, 
EPITOPE MAPPING, LEUCINE- 
RICH 3 REPEATS 
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SEQFOL 
D score 


> g 


INTERNALIN B; CHAIN: 
A; 


U2RW 
CHAIN 
CHAIN 
CHAIN 


U2RN7 
CHAIN 
CHAIN 
CHAIN 


U2RM 
CHAIN 
CHAIN 
CHAIN 


U2RW 
CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: 
CHAIN: 
CHAIN: 


U2RN^ 
CHAIN: 
CHAIN: 
CHAIN: 


mmA 

CHAIN: 
CHAIN: 
CHAIN: 


CHAIN: 


Compound 


^ALIN B; CHAIN: 


w>pa 

to to « 
Gd > ^ 


I HAIRPIN IV; 
:Q,R;U2A'; 
:A,C;U2B"; 
:B,D; 


w > p w 
oops 

to to a 
Cd > ^ 


l HAIRPIN IV; 
:Q,R;U2A«; 
:A,C;U2B"; 
:B, D; 


l HAIRPIN IV; 
Q,R;U2A'; 

A, C; U2B"; 

B, D; 


.HAIRPIN IV; 
Q,R;U2A' ; 

A, C;U2B M ; 

B, D; 


.HAIRPIN IV; 
Q,R;U2A'; 

A, C; U2 B"; 

B, D; 


cd 
o 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLE0PR0TEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNP,RIBONUCLEOPROTEIN 


COMPLEX (NUCLEAR 
PROTEIN/RNA) COMPLEX 
(NUCLEAR PROTEIN/RNA), RNA, 
SNRNPjRIBONUCLEOPROTEIN 


SNRNPjRIBONUCLEOPROTEIN | 
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0.03 
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1.00 
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0.83 
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SEQFOL 
D score 


INTERNALINB; CHAIN: 
A; 


INTERNALINB; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALINB; CHAIN: 
A; 


INTERNALINB; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALIN B; CHAIN: 
A; 


INTERNALINB; CHAIN: 
A; 


INTERNALINB; CHAIN: 
A; 




Compound 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION LEUCINE RICH 
REPEAT, CALCIUM BINDING, 
CELL ADHESION 


CELL ADHESION I 
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SEQFOL 
D score 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


GTPASE-ACTIVATING 
PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 


SKP2; CHAIN: A, C, E, G, 
I, K,M,0; SKPl; CHAIN: 
B,D,F,H,J,L,N,P; 


TELOKIN; CHAIN: A 


Compound 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMIHEDRAL 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMIHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 


TRANSCRIPTION RNAIP; 
RANGAP; GTPASE-ACTIVATING 
PROTEIN FOR SPIl, GTPASE- 
ACTIVATING PROTEIN, GAP, 
RNAIP, RANGAP, LRR, LEUCINE- 
2 RICH REPEAT PROTEIN, 
TWINNING, HEMIHEDRAL 
TWINNING, 3 MEROHEDRAL 
TWINNING, MEROHEDRY 


LIGASE CYCLIN A/CDK2- 
ASSOCIATED PROTEIN P45; 
CYCLIN A/CDK2-ASSOCIATED 
PROTEIN P19; SKPl, SKP2, F-BOX, 
LRR, LEUCINE-RICH REPEAT, SCF, 
UBIQUITIN, 2 E3, UBIQUITIN 
PROTEIN LIGASE 


CONTRACTILE PROTEIN 
IMMUNOGLOBULIN FOLD, BETA 
BARREL 
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^ H EG 
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6e-42 


3e-35 


3e-23 


1.3e-28 


7.5e-24 


0.00012 




Psi Blast 


0.06 


0.43 


0.13 


0.07 


0.04 


0.40 




Verify 
score 


-0.20 


i 

o 
to 
o 


-0.20 


o 
to 
o 


-0.20 


-0.20 




PMF 
score 
















SEQFOL 
D score 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC 
FINGER PROTEIN; 
CHAIN: C, F, G; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B,C; 


QGSR ZINC FINGER 
PEPTIDE; CHAIN: A; 
DUPLEX 

OLIGONUCLEOTIDE 
BINDING SITE; CHAIN: 
B, C; 


ALPHAl BETAl 
INTEGRIN; CHAIN: A; 
ALPHAl BETAl 
INTEGRIN; CHAIN: B; 




Compound 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) 
ZINC FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 
2 CRYSTAL STRUCTURE, 
COMPLEX (ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


CELL ADHESION INTEGRIN, CELL 
ADHESION 


EXTRACELLULAR 2 MATRIX, 
CYTOSKELETON 
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Verify 
score 














PMF 
score 


| 85.15 


61.32 


71.86 


71.90 


89.76 




SEQFOL 
D score 


1 COMPLEMENT FACTOR 


CYCLIN-DEPENDENT 
KINASE 6; CHAIN: A; 
MULTIPLE TUMOR 
SUPPRESSOR; CHAIN: B; 


P19INK4D CDK4/6 
INHIBITOR; CHAIN: 
NULL; 


GA BINDING PROTEIN 
ALPHA; CHAIN: A; GA 
BINDING PROTEIN BETA 
1; CHAIN: B; DNA; 
CHAIN: D, E; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO- 
MAI; CHAIN: P; 




Compound 


| SERINE PROTEASE SERINE 


COMPLEX (KINASE/ANTI- 
ONCOGENE) CDK6; P16INK4A, 
MTSl; CYCLIN DEPENDENT 
KINASE, CYCLIN DEPENDENT 
KINASE INHIBITORY 2 PROTEIN, 
CDK, INK4, CELL CYCLE, 
MULTIPLE TUMOR SUPPRESSOR, 
3 MTSl, COMPLEX (KINASE/ANTI- 
ONCOGENE) HEADER 


TUMOR SUPPRESSOR TUMOR 
SUPPRESSOR, CDK4/6 INHIBITOR, 
ANKYRIN MOTIF 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) GABPALPHA; 
GABPBETAl; COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA), DNA- 
BINDING, 2 NUCLEAR PROTEIN, 
ETS DOMAIN, ANKYRIN 
REPEATS, TRANSCRIPTION 3 
FACTOR 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
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Psi Blast 




















Verify 
score 




















PMF 
score 


64.46 


99.87 


90.30 


77.08 


89.59 


79.05 


94.78 


78.10 




SEQFOL 
D score 


CYCLIN-DEPENDENT 
KINASE 6 INfflBITOR; 
CHAIN: A, B; 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.2L1) ($P*H7.0) 
1GCT3 


HYDROLASE (SERINE 
PROTEINASE) GAMMA- 
*CHYMOTRYPSIN *A 
(E.C.3.4.21.1) ($P*H7.0) 
1GCT3 


COAGULATION FACTOR 
XA-TRYPSIN CHIMERA; 
CHAIN: A; D-PHE-PRO- 
ARG- 

CHLOROMETHYLKETO 
NE (PPACK) WITH 
CHAIN: I; 


TRYPSIN; CHAIN: A; 


TRYPSIN; CHAIN: A; 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L 


BETA-ACROSIN HEAVY 
CHAIN; CHAIN: A; BETA- 
ACROSIN LIGHT CHAIN; 
CHAIN: L 


ACROSIN LIGHT CHAIN; 
CHAIN: L 


Compound 


CELL CYCLE INHIBITOR PI 8- 
INK4C(INK6); CELL CYCLE 
INHIBITOR, P18-INK4C(INK6), 






COMPLEX 

(PROTEASE/INHIBITOR) TRYPSIN, 
COAGULATION FACTOR XA, 
CHIMERA, PROTEASE, PPACK, 2 
CHLOROMETHYLKETONE, 
COMPLEX 

(PROTEASE/INHIBITOR) 


HYDROLASE SERINE PROTEASE, 1 
HYDROLASE 


HYDROLASE SERINE PROTEASE, 
HYDROLASE 


HYDROLASE ANTI-PARALLEL 
BETA-BARREL 


HYDROLASE ANTI-PARALLEL 
BETA-BARREL 
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SEQ ID 


Position 


Maximum score 


Average score 


916 


1-35 


0.978 


0.879 


918 


1-21 


0.953 


0.900 


921 


1-19 


0.957 


0.919 


923 


1-45 


0.961 


0.594 


924 


1-64 


0.994 


0.484 


925 


1-25 


0.936 


0.635 


926 


1-32 


0.997 


0.899 


927 


1-21 


0.991 


0.946 


928 


1-25 


0.929 


0.613 


929 


1-35 


0.850 


0.485 


930 


1-39 


0.861 


0.465 


931 


1-29 


0.906 


0.675 


932 


1-24 


0.905 


0.496 


933 


1-24 


0.993 


0.956 


934 


1-31 


0.937 


0.811 


935 


1-21 


0.989 


0.939 


936 


1-20 


0.973 


0.941 


937 


1-11 


0.800 


0.691 


938 


1-24 


0.912 


0.763 


939 


1-22 


0.992 


0.851 


940 


1-20 


0.975 


0.934 


941 


1-16 


0.978 


0.797 


942 


1-23 


0.966 


0.816 


943 


1-19 


0.905 


0.660 


944 


1-56 


0.966 


0.498 


945 ! 


1-40 


0.859 


0.533 


946 


1-20 


0.953 


0.751 


947 


1-31 


0.833 


0.502 


948 


1-39 


0.986 


0.641 


949 


1-26 


0.978 


0.932 


950 


1-67 


0.976 


0.450 


951 


1-20 


0.991 


0.708 


952 


1-36 


0.954 


0.637 


953 


1-16 


0.844 


0.480 


954 


1-32 


0.953 


0.521 


955 


1-58 


0.989 


0.500 


956 


1-25 


0.981 


0.896 


957 


1-28 


0.954 


0.836 


958 


1-19 


0.982 


0.943 


959 


1-55 


0.988 


0.465 


960 


1-33 


0.925 


0.630 


961 


1-40 


0.916 


0.477 


962 


1-13 


0.816 


0.645 


963 


1-17 


0.946 / ^ 


0.734 


964 


1-17 


0.985 * ^ 


0.930 


965 


1-51 


0.991 


0.500 


966 


1-49 


0.963 


0.508 


967 


1-51 


0.971 


0.493 


968 


1-15 


0.959 


0.731 


969 


1-13 


0.822 


0.537 


970 


1-16 


0.862 


0.496 


971 


1-31 


0.956 


0.674 


972 


1-32 


0.855 


0.601 


973 


1-15 


0.826 


0.532 


974 


1-57 


0.954 


0.648 
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SEQ ID 


Position 


Maximum score 


Average score 


975 


1-20 


0.976 


0.854 


976 


1-22 


0.873 


0.586 


977 


1-26 


0.985 


0.839 


978 


1-49 


0.963 


0.774 


979 


1-26 


0.843 


0.609 


980 


1-57 


0.966 


0.561 


981 


1-16 


0.912 


0.705 


982 


1-22 


0.962 


0.827 


983 


1-13 


0.803 


0.580 


984 


1-51 


0.932 


0.462 


985 


1-20 


0.982 


0.890 


986 


1-20 


0.977 


0.923 


987 


1-21 


0.977 


0.868 


988 


1-23 


0.838 


0.580 


989 


1-39 


0.905 


0.628 


990 


1-23 


0.989 


0.864 


991 


1-20 


0.884 


0.650 


992 


1-19 


0.985 


0.920 


993 


1-28 


0.887 


0.505 


994 


1-27 


0.955 


0.743 


995 


1-27 


0.992 


0.934 


996 


1-53 


0.953 


0.723 


997 


1-30 


0.905 


0.522 


998 


1-20 


0.954 


0.746 


999 


1-19 


0.821 


0.482 


1000 


1-20 


0.978 


0.953 


1001 


1-45 


0.890 


0.608 


1037 


1-31 


0.921 


0.630 


1038 


1-36 


0.972 


0.563 


1039 


1-39 


0.976 


0.551 


1040 


1-26 


0.937 


0.703 


1041 


1-74 


0.991 


0.543 


1042 


1-23 


0.945 


0.797 


1043 


1-16 


0.977 


0.506 


1044 


1-21 


0.967 


0.759 


1045 


1-22 


0.861 


0.539 


1046 


1-27 


0.934 


0.682 


1047 


1-18 


0.983 


0.962 


1048 


1-22 


0.827 


0.517 


1049 


1-34 


0.980 


0.703 


1050 


1-15 


0.987 


0.955 


1051 


1-28 


0.995 


0.945 


1052 


1-18 


0.995 


0.977 


1053 


1-25 


0.935 


0.739 


1054 


1-19 


0.976 


0.950 


1055 


1-49 


0.933 


0.538 


1056 


1-22 


0.918 


0.723 


1057 


1-25 


0.972 


0.902 


1058 


1-52 


0.981 


0.622 


1059 


1-75 


0.969 


0.541 


1060 


1-75 


0.979 


0.817 


1061 


1-22 


0.957 


0.756 


1062 


1-45 


0.978 


0.852 


1063 


1-29 


0.984 


0.954 


1064 


1-36 


0.994 


0.713 
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SEQ ID 


Position 


Maximum score 


Average score 


1065 


1-34 


0.914 


0.608 


1066 


1-25 


0.986 


0.952 


1067 


1-38 


0.990 


0.909 


1068 


1-54 


0.908 


0.565 


1069 


1-20 


0.870 


0.708 


1070 


1-27 


0.985 


0.856 


1071 


1-23 


0.966 


0.812 


1072 


1-54 


0.967 


0.524 


1160 


1-36 


0.948 


0.776 


1161 


1-26 


0.962 


0.783 


1162 


1-31 


0.919 


0.543 


1164 


1-26 


0.988 


0.886 


1166 


1-53 


0.998 


0.751 


1167 


1-20 


0.993 


0.968 


1168 


1-24 


0.987 


0.924 


1169 


1-21 


0.973 


0.760 


1170 


1-56 


0.962 


0.585 


1171 


1-39 


0.978 


0.768 


1172 


1-25 


0.969 


0.937 


1173 


1-79 


0.967 


0.612 


1174 


1-23 


0.990 


0.934 


1175 


1-23 


0.990 


0.932 


1176 


1-42 


0.977 


0.677 


1177 


1-22 


0.969 


0.897 


1178 


1-17 


0.869 


0.641 


1179 


1-45 


0.987 


0.656 


1180 


1-31 


0.956 


0.853 


1181 


1-64 


0.996 


0.558 


1182 


1-50 


0.986 


0.566 


1183 


1-38 


0.987 


0.864 


1184 


1-28 


0.987 


0.924 


1185 


1-48 


0.911 


0.557 


1186 


1-58 


0.914 


0.462 


1187 


1-20 


0.992 


0.794 


1188 


1-24 


0.894 


0.608 


1189 


1-19 


0.986 


0.967 


1190 


1-23 


0.965 


0.817 


1191 


1-23 


0.965 


0.817 


1192 


1-53 


0.884 


0.504 


1193 


1-31 


0.982 


0.857 


1194 


1-18 


0.805 


0.583 


1195 


1-20 


0.950 


0.843 


1196 


1-25 


0.803 


0.575 


1197 


1-26 


0.869 


0.476 


1198 


1-23 


0.982 


0.866 


1199 


1-21 


0.959 


0.915 


1200 


1-30 


0.983 


0.652 


1201 


1-21 


0.932 


0.654 


1202 


1-21 


0.932 


0.654 


1203 


1-20 


0.932 


0.697 


1204 


1-26 


0.946 


0.775 


1205 


1-20 


0.986 


0.948 


1206 


1-64 


0.865 


0.475 


1207 


1-44 


0.881 


0.488 


1208 


1-19 


0.991 


0.973 
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SEQ ID 


Position 


Maximum score 


Average score 


1209 


1-38 


0.938 


0.459 


1210 


1-17 


0.954 


0.741 


1211 


1-20 


0.916 


0.802 


1212 


1-24 


0.961 


0.879 


1213 


1-20 


0.938 


0.716 


1214 


1-19 


0.959 


0.752 


1215 


1-55 


0.946 


0.682 


1216 


1-23 


0.965 


0.883 


1217 


1-23 


0.965 


0.883 


1218 


1-21 


0.909 


0.693 


1219 


1-17 


0.971 


0.886 


1220 


1-22 


0.833 


0.607 


1221 


1-16 


0.948 


0.836 


1222 


1-27 


0.968 


0.788 


1223 


1-23 


0.853 


0.523 


1224 


1-23 


0.842 


0.544 


1225 


1-19 


0.937 


0.713 


1226 


1-47 


0.947 


0.637 


1227 


1-21 


0.984 


0.933 


1228 


1-34 


0.969 


0.882 


1229 


1-15 


0.894 


0.745 


1230 


1-26 


0.966 


0.760 


1231 


1-28 


0.883 


0.474 


1232 


1-32 


0.939 


0.693 


1233 


1-28 


0.915 


0.655 


1234 [ 


1-28 


0.990 


0.959 


1235 


1-39 


0.926 


0.696 


1236 


1-17 


0.882 


0.558 


1237 


1-16 


0.967 


0.933 


1238 


1-19 


0.942 


0.799 


1239 


1-19 


0.910 


0.773 


1240 


1-19 


0.947 


0.805 


1241 


1-20 


0.976 


0.875 


1242 


1-20 


0.976 


0.875 


1243 


1-15 


0.973 


0.942 


1244 


1-16 


0.965 


0.911 


1245 


1-17 


0.985 


0.956 


1246 


1-21 


0.883 


0.620 


1247 


1-13 


0.874 


0.527 


1248 


1-21 


0.945 


0.852 


1249 


1-24 


0.903 


0.540 


1250 


1-26 


0.959 


0.786 


1251 


1-30 


0.934 


0.585 


1252 


1-27 


0.962 


0.740 


1253 


1-25 


0.842 


0.507 


1254 


1-19 


0.965 


0.797 


1255 


1-20 


0.970 


0.804 


1256 


1-19 


0.956 


0.791 


1257 


1-21 


0.977 


0.868 


1258 


1-23 


0.932 


0.770 


1259 


1-31 


0.960 


0.635 


1260 


1-24 


0.954 


0.803 


1261 


1-43 


0.949 


0.467 


1262 


1-28 


0.951 


0.667 


1263 


1-28 


0.963 


0.844 



WO 03/054152 



PCT/US02/39555 



Table 6 
1109 



SEQ ID 
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Maximum score 


Average score 


1264 


1-60 


0.971 


0.472 


1265 


1-17 


0.950 


0.878 


1266 


1-22 


0.984 


0.852 


1267 


1-32 


0.939 


0.646 


1268 


1-50 


0.939 


0.491 


1269 


1-42 


0.841 


0.500 


1270 


1-23 


0.923 


0.602 


1271 


1-20 


0.881 


0.514 


1272 


1-20 


0.881 


0.514 


1273 


1-21 


0.839 


0.518 


1274 


1-33 


0.955 


0.547 


1275 


1-26 


0.884 


0.488 


1276 


1-27 


0.867 


0.555 


1277 


1-18 


0.822 


0.521 


1278 


1-20 


0.814 


0.562 


1279 


1-24 


0.921 


0.756 


1280 


1-50 


0.845 


0.490 


1332 


1-27 


0.968 


0.912 


1335 


1-38 / 


0.937 


0.565 


1338 


1-23 


0.993 


0.924 


1339 


1-28 


0.995 


0.945 


1340 


1-28 


0.995 


0.945 


1345 


1-73 


0.997 


0.744 


1346 


1-24 


0.985 


0.690 


1347 


1-16 


0.983 


0.952 


1348 


1-16 


0.949 


0.869 


1349 


1-16 


0.996 


0.959 


1354 


1-49 


0.989 


0.784 


1355 


1-77 


0.963 


0.578 


1356 


1-24 


0.974 


0.882 


1357 


1-38 


0.979 


0.810 


1359 


1-24 


0.981 


0.938 


1361 


1-24 


0.998 


0.972 


1362 


1-22 


0.963 


0.884 


1364 


1-33 


0.997 


0.864 


1366 


1-20 


0.890 


0.648 


1367 


1-23 


0.996 


0.932 


1370 


1-38 


0.983 


0.894 


1371 


1-19 


0.960 


0.521 


1372 


1-18 


0.986 


0.589 


1373 


1-20 


0.929 


0.768 


1375 


1-17 


0.977 


0.921 


1377 


1-19 


0.985 


0.935 


1379 


1-40 


0.961 


0.521 


1382 


1-21 


0.985 


0.922 


1384 


1-51 


0.976 


0.781 


1385 


1-29 


0.984 


0.954 


1386 


1-19 


0.981 


0.939 


1387 


1-17 


0.979 


0.938 


1388 


1-33 


0.953 


0.828 


1390 


1-42 


0.974 


0.616 


1394 


1-28 


0.989 


0.912 


1396 


1-33 


0.979 


0.698 


1397 


1-24 


0.927 


0.726 


1399 


1-46 


0.991 


0.727 
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Average score 


1400 


1-44 


0.900 


0.657 


1401 


1-17 


0.996 


0.872 


1402 


1-39 


0.989 


0.941 


1403 


1-21 


0.975 


0.604 


1404 


1-16 


0.957 


0.870 


1405 


1-20 


0.990 


0.875 


1407 


1-71 


0.981 


0.566 


1408 


1-18 


0.985 


0.928 


1410 


1-14 


0.941 


0.861 


1412 


1-32 


0.935 


0.549 


1419 


1-20 


0.989 


0.960 


1420 


1-50 


0.987 


0.583 


1421 


1-27 


0.977 


0.849 


1422 


1-45 


0.981 


0.817 


1423 


1-28 


0.988 


0.923 


1430 


1-73 


0.997 


0.731 


1432 


1-23 


0.992 


0.956 


1433 


1-23 


0.992 


0.956 


1437 


1-60 


0.976 


0.737 


1442 


1-27 


0.975 


0.937 


1467 


1-57 


0.931 


0.514 


1468 


1-15 


0.810 


0.544 


1469 


1-41 


0.926 


0.451 


1470 


1-31 


0.914 


0.618 


1471 


1-33 


0.889 


0.542 


1472 


1-20 


0.941 


0.754 


1473 


1-55 


0.929 


0.541 


1474 


1-36 


0.951 


0.625 


1475 


1-18 


0.897 


0.608 


1476 


1-22 


0.980 


0.917 


1477 


1-73 


0.959 


0.450 


1478 


1-18 


0.946 


0.791 


1479 


1-22 


0.951 


0.739 


1480 


1-73 


0.968 


0.456 


1481 


1-25 


0.879 


0.674 


1482 


1-20 


0.946 


0.806 


1483 


1-24 


0.992 


0.825 


1484 


1-17 


0.836 


0.620 


1485 


1-53 


0.917 


0.508 


1486 


1-19 


0.872 


0.724 


1487 


1-58 


0.954 


0.452 


1488 


1-19 


0.966 


0.927 


1489 


1-36 


0.979 


0.862 


1490 


1-25 


0.976 


0.803 


1491 


1-26 


0.898 


0.658 


1492 


1-22 


0.948 


0.844 


1493 


1-22 


0.894 


0.673 


1494 


1-52 


0.937 


0.660 


1495 


1-37 


0.930 


0.614 


1496 


1-25 


0.979 


0.914 


1497 


1-22 


0.972 


0.929 


1498 


1-21 


0.978 


0.816 


1499 


1-24 


0.839 


0.692 


1500 


1-15 


0.823 


0.560 


1501 


1-16 


0.960 


0.845 
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Average score 


1502 


1-29 


0.940 


0.683 


1503 


1-12 


0.832 


0.523 


1504 


1-36 


0.945 


0.749 


1505 


1-33 


0.843 


0.488 


1506 


1-23 


0.901 


0.626 


1507 


1-20 


0.834 


0.505 


1508 


1-25 


0.919 


0.690 


1509 


1-44 


0.885 


0.513 


1510 


1-23 


0.974 


0.825 


1511 


1-13 


0.941 


0.847 


1512 


1-42 


0.885 


0.513 


1513 


1-18 


0.903 


0.592 


1514 


1-30 


0.948 


0.816 


1515 


1-16 


0.944 


0.818 


1516 


1-23 


0.849 


0.631 


1517 


1-13 


0.8 13 


0.610 


1518 


1-20 


0.962 


0.921 


1519 


1-23 


0.823 


0.495 


1520 


1-2 


0.893 


0.458 


1521 


1-17 


0.902 


0.762 


1522 


1-20 


0.952 _j 


0.772 


1523 


1-23 


0.991 


0.925 


1524 


1-15 


0.858 


0.647 


1525 


1-15 


0.979 


0.961 


1526 


1-23 


0.926 


0.669 


1527 


1-23 


0.942 


0.686 


1528 


1-22 


0.967 


0.923 


1529 


1-22 


0.913 


0.519 


1530 


1-27 


0.867 


0.508 


1531 


1-15 


0.807 


0.631 


1532 


1-39 


0.907 


0.475 


1533 


1-24 


0.823 


0.503 


1534 


1-16 


0.892 


0.668 


1535 


1-16 


0.860 


0.607 


1536 


1-28 


0.920 


0.686 


1537 


1-20 


0.801 


0.618 


1538 


1-28 


0.887 


0.643 


1539 


1-21 


0.927 


0.568 


1540 


1-22 


0.997 


0.951 


1541 


1-22 


0.985 


0.827 


1542 


1-24 


0.962 


0.871 


1543 


1-22 


0.972 


0.932 


1544 


1-23 


0.908 


0.678 


1545 


1-27 


0.899 


0.579 


1546 


1-16 


0.988 


0.955 


1547 


1-16 


0.873 


0.622 


1548 


1-28 


0.840 


0.493 


1549 


1-15 


0.889 


0.689 


1550 


1-21 


0.889 


0.633 


1551 


1-44 


0.981 


0.617 


1552 


1-18 


0.881 


0.649 


1553 


1-16 


0.805 


0.631 


1554 


1-22 


0.965 


0.807 


1555 


1-26 


0.928 


0.744 


1556 


1-37 


0.934 


0.475 
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Position 
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Average score 


1557 


1-31 


0.935 


0.631 


1558 


1-20 


0.990 


0.868 


1559 


1-13 


0.853 


0.530 


1560 


1-21 


0.833 


0.604 


1561 


1-25 


0.922 


0.642 


1562 


1-24 


0.960 


0.683 


1563 


1-31 


0.908 


0.677 


1564 


1-22 


0.980 


0.937 


1565 


1-17 


0.857 


0.523 


1566 


1-15 


0.878 


0.521 


1567 


1-19 


0.861 


0.660 


1568 


1-18 


0.963 


0.812 


1569 


1-32 


0.965 


0.659 


1570 


1-21 


0.919 j 


0.457 


1571 


1-33 


0.931 


0.799 


1572 


1-11 


0.962 


0.656 


1573 


1-22 


0.969 


0.862 


1574 


1-18 


0.907 


0.508 


1575 


1-19 


0.952 


0.751 


1576 


1-16 


0.949 


0.807 


1577 


1-20 


0.984 


0.912 


1578 


1-47 


0.956 


0.583 


1579 


1-30 


0.965 


0.639 


1580 


1-33 


0.921 


0.650 


1581 


1-21 


0.996 


0.915 


1582 


1-18 


0.992 


0.969 


1583 


1-19 


0.820 


0.492 


1584 


1-33 


0.895 


0.469 


1585 


1-37 


0.987 


0.701 


1586 


1-19 


0.882 


0.606 


1587 


1-19 


0.896 


0.652 


1588 


1-11 


0.896 


0.803 


1589 


1-22 


0.977 


0.769 


1590 


1-21 


0.916 


0.742 


1591 


1-15 


0.909 


0.760 


1592 


1-13 


0.925 


0.649 


1593 


1-26 


0.963 


0.782 


1594 


1-13 


0.840 


0.530 


1595 


1-35 


0.984 


0.835 


1596 


1-17 


0.813 


0.642 


1597 


1-20 


0.822 


0.535 


1598 


1-22 


0.996 


0.965 


1599 


1-17 


0.866 


0.603 


1600 


1-28 


0.983 


0.873 


1601 


1-36 


0.963 


0.813 


1602 


1-16 


0.942 


0.750 


1603 


1-16 


0.827 


0.545 


1604 


1-19 


0.990 


0.935 


1605 


1-28 


0.910 


0.588 


1606 


1-28 


0.990 


0.772 


1607 


1-30 


0.944 


0.835 


1608 


1-36 


0.963 


0.842 


1609 


1-18 


0.937 


0.839 


1610 


1-25 


0.966 


0.910 


1611 


1-33 


0.963 


0.577 
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1612 


1-27 


0.985 


0.755 


1613 


1-20 


0.970 


0.737 


1614 


1-11 


0.872 


0.706 


1615 


1-19 


0.932 


0.541 


1616 


1-14 


0.987 


0.968 


1617 


1-52 


0.948 


0.570 


1618 


1-31 


0.842 


0.473 


1619 


1-27 


0.985 


0.927 


1620 


1-17 


0.822 


0.503 


1621 


1-61 


0.982 


0.770 


1622 


1-20 


0.876 


0.549 


1623 


1-19 


0.872 


0.552 


1624 


1-16 


0.928 


0.878 


1625 


1-20 


0.914 


0.495 


1626 


1-17 


0.874 


0.653 


1627 


1-16 


0.969 


0.917 


1628 


1-21 


0.886 


0.568 


1629 


1-45 


0.931 


0.487 


1630 


1-25 


0.937 


0.771 


1631 


1-23 


0.811 


0.514 


1632 


1-21 


0.923 


0.724 


1633 


1-17 


0.833 


0.543 


1634 


1-24 


0.979 


0.927 


1635 


1-20 


0.972 


0.802 


1636 


1-22 


0.932 


0.577 


1637 


1-18 


0.894 


0.643 


1638 


1-29 


0.952 


0.789 


1639 


1-38 


0.918 


0.617 


1640 


1-26 


0.848 


0.707 


1641 


1-23 


0.989 


0.864 


1642 


1-67 


0.995 


0.642 


1643 


1-18 


0.970 


0.681 


1644 


1-16 


0.982 


0.941 


1645 


1-24 


0.971 


0.908 


1646 


1-25 


0.969 


0.794 


1647 


1-40 


0.863 


0.548 


1648 


1-20 


0.984 


0.822 


1649 


1-15 


0.803 


0.568 


1650 


1-57 


0.986 


0.558 


1651 { 


1-26 


0.975 


0.763 


1652 


1-19 


0.996 


0.953 


1653 


1-34 


0.994 


0.961 


1654 


1-37 


0.973 


0.869 


1655 


1-21 


0.901 


0.716 


1656 


1-21 


0.990 


0.932 


1657 


1-42 


0.959 


0.693 


1658 


1-18 


0.928 


0.723 


1659 


1-16 


0.914 


0.756 


1660 


1-16 


0.841 


0.615 


1661 


1-70 


0.932 


0.496 


1662 


1-25 


0.835 


0.567 


1663 


1-39 


0.958 


0.854 


1664 


1-19 


0.977 


0.790 


1665 


1-19 


0.962 


0.794 


1666 


1-22 


0.991 


0.954 
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Average score 


1667 


1-39 


0.842 


0.525 


1668 


1-24 


0.907 


0.722 


1669 


1-52 


0.950 


0.667 


1670 


1-17 


0.940 


0.583 


1671 


1-26 


0.926 


0.642 


1672 


1-20 


0.964 


0.902 


1673 


1-22 


0.947 


0.788 


1674 


1-27 


0.889 


0.642 


1675 


1-16 


0.985 j 


0.900 


1676 


1-42 


0.869 


0.503 


1677 


1-73 


0.976 


0.512 


1678 


1-21 


0.951 


0.816 


1679 


1-25 


0.946 


0.771 


1680 


1-18 


0.975 


0.954 


1681 


1-19 


0.926 


0.729 


1682 


1-16 


0.975 


0.891 


1683 


1-25 


0.967 


0.881 


1684 


1-30 


0.877 


0.492 


1685 


1-23 


0.990 


0.905 


1686 


1-16 


0.985 


0.964 


1687 


1-20 


0.969 


0.803 


1688 


1-23 


0.965 


0.829 


1689 


1-20 


0.894 


0.502 


1690 


1-23 


0.884 


0.582 


1691 


1-19 


0.972 


0.705 


1692 


1-16 


0.887 


0.670 


1693 


1-21 


0.961 


0.741 


1694 


1-16 


0.913 


0.452 


1695 


1-25 


0.946 


0.829 


1696 


1-44 


0.985 


0.510 


1697 


1-32 


0.920 


0.468 


1698 


1-38 


0.886 


0.519 


1699 


1-15 


0.978 


0.904 


1700 


1-20 


0.930 


0.666 


1701 


1-25 


0.988 


0.903 


1702 


1-26 


0.901 


0.507 


1703 


1-38 


0.971 


0.793 


1704 


1-20 


0.977 


0.888 


1705 


1-30 


0.822 


0.550 


1706 


1-22 


0.844 


0.472 


1707 


1-16 


0.923 


0.641 


1708 


1-21 


0.906 


0.624 


1709 


1-22 


0.977 


0.880 


1710 


1-19 


0.946 


0.798 


1711 


1-24 


0.928 


0.557 


1712 


1-27 


0.969 


0.901 


1713 


1-30 


0.926 


0.755 


1714 


1-20 


0.975 


0.913 


1715 


1-23 


0.927 


0.606 


1716 


1-18 


0.924 


0.715 


1717 


1-31 


0.826 


0.618 


1718 


1-33 


0.977 


0.811 


1719 


1-26 


0.934 


0.636 


1720 


1-38 


0.960 


0.483 


1721 


1-36 


0.962 


0.537 
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Average score 


1722 


1-70 


0.993 


0.825 


1723 


1-33 


0.928 


0.642 


1724 


1-38 


0.967 


0.807 


1725 


1-18 


0.988 


0.897 


1726 


1-16 


0.974 


0.949 


1727 


1-42 


0.951 


0.468 


1728 


1-29 


0.904 


0.595 


1729 


1-41 


0.938 


0.600 


1730 


1-22 


0.829 


0.545 


1731 


1-52 


0.990 


0.466 


1732 


1-26 


0.970 


0.909 


1733 


1-24 


0.971 


0.865 


1734 


1-39 


0.996 


0.867 


1735 


1-45 


0.961 


0.593 


1736 


1-22 


0.991 


0.950 


1737 


1-25 


0.978 


0.623 


1738 


1-18 


0.991 


0.947 


1739 


1-47 


0.926 


0.486 


1740 


1-20 


0.985 


0.959 


1741 


1-32 


0.995 


0.972 


1742 


1-17 


0.958 


0.656 


1743 


1-62 


0.964 


0.638 


1744 


1-18 


0.892 


0.717 


1745 


1-27 


0.992 


0.934 


1746 


1-21 


0.989 


0.962 


1747 


1-67 


0.991 


0.685 


1748 


1-22 


0.898 


0.623 


1749 


1-15 


0.981 


0.945 


1750 


1-19 


0.971 


0.786 


1751 


1-36 


0.983 


0.664 


1752 


1-30 


0.988 


0.830 


1753 


1-28 


0.953 


0.866 


1754 


1-64 


0.967 


0.707 


1755 


1-56 


0.967 


0.554 


1756 


1-35 


0.945 


0.589 


1757 


1-19 


0.922 


0.737 


1758 


1-30 


0.990 


0.739 


1759 


1-25 


0.951 


0.864 


1760 


1-55 


0.990 


0.483 


1761 


1-29 


0.982 


0.792 


1762 


1-27 


0.936 


0.759 


1763 


1-29 


0.960 


0.805 


1764 


1-18 


0.978 


0.906 


1765 


1-22 


0.962 


0.920 


1766 


1-21 


0.997 


0.955 


1767 


1-25 


0.942 


0.847 


1768 


1-32 


0.989 


0.830 


1769 


1-25 


0.880 


0.630 


1770 


1-26 


0.872 


0.514 


1771 


1-29 


0.990 


0.648 


1772 


1-38 


0.901 


0.670 


1773 


1-25 


0.992 


0.955 


1774 


1-22 


0.839 


0.574 


1775 


1-33 


0.970 


0.887 


1776 


1-43 


0.987 


0.716 
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SEQ ID 


Position 


Maximum score 


Average score 


1777 


1-29 


0.831 


0.548 


1778 


1-42 


0.992 


0.654 


1779 


1-20 


0.936 


0.580 


1780 


1-24 


0.985 


0.941 


1781 


1-48 


0.996 


0.882 


1782 


1-58 


0.973 


0.595 


1783 


1-22 


0.899 


0.602 


1784 


1-46 


0.958 


0.672 


1785 


1-10 


0.881 


0.707 


1786 


1-22 


0.818 


0.577 


1787 


1-53 


0.903 


0.465 


1788 


1-77 


0.976 


0.483 


1789 


1-72 


0.973 


0.475 


1790 


1-79 


0.972 


0.734 


1791 


1-71 


0.958 


0.519 


1792 


1-25 


0.996 


0.966 


1793 


1-75 


0.975 


0.571 


1794 


1-43 


0.853 


0.587 


1795 


1-17 


0.948 


0.513 


1796 


1-26 


0.981 


0.938 


1797 


1-33 


0.986 


0.753 


1798 


1-21 


0.866 


0.619 


1799 


1-31 


0.965 


0.885 


1800 


1-51 


0.989 


0.477 


1801 


1-65 


0.977 


0.706 


1802 


1-21 


0.994 


0.891 


1803 


1-24 


0.986 


0.956 


1804 


1-25 


0.880 


0.602 


1805 


1-24 


0.991 


0.954 


1806 


1-24 


0.914 


0.580 


1807 


1-71 


0.956 


0.528 


1808 


1-38 


0.901 


0.670 


1809 


1-43 


0.975 


0.765 


1810 


1-14 


0.814 


0.590 


1811 


1-14 


0.909 


0.490 


1812 


1-54 


0.990 


0.503 


1813 


1-76 


0.999 


0.992 


1814 


1-61 


0.912 


0.473 


1815 


1-20 


0.975 


0.944 


1816 


1-45 


0.983 


0.465 


1817 


1-24 


0.936 


0.769 


1818 


1-62 


0.997 


0.748 


1819 


1-74 


0.986 


0.539 


1820 


1-59 


0.959 


0.559 


1821 


1-22 


0.860 


0.539 


1822 


1-30 


0.996 


0.822 
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SEQ ID 


Chromsomal location 


1 


13ql2-ql4 


2 


9q33-q34 


3 


2 


4 


13ql2-ql4 


5 


13ql2-ql4 


6 


13ql2-ql4 


7 


13ql2-ql4 


8 


13ql2-ql4 


9 


13ql2-ql4 


10 


17qll.2 


11 


19ql3.3-ql3.4 


12 


2 


13 


13ql2-ql4 


14 


8q 


15 


8q 


16 


Ilql3.1-ql3.3 


17 


20ql2 


18 


20ql2 


19 


20ql2 


20 


4 


21 


4 


22 


9q33-q34 


23 


9q33-q34 


24 


9q33-q34 


25 


9q33-q34 


26 


9q33-q34 


27 


9q33-q34 


28 


9q33~q34 


29 


9q33-q34 


30 


9q33-q34 


31 


9q33-q34 


32 


2 


33 


2 


34 


13ql2-ql4 


35 


13ql2-ql4 


36 


13ql2-ql4 


37 


13ql2-ql4 


38 


13ql2-ql4 


39 


13ql2-ql4 


40 


13ql2-ql4 


41 


13ql2-ql4 


42 


13ql2-ql4 


43 


13ql2-ql4 


44 


13ql2-ql4 


45 


13ql2-ql4 


46 


13ql2-ql4 


47 


13ql2-ql4 


48 


13ql2-ql4 


49 


13ql2-ql4 


50 


13ql2-ql4 


51 


13ql2-ql4 


52 


13ql2-ql4 


53 


13ql2-ql4 


54 


13ql2-ql4 


55 


13ql2-ql4 
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1 1 

1 ] 


1 o 
la 


SEQ ID 


Chromsomal location 


56 


13ql2-ql4 


57 


13ql2-ql4 


58 


13ql2-ql4 


59 


13ql2-ql4 


60 


13ql2-ql4 


61 


13ql2-ql4 


62 


13ql2-ql4 


63 


13ql2-ql4 


64 


13ql2-ql4 


65 


13ql2-ql4 


66 


13ql2-ql4 


67 


13ql2-ql4 


68 


13ql2-ql4 


69 


13ql2-ql4 


70 


13ql2-ql4 


71 


13ql2-ql4 


72 


13ql2-ql4 


73 


13ql2-ql4 


74 


13ql2-ql4 


75 


13ql2-ql4 


76 


13ql2-ql4 


77 


2 


78 


7 


79 


2 


80 


2 


81 


19ql3.3-ql3.4 


82 


19ql3.3-ql3.4 


83 


19ql3.3-ql3.4 


84 


19ql3.3-ql3.4 


85 


19ql3.3~ql3.4 


86 


19ql3.3-ql3.4 


87 


19ql3.3-ql3.4 


88 


19ql3.3-ql3.4 


89 


19ql3.3-ql3.4 


90 


19ql3.3-ql3.4 


91 


9q33-q34 


92 


13ql2-ql4 


93 


20ql2 


94 


7 


95 


4 


96 


4 


97 


4 


98 


9q33-q34 


99 


9q33-q34 


100 


9q33-q34 


101 


9q33-q34 


102 


9q33-q34 


103 


9q33-q34 


104 


9q33-q34 


105 


2 


106 


13ql2-ql4 


107 


13ql2-ql4 


108 


13ql2-ql4 


109 


13ql2-ql4 


110 


13ql2-ql4 
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11 


.19 


SEQ ID 


Chromsomal location 


111 


13ql2-ql4 


112 


13ql2-ql4 


113 


13ql2-ql4 


114 


13ql2-ql4 


115 


13ql2-ql4 


116 


13ql2-ql4 


117 


13ql2-ql4 


118 


13ql2-ql4 


119 


13ql2-ql4 


120 


2 


121 


2 


122 


2 


123 


19ql3.3-ql3.4 


124 


19ql3.3-ql3.4 


125 


19ql3.3-ql3.4 


126 


13ql2-ql4 


127 


13ql2-ql4 


128 


20ql2 


129 


4 


130 


4 


131 


9q33-q34 


132 


2 


133 


2 


134 


2 


135 


13ql2-ql4 


136 


13ql2-ql4 


137 


13ql2-ql4 


138 


13ql2-ql4 


139 


13ql2-ql4 


140 


17 


141 


6p21.3 


142 


13ql2-ql4 


143 


13ql2~ql4 


144 


13ql2-ql4 


145 


13ql2-ql4 


146 


13ql2-ql4 


147 


13ql2-ql4 


148 


13ql2-ql4 


149 


13ql2-ql4 


150 


13ql2-ql4 


151 


13ql2-ql4 


152 


13ql2-ql4 


153 


13ql2-ql4 


154 


13ql2-ql4 


155 


13ql2-ql4 


156 


2 


157 


2 


158 


19ql3.3-ql3.4 


159 


19ql3.3-ql3.4 


160 


19ql3.3-ql3.4 


161 


19ql3.3-ql3.4 


162 


4 


163 


4 


164 


9q33-q34 


165 


13ql2-ql4 
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1 1 

1. 


20 


SEQ ID 


Chromsomal location 


166 


13ql2-ql4 


167 


13ql2-ql4 


168 


13ql2-ql4 


169 


22ql2. 1-12.3 


170 


13ql2-ql4 


171 


13ql2-ql4 


172 


13ql2-ql4 


173 


13ql2-ql4 


174 


13ql2-ql4 


175 


13ql2-ql4 


176 


8q 


177 


20ql2 


178 


20ql2 


179 


4 


180 


4 


181 


4 


182 


9q33-q34 


183 


9q33-q34 


184 


9q33-q34 


185 


3 


186 


9q33-q34 


187 


9q33-q34 


188 


9q33-q34 


189 


9q33-q34 


190 


9q33-q34 


191 


9q33-q34 


192 


9q33-q34 


193 


2 


194 


2 


195 


13ql2-ql4 


196 


13ql2-ql4 


197 


13ql2-ql4 


198 


13ql2-ql4 


199 


13ql2-ql4 


200 


13ql2-ql4 


201 


13ql2-ql4 


202 


13ql2-ql4 


203 


13ql2-ql4 


204 


13ql2-ql4 


205 


13ql2-ql4 


206 


13ql2-ql4 


207 


13ql2-ql4 


208 


13ql2-ql4 


209 


13ql2-ql4 


210 


13ql2-ql4 


211 


13ql2-ql4 


212 


13ql2-ql4 


213 


13ql2-ql4 


214 


19ql3 


215 


13ql2-ql4 


216 


13ql2-ql4 


217 


13ql2-ql4 


218 


13ql2-ql4 


219 


13ql2-ql4 


220 


13ql2-ql4 
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1 


21 


SEQ ID 


Chromsomal location 


221 


13ql2-ql4 


222 


13ql2-ql4 


223 


13ql2-ql4 


224 


13ql2-ql4 


225 


13ql2-ql4 


226 


13ql2-ql4 


227 


13ql2-ql4 


228 


13ql2«ql4 


229 


13ql2-ql4 


230 


13ql2-ql4 


231 


13ql2-ql4 


232 


13ql2-ql4 


233 


13ql2-ql4 


234 


13ql2-ql4 


235 


13ql2-ql4 


236 


13ql2-ql4 


237 


13ql2-ql4 


238 


5 | 


239 


5 


240 


19ql3.3-ql3.4 


241 


19ql3.3-ql3.4 


242 


19ql3.3-ql3.4 


243 


19ql3.3-ql3.4 


244 


19ql3.3-ql3.4 


245 


19ql3.3-ql3.4 


246 


19ql3.3-ql3.4 


247 


19ql3.3-ql3.4 


248 


19ql3.3-ql3.4 


250 


6 


256 


lp32 


259 


Ilql2-ql3.1 


268 


18pll.22-pll.21 


269 


7q 11.23 


270 


3 


272 


9q34.3 


278 


9q32-q33 


279 


14qlL2 


280 


14qll.2 


283 


2 


285 


13 


298 


20ql2-ql3.1 


303 


1 


305 


19 


306 


19 | 


310 


4pl6-pl5 


311 


6 


316 


3p22-p21.3 


326 


Ip31.1-p22.3 


335 


llpter-pl5.5 


336 


3 


350 


llq 


353 


3 


355 


8p21-pl2 


356 


Iq23-q25.1 


357 


Iq23-q25.1 
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11 


22 


SEQ ID 


Chromsomal location 


359 


20ql3.2-ql3.33 


360 


13 


361 


13 


362 


22ql3.1 


369 


6 


373 


2p24.3-p24.1 


374 


2p24.3-p24.1 


383 


llql3 


384 


19ql3.2 


388 


17qll-q21.1 


398 


8p23.2-p23.1 


403 


19 


407 


11 


421 


19ql3.1 


427 


20ql2-ql3.12 


428 


17 


429 


17 


431 


llql3 


432 


7q35 


433 


X 


435 


6q22.2-22.33 


437 


19 


439 


16q24.3 


445 


19ql3 


446 


12 


452 


14 


453 


3 


456 


14 


457 


10 


458 


10 


459 


3p2Ll-ql3.13 


460 


lp36 


470 


14q31 


475 


4qll-ql3 


476 


6 


477 


15q21-q22 


478 


15q24-q26 


493 


19 


496 


6q24. 1-24.3 


497 


15 


501 


llql3 


505 


lq21 


506 


6p21.3 


507 


3 


510 


lq21.3 


512 


14 


513 


9 


515 


1 


521 


7q32-34 


522 


7q32-34 


532 


2 


545 


q21.2-21.3 


547 


11 


551 


llq22 


556 


10, 
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1] 


.23 


SEQ ID 


Chromsomal location 


557 


9p24. 1-24.3, 


559 


16 


560 


18 


561 


9 


562 


22qll.22-12.3. 


565 


6q22.3-23.3 


566 


6q22. 1-22.33, 


567 


1 


568 


11 


569 


6q21 


571 


18, 


572 


15, 


573 


5 


574 


5 


575 


6p21. 1-21.2. 


576 


8, 


577 


10 


578 


1 


579 


12 


580 


1 


581 


6ql6.3-22.1, 


582 


3 


583 


10 


585 


3 


587 


1 


588 


5 


590 


4 


592 


lq24. 


593 


1 


594 


8 


595 


17, 


596 


11 


597 


5 


598 


16 


599 


11 


600 


13, 


601 


llq 


603 


2 


604 


6q25.2-26, 


606 


1 


607 


15 


608 


9 


609 


lq24 


610 


1 


611 


4 


612 


17, 


613 


8 


614 


17, 


615 


15 


616 


6, 


617 


15 


618 


2 


619 


3 


620 


17 


621 


8 



WO 03/054152 



PCT/US02/39555 



Table 7 



1 1 

1] 


.24 


SEQ ID 


Chromsomal location 


622 


14 


623 


6q24. 1-25.2, 


624 


2 


625 


9 


626 


8 


627 


12q24 


629 


20, 


630 


NA 


631 


2 


632 


2 


633 


Xpll.21-11.22 


634 


17 


636 


2 


637 


22 


638 


1 


639 


3 


640 


19, 


641 


1 


642 


7 


643 


5 


645 


17 


647 


1, 


649 


6 


650 


7 


651 


5 


652 


19, 


653 


20qll. 1-11.23, 


654 


1 


655 


3 


656 


8 


657 


16 


659 


9 


660 


18 


662 


6p21.3, 


665 


X 


666 


5 


667 


1 


669 


2 


670 


13, 


671 


1 


672 


15 


673 


9, 


674 


10 


675 


9, 


676 


7 


678 


16 


679 


X 


680 


11, 


681 


19, 


683 


8 


684 


4 


685 


6ql6.1-16.3, 


686 


10, 


687 


14 


688 


4 
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1 1 

1 


25 


SEQ ID 


Chromsomal location 


689 


6q26-27, 


690 


lp35.1-36.13, 


692 


6q26-27 s 


695 


12 


700 


9, 


701 


5 


702 


2 


703 


8 


704 


10, 


705 


19 


706 


14 


708 


19, 


709 


17 


711 


16 


712 


16 


713 


5 


714 


16 


715 


19 


716 


16 


717 


12 


718 


15 


719 


15 


720 


14 


721 


16 


722 


5 


723 


6, 


724 


12q 


725 


1 


726 


8 


727 


15, 


728 


2 


729 


7 


730 


17, 


731 


21q22.1, 


732 


20pl2 


733 


11 


734 


1 


735 


15 


736 


2 


737 


lp36.21-36.33. 


738 


1 


739 


6q21-22.31, 


740 


15 


742 


12 


743 


UL 


744 


13q33.3-34, 


745 


17, 


746 


12 


747 


2 


749 


16 


752 


17 


754 


16 


755 


20ql2-13.12. 


756 


19 


757 


14 
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11 


26 


SEQ ID 


Chromsomal location 


758 


8, 


759 


6, 


760 


22 


761 


19 


763 


13, 


764 


lq32.1-41 


765 


8, 


767 


20 


769 


llq, 


770 


1 


771 


6p22.3-24.1, 


772 


9p23-24.3, 


773 


17 


774 


19 


775 


3 


776 


3 


777 


22ql2 


779 


lp36.13-36.31, 


780 


17 


781 


7 


783 


4 


784 


6, 


785 


1 


786 


21q22.1, 


787 


6 1 


788 


1 1 


789 


15 


790 


20, 


791 


9 


793 


10, 


794 


16 


796 


18 


797 


18 


798 


10 


799 


8, 


800 


16 


801 


UNK 


803 


9, 


807 


20ql3.1. 


808 


10, 


809 


3 


810 


UNK 


815 


20. 


818 


Iq25.1-31.1, 


820 


10 


821 


20pl2.2-13, 


822 


20 


823 


4, 


824 


1 


826 


5 


827 


8 


829 


7 


830 


lp21. 3-22.3, I 


832 


1 


833 


12q | 
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11 


27 


SEQ ID 


Chromsomal location 


834 


6 


836 


6, 


837 


10 


838 


9pl 1.2-21, 


839 


11 


840 


5 


841 


17 


842 


llq 


844 


1 


845 


llq, 


846 


9 


848 


13, 


849 


12 


852 


1 


853 


16 


854 


15 | 


856 


6q25.3-26 


857 


6, 


858 


1 


859 


11 


860 


5p, 


861 


11 


862 


6, 


864 


Iq22-q23, 


865 


12p 


866 


lq32.2-41 


867 


2 


868 


13q33.1-34, 


869 


6p21.3, 


870 


3 


872 


1 


873 


3 


874 


19, 


876 


9, 


877 


16pl3 


878 


5 


879 


16 


880 


5 


881 


16 


882 


15 


883 


15 


884 


9 


885 


8pll.2, 


886 


17 


887 


5 


888 


6, 


889 


13, 


890 


11 


891 


11 


892 


Xq22.3-23, 


894 


9, 


895 


10 


896 


15 


897 


11 


898 


h 
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1] 


28 


SEQ ID 


Chromsomal location 


899 


6q25.2-26, 


900 


12 


901 


1 


902 


19 


903 


19, 


904 


4 


905 


5 


906 


5 


908 


19 


909 


4, 


910 


19, 


911 


15 
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1129 



SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1037 


1 


100-115:1952 


1038 


1 


732-749:2593 


1039 


1 


181-201:2410 


1040 


2 


53-68:1828 132-149:2533 


1041 


2 


53-69:2959 121-140:2878 


1042 


1 


536-560:2906 


1043 


1 


63-82:2545 


1044 


5 


86-102:1766 189-205:2721 229-244:1878 273-300:1714 
385-405:1946 


1045 


1 


629-645:2430 


1046 


1 


59-75:2149 


1047 


1 


306-332:2773 


1048 


1 


118-136:2329 


1049 


2 


98-113:2861 220-243:2391 


1050 


1 


151-169:2618 


1051 


2 


94-110:2524 124-146:2138 


1052 


1 


73-87:2180 


1053 


1 


206-226:2584 


1054 


1 


402-419:2096 


1055 


1 


343-361:1953 


1056 


1 


132-154:2199 


1057 


1 


590-613:2402 


1058 


2 


89-105:1748 155-173:2433 


1059 


1 


201-222:2190 


1060 


4 


254-277:2256 317-332:1771 442-460:2005 530-544:2110 


1061 


2 


169-186:1866 239-259:2042 


1062 


1 


63-77:1794 


1063 


1 


227-248:3456 


1064 


1 


133-148:2558 


1065 


3 


435-453:1849 505-526:2495 697-712:2057 


1066 


1 


317-340:2214 


1067 


1 


173-192:2637 


1068 


1 


63-79:1933 


1069 


1 


124-146:3384 


1070 


3 


82-102:2213 115-135:1769 160-185:2317 


1071 


1 


312-329:2354 


1072 


2 


116-131:3056 188-209:2254 


1073 


6 


48-71:1708 174-196:2300 237-254:1918 359-378:1887 
413-435:1864 501-518:2625 


1074 


6 


136-159:1708 262-284:2300 325-342:1918 447-466:1887 
501-523:1864 589-606:2625 


1075 


1 


352-376:2946 


1076 


3 


106-125:2854 226-241:1973 277-300:2759 


1077 


4 


85-105:2047 208-225:1907 309-330:2122 454-471:2461 


1079 


1 


60-75:2189 


1081 


3 


192-214:1705 236-259:1933 436-453:2349 


1082 


1 


459-477:1896 


1083 


1 


144-159:3028 


1084 


1 


144-159:3028 


1085 


1 


436-455:2525 


1086 


1 


705-724:2525 


1093 


2 


93-108:2014 249-264:2324 


1094 


5 


145-165:2633 316-331:2180 399-412:1770 481-496:2328 
541-560:2589 



WO 03/054152 PCT/US02/39555 



Table 8 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position m £>i^i<i in Lyyji ana its iivi jrrea score 


1095 


1 


*7i m.iocl 

73-92:1951 


1097 


1 


85-103:2195 


1099 


1 


1 /T A 1T/C.1AQC. 

ioO-1 /o:2Uoj 


1100 


1 


256-272:2085 


1101 


1 


210-226:20o5 


1102 


1 


63-94:3259 


1103 


2 


184-201:2183 245-262.1812 


1104 


1 


276-295:3080 


1106 


1 


1 Ti"\ 1 A/I 1 /''I /A 

179-194:2620 


1110 


1 


111-129:2519 


1112 


1 


169-190:2680 


1113 


3 


f i OO 11/11 1 O /I . 1 *71 C 1 10 1 1 fl.T7^C 

61-82:2141 99-134:1715 119-139:2765 


1114 


1 


65-85:1713 


1115 


3 


123-13/:2o44 190-218:20 /4 :5UU-d14.Zooo 


1116 


2 


98-123:2354 2 /0-295 :214o 


1117 


1 


T~l ni . 1 "7A1 

//-yz:i/yi j 


1118 


1 


68-88:2672 


1119 


1 


i ci /i i C/n.omn 

1524-1547:2939 


1121 


1 


AC 1 11.1ACO 

95-113:2958 


1123 


2 


ai i at.1 An 1 no.T7/TA 

92-107:1923 162-178:2760 


1124 


1 


ni r\/i .. 1 ooc 

71-94:1835 


1125 


1 


nA /iAi.1111 

379-403:3221 


1127 


1 


152-182:1795 


1129 


3 


in o 1 *"7 i,li<-7 110 TO.I 1/^ 1 /H A A ^/T.TCOn 

201-217:2437 338-353:1761 449-466:2589 


1130 


2 


99-114:1754 108-130:2731 


1131 


1 


1 177-1193:3038 


1132 


1 


/-^ /- 111 
95-111:2301 


1133 


2 


205-227:1715 307-322:1735 


1134 


1 


308-330:2431 


1136 


5 


92-107:1734 298-311:2063 363-378:1720 382-399:1988 






453-471:2040 


1137 


2 


Cf i~l C 1 A O 1 1 1^7 1 /lO.II^A 

56-75:2481 127-148:2269 


1138 


1 


no i .c 1 1 on 

228-251:1822 


1139 


4 


c\i~< i ■< r -t r\/\l 1 T7 1 r\ A 1 rt 1 A OOA AACOA/T'2 AOO 1 AAO.OAT7 

97-115:1903 177-194:1919 889-905:2063 98o-10Uo:2U2/ 


1141 


1 


223-242:2971 


1143 


3 


O^.H^A ITT 1 A/1.1070 T7 1 1O0,A10(C 

63-86:2169 177-194:1878 271-2oo:21oo 


1144 


2 


1 1 / lO/C.nAA 1*7A 1 Ayl.lClA 

116-136:2390 179-194:2530 


1146 


2 


s~ s~ oi 1TA1 1 1 A 1 1/C. 1 7f C 

66-82:2701 110-126:1/55 


1147 


2 


T/i ia/t.icoa 1 ia Kz:,i nro 

74-106:2580 139-156:1958 


1148 


1 


C11 C A A ,1/ZA A 

522-544:2644 


1149 


2 


OI AT .1 A1 /I 1AA 1 1 r,lT7r 

83-97:2024 200-216:2275 


1150 


1 


1AA 11<C.1T7C. 

200-21 0:22/ J _j 


1151 


1 


r»i 1 An.irOO 

92-109:2588 


1152 


2 


1 /if 1 ^"A- 100/I 1 1 n 0/)^.i 0A1 

145-169:1834 317-346:1891 


1155 


4 


s~ a 01 rtAjn 110 ni i a c f\ ^ nA. 1 on A or CCO, 1 AAA 

64-83:2948 218-232:2016 452-480:1829 535-553:1999 


1156 


1 


11 "1 11A.1C1/1 

311-330:2524 


1 1 C7 
1 ID / 


i 
i 


/o-i i i.zjy / 


1159 


1 


163-180:2270 


1330 


1 


264-282:2674 


1331 


1 


105-128:2620 


1332 


1 


141-164:3091 


1333 


1 


394-409:1914 


1334 


1 


390-410:1947 


1335 


1 


67-82:1917 



WO 03/054152 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: ana its 1M rrea bcore 


1336 


1 


301-317:3356 


1337 


1 


301-317:3356 


1338 


1 


427-443:1894 


1339 


2 


103-119:2524 133-155:2138 


1340 


2 


94-110:2524 124-146:2138 


1341 


2 


61-78:1768 212-233:2126 


1342 


1 


139-160:2023 


1343 


1 


331-353:1902 


1344 


1 


99-117:2054 


1345 


1 


52-67:1997 


1346 


1 


67-80:2411 


1347 


1 


367-384:2888 


1348 


1 


457-476:2898 


1349 


2 


202-217:1923 272-288:2760 


1350 


5 ! 


266-284:2563 328-346:2525 390-408:2516 452-470:2456 






514-532:2395 


1351 


1 


94-116:2763 


1352 




45-62:2164 71-85:1838 


1353 


1 


482-504:3155 


1354 




1-17:2284 75-90:1824 145-160:2079 


1355 


1 


12-28:1796 


1356 


1 


332-356:3221 


1357 


1 


9-30:2498 


1358 




202-215:2016 256-281:1796 283-312:1900 


1359 


1 


267-284:3050 


1360 




229-245:2437 316-333:1885 429-446:2589 J 


1361 


1 


1-17:2914 


1362 


1 


4-21:2014 


1363 


1 


962-980:1762 


1364 


1 


12-31:2460 


1365 




38-54:3201 


1366 


1 


103-119:2300 


1367 


1 


1-15:2225 


1368 


1 


48-64:1711 


1369 


1 


48-64:1711 


1370 


1 


21-36:2592 


1371 




290-307:2420 419-433:2394 


1372 


1 


66-81:2345 


1373 


1 


3-18:1914 


1374 


1 


110-124:2032 


1375 


1 


3-18:1862 _^ 


1376 




37-56:2540 102-127:1708 194-209:2784 434-453:2036 


1377 


1 


1-20:2264 


1378 




259-274:2102 330-358:2448 416-434:2702 


1379 


1 


24-43:2348 


1380 


1 


196-211:1704 


1381 




iDy-l /o: looj zJo-zou.zoyu jvd-d ly.zDto 


1382 




262-288:2789 


1383 




60-89:2283 119-140:1980 


1384 




21-41:2833 


1385 




227-248:3456 


1386 




3-19:2081 


1387 




162-180:3181 


1388 




12-27:2098 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1389 


1 


487-503:2039 


1390 


2 


95-111:2234 155-173:2180 


1391 


2 


87-106:2547 186-206:1903 


1392 


3 


142-162:2149 169-184:1733 265-283:2036 


1393 


3 


83-110:1977 142-159:1729 238-254:2361 


1394 


1 


12-30:2458 


1395 


1 


61-76:2746 


1396 


1 ! 


14-31:1963 


1397 


1 


1060-1079:2742 


1398 


1 


220-244:2016 


1399 


1 


17-36:2664 


1400 


1 


21-41:1851 


1401 


1 


37-69:2282 


1402 


1 


24-60:1926 


1403 


1 


55-75:2438 


1404 


1 


237-252:2493 


1405 


1 


1-20:1866 


1406 


1 


172-190:1781 


1407 


1 


90-111:2881 


1408 


1 


6-23:2294 


1409 


1 


477-499:2659 


1410 


1 


231-249:2797 


1411 




64-84:1993 95-110:1932 149-167:2173 


1412 




166-182:1937 216-239:1819 326-341:1779 


1413 


1 


154-169:2224 


1414 


1 


241-254:1760 


1415 


2 


71-97:1818 234-254:2475 


1416 


1 


143-164:2551 


1417 


2 


104-122:1815 147-162:2031 


1418 


2 


121-135:2273 209-234:2186 


1419 


2 


8-27:2032 82-102:1765 


1420 


1 


26-47:2557 


1421 


3 


18-34:1798 220-243:1742 301-318:1915 


1422 


1 


27-42:3053 


1423 


1 


6-21:1974 


1424 


1 


345-360:2050 


1425 


1 


44-65:2195 


1426 


1 


763-779:1916 


1427 


1 


448-469:2686 


1428 


1 


597-614:2205 


1429 


1 


194-210:2157 


1430 


2 


22-44:2983 52-67:1999 


1431 


2 


22-39:1830 91-106:2145 


1432 


1 


6-26:2422 


1433 


1 


6-26:2422 


1434 


1 


72-89:1942 


1435 


2 


128-146:1770 337-352:3007 


1436 


1 


88-105:1803 


1437 


1 


1013-1035:2346 


1438 


4 


574-589:1938 650-668:2275 720-735:1944 793-810:1729 


1439 


4 


655-670:1938 731-749:2275 801-816:1944 874-891:1729 


1440 


1 


65-81:2339 


1441 


1 


131-151:1772 


1442 


2 


5-22:1872 248-273:2467 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1443 


3 


179-194:23 18 387-403:3014 442-465: 1 871 


1444 


2 


58-74:2684 238-264:2576 


1720 


1 


89-107:1832 


1721 


1 


225-240:2165 


1722 




13-29:2422 49-63:2158 


1723 


1 


129-144:1704 


1724 


1 


12-28:1900 


1725 




1-22:1840 143-158:1938 181-198:2860 247-265:2284 


1726 


1 


1-15:1790 


1727 




100-122:2447 199-217:2147 


1728 


1 


12-28:1802 


1729 


1 


21-37:1706 


1730 


1 


328-343:2384 


1731 




33-60 :2908 1 85-2 1 1 :2596 237-25 8: 1818 


1732 


1 


279-309:1795 


1733 


1 


6-26:2040 


1734 


1 


14-34:1939 


1735 


1 


17-38:1933 


1736 




10-25:2484 95-110:1787 


1737 




5-21:1862 109-125:2714 


1738 


1 


1-21:1881 


1739 


1 


393-414:2402 


1740 


1 


3-21:2189 


1741 




15-33:3158 109-123:2786 364-380:2232 


1742 


1 


564-580:1725 


1743 




40-60:2740 169-189:3006 


1744 


1 


214-228:2582 


1745 


1 


3-19:2099 


1746 




4-22:2753 247-263:2558 


1747 


1 


43-60:2160 


1748 


1 


195-217:2199 


1749 


4 


304-319:1707 427-447:1983 547-562:2389 657-683:2065 


1750 


5 


2-17:2136 753-768:2335 810-831:1715 906-928:1710 






1009-1031:2357 


1751 


1 


337-352:2864 


1752 


1 


1-24:3347 


1753 


1 


621-644:2150 


1754 


1 


88-106:2133 i 


1755 




67-89:2584 130-148:1837 239-261:1922 334-349:2621 


1756 


1 


20-35:1702 


1757 


1 


471-488:1852 


1758 




8-25:2374 247-265:1721 


1759 


1 


10-26:2268 


1760 




37-55:2350 79-102:1867 


1761 


1 


3-16:1751 


1762 




10-29:2708 158-183:2332 413-430:1706 514-532:2104 


1763 




C1 1 f f 0.0001 

531-558:2381 


1764 




1-15:2095 


1765 




3-20:1765 


1766 




3-27:2376 


1767 




11-39:1868 


1768 




150-169:1896 


1769 




91-112:2655 


1770 




347-362:1792 
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SEQ ID 


Number of 


For each Transmembrane Domain, its Transmembrane 


NO: 


Transmembrane Domains 


Domain Position in SEQ ID NO: and its TM Pred Score 


1771 


1 


717-737:3080 


1772 


1 


9-24:1787 


1773 


2 


1-16:1977 85-110:2971 


1774 


3 


492-510:2266 517-532:1752 675-696:1808 


1775 


3 


16-33:2461 94-113:2485 221-247:2232 


1776 


1 


24-39:2149 


1777 


1 


108-123:2281 


1778 


2 


23-42:3401 179-201:3489 


1779 


1 


296-311:1881 


1780 


1 


280-303:2645 


1781 


1 


22-37:2170 


1782 


1 


313-332:1963 


1783 


7 


243-261:2338 288-311:1992 401-429:1705 502-519:1819 






556-568:1888 597-614:1780 675-690:2102 


1784 


1 


1433-1457:3108 


1785 


1 


390-403:1855 


1786 


2 


103-122:1737 738-756:2220 


1787 


2 


116-131:2359 173-193:1746 


1788 


1 


128-144:2443 


1789 


1 


73-97:1982 


1790 


1 


103-124:2547 


1791 


1 


53-69:2536 


1792 


3 


84-99:2680 1083-1098:1729 1147-1167:2267 


1793 


1 


59-74:2644 


1794 


2 


24-39:2073 69-92:1922 


1795 


1 


59-78:2490 


1796 


2 


H0l-ll2l:2566 1224-1244:2000 


1797 


3 


54-73:1789 113-131:1860 182-195:2408 


1798 


2 


76-92:1943 135-155:3247 


1799 


1 


141-159:3106 


1800 


1 


248-266:1883 


1801 


2 


413-430:1851 496-521:2037 


1802 


1 


782-811:1739 


1803 


3 


85-108:1832 117-138:1746 166-187:2659 


1804 


1 


183-200:1880 


1805 


1 


549-568:1822 


1806 


2 


121-142:2091 191-210:2580 


1807 


3 


52-67:2292 112-127:1756 215-232:2516 


1808 


1 


99-115:1824 


1809 


1 


24-41:1826 


1810 


3 


710-738:2388 769-792:1779 797-815:1896 


1811 


1 


71-93:2069 


1812 


1 


97-117:1891 


1813 


3 | 


12-27:2868 38-54:1801 59-89:3042 


1814 


1 


404-422:2347 


1815 


1 


1-19:2068 


1 O 1 ZT 

1816 


2 


62-84:2012 118-134:1970 


1817 


1 


68-83:2632 


1818 


2 


3-22:2491 47-61:1754 


1819 


1 


1187-1209:3678 


1820 


6 


4-29:1727 87-110:3204 95-129:1851 165-190:2822 214- 






230:1867 245-261:1911 


1821 


1 


324-342:2300 


1822 


1 


4-24:1911 
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Table 9 
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SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


1 


912 


1823 


2479 


784_2167 


2 


913 


1824 


2480 


790_12732 


3 


914 


1825 


2481 


787 7952 


4 


915 


1826 


2482 


787 7952 


5 


916 








6 


917 


1827 


2483 


787 5660 


7 


918 


1828 


2484 


790_6350 


8 


919 


1829 


2485 


790 18138 


9 


920 


1830 


2486 


784 2658 


10 


921 


1831 


2487 


788„11684 


11 


922 








12 


923 


1832 


2488 


789 1765 


13 


924 


1833 


2489 


784 6512 


14 


925 


1834 


2490 


789 4745 


15 


926 


1835 


2491 


790 14655 


16 


927 


1836 


2492 


791 2272 


17 


928 


1837 


2493 


791 1348 


18 


929 


1838 


2494 


790 23 


19 


930 


1839 


2495 


784 5039 


20 


931 


1840 


2496 


787 8518 


21 


932 


1841 


2497 


784 5125 


22 


933 








23 


934 








24 


935 


1842 


2498 


790 26893 


25 


936 


1843 


2499 


785 3384 


26 


937 


1844 


2500 


791 2238 


27 


938 


1845 


2501 


784 7881 


28 


939 








29 


940 








30 


941 


1846 


2502 


785_197 


31 


942 


1847 


2503 


785_836 


32 


943 


1848 


2504 


784_8990 


33 


944 


1849 


2505 


792 6041 


34 


945 


1850 


2506 


792 873 


35 


946 








36 


947 


1851 


2507 


784_4339 


37 


948 








38 


949 


1852 


2508 


792_5188 


39 


950 








40 


951 


1853 


2509 


784 6463 


41 


952 


1854 


2510 


784 2235 


42 


953 j 


1855 


2511 


784_2849 


43 


954 


1856 


2512 


785 206 


44 


955 


1857 


2513 


787 6368 


45 


956 


1858 


2514 


788 5838 


46 


957 


1859 


2515 


784 7012 


47 


958 








48 


959 


1860 


2516 


785 835 


49 


960 


1861 


2517 


784 7305 


50 


961 








51 


962 


1862 


2518 


784 8815 
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Table 9 
1136 



SEQ ID NO: of 

full-length 

nucleotide 

sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 

INO. ollvl^ 1U INU.) 


CO 

52 


A/TO 

963 


Io03 


OC 1 A 

25 iy 


OQA 71QC 

/o4 /iyj 


CO 

53 


964 


1864 


ocoo 
2520 


OOO /COOQ 

/O/ 0/Z5 


C A 

54 


A/CC 

yo5 








cc 


yoo 


loco 


I^O 1 

Z5Z 1 


000 c/;aq 
/o / Doyo 


C/C 

5o 


yo 1 


1 0££ 
loOO 


ZoZZ 


OA A IOCOO 

/yu izj / 0 


CO* 

57 


yoo 


loo/ 


ZDZ5 


/O^ 04oJ 


c o 
58 


O^O 

969 


looo 


Z5Z4 


/o^- loyo 


59 


AHA 

9/0 


iooy 


o<o< 
Z5Z5 


OAO 1 1 QA 

/yz 1 iy4 


/"A 

60 


AO 1 

971 


1 OOA 

1870 


oco^c 
Z520 


no a 01 co 

/o4 yij/ 


1 

61 


AOO 

9/2 








^:o 

62 


AOO 

973 


1 OO 1 

18/1 


ocoo 
Z5Z / 


OO/I 1/1AQ 

/o4 14Uo 


£"0 

63 


AO A 

974 


1 ooo 

1872 


ocoo 
Z5Zo 


no a ocoo 
/o4 ZDUo 


£Z A 

64 


AOC 

975 


1 ooo 

1873 


OCO A 

Z5zy 


nQA 10/IO 
/o4 104y 


^c 

65 


An/; 

976 








66 


977 








67 


AOO 

978 








/TO 

6c 


AOA 

979 








^a 

69 


AOA 

980 








70 


AO 1 

981 


1 OO/I 

18/4 


OCO A 

Z530 


000 

/of 01Z1 


O 1 

71 


noo 

982 


1 ooc 
18/5 


OCO 1 

2531 


OOC OOOO 

/o5 Zy\)5 


oo 

72 


AOO 

yo3 


1876 


ocoo 

Z53Z 


non 1A1AO 

/o / iuiy/ 


oo 

73 


AO A 

984 








74 


AOC 

985 


1 ooo 

187/ 


ocoo 
2533 


707 1 AOO A 

/o/ 1UZZ4 


75 


AO/T 

986 


1 ooo 

1878 


OCO A 

2534 


000 cone 
/oy_5o0o 


nc 
76 


AOO 

y© / 








oo 
/ / 


AO O 

yoo 


1 OOA 

lo /y 


Z555 


noA <o°7 
/o4 OZo / 


oo 
/o 


AOA 

y©y 


1 ooo 
looU 


2550 


ooc coo 
/oD DoZ 


OO 

/y 


AAA 

yyo 








OA 

80 


OA1 

yy i 








0 1 


yyz 


iool 


ZDo / 


OA A 1 OQCO 

/yu izyjz 


oo 

oZ 


AOO 

yy3 








oo 
85 


A A A 

yy4 


1 QQO 

looZ 


OCO Q 

Z55o 


OAA 1 OQQO 

/yu 1000/ 


O A 

84 


AAC 

995 


1 ooo 
loo3 


OC1G 

Z53y 


OC/1 7QO< 

/o4 /yzD 


85 


996 








oc 
86 


AAO 

997 








oo 

87 


AAO 

998 


loo4 


OC/tO 

2540 


OAO 1 1 Ol A 

/yu iiuio 


o o 
88 


AAA 

999 


looD 


OC/1 1 

2541 


000 1 OOOI 

/yu 1/5/1 


o a 

89 


1 AAA 
1000 








A A 

90 


1 AA 1 
1001 








a i 
91 


1 AAO 

1002 


1 oc< 
looo 


OC/IO 

Z54Z 


oo/i 1 c^ro 
/o4 1D0U 


92 


1 AAO 

1003 


1 ooo 
loo / 


OC/IO 

Z543 


ob/i 1 o/^o 
/o4 1Z00 


no 

93 


1 AA/I 

1004 


1 ceo 
looo 


OC A A 


OAA O 1 O O £Z 

790 21336 


A A 

94 


1 AAC 

1005 


1 ooo 

iooy 


OC /I c 

2545 


OO A OA 1 O 

784 7912 




1UUO 








96 


1007 


1890 


2546 


784 8528 


97 


1008 


1891 


2547 


784 8105 


98 


1009 


1892 


2548 


787 5314 


99 


1010 


1893 


2549 


784 7469 


100 


1011 


1894 


2550 


790 11802 


101 


1012 


1895 


2551 


787 5599 


102 


1013 


1896 


2552 


784 5534 
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Table 9 
1137 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No, SEQ ID NO.) * 


103 


1014 








104 


1015 


1897 


2553 


791 3043 


105 


1016 


1898 


2554 


784 8916 


106 


1017 


1899 


2555 


784 1271 


107 


1018 








108 


1019 








109 


1020 


1900 


2556 


784_6527 


110 


1021 








111 


1022 


1901 


2557 


790 14183 


112 


1023 


1902 


2558 


791 1570 


113 


1024 


1903 


2559 


787 2154 


114 


1025 


1904 


2560 


787 2849 


115 


1026 








116 


1027 








117 


1028 








118 


1029 


1905 


2561 


790 23824 


119 


1030 


1906 


2562 


787 10284 


120 


1031 


1907 


2563 


790 13062 


121 


1032 








122 


1033 








123 


1034 








124 


1035 


1908 


2564 


787_3576 


125 


1036 








126 


1037 








127 


1038 


1909 


2565 


789 2490 


128 


1039 


1910 


2566 


784 2340 


129 


1040 


1911 


2567 


784 3449 


130 


1041 


1912 


2568 


787 9834 


131 


1042 


1913 


2569 


787_7563 


132 


1043 








133 


1044 


1914 


2570 


784 969 


134 


1045 


1915 


2571 


784_9071 


135 


1046 


1916 


2572 


785 1259 


136 


1047 


1917 


2573 


790^19506 


137 


1048 


1918 


2574 


784_3978 


138 


1049 


1919 


2575 


784 3848 


139 


1050 


1920 


2576 


789 6174 


140 


1051 


1921 


2577 


785 1465 


141 


1052 


1922 


2578 


787 7763 


142 


1053 


1923 


2579 


790 25889 


143 


1054 








144 


1055 


1924 


2580 


784^4674 


145 


1056 


1925 


2581 


787 8960 


146 


1057 


1926 


2582 


787 2258 


147 


1058 


1927 


2583 


787 2290 


148 


1059 


1928 


2584 


784 8266 


149 


1060 


1929 | 


2585 


784 1397 


150 


1061 


1930 


2586 


784_8164 


151 


1062 


1931 


2587 


785 1003 


152 


1063 








153 


1064 
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Table 9 
1138 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


154 


1065 


1932 


2588 


784_2498 


155 


1066 


1933 


2589 


787 5189 


156 


1067 








157 


1068 


1934 


2590 


787 10359 


158 


1069 








159 


1070 








160 


1071 


1935 


2591 


790 17261 


161 


1072 


1936 


2592 


784 9629 


162 


1073 


1937 


2593 


784 3405 


163 


1074 


1938 


2594 


784 3405 


164 


1075 


1939 


2595 


790 4043 


165 


1076 


1940 


2596 


787 3900 


166 


1077 


1941 


2597 


787 9753 


167 


1078 


1942 


2598 


787 4766 


168 


1079 


1943 


2599 


789 3521 


169 


1080 


1944 


2600 


784 8097 


170 


1081 








171 


1082 


1945 


2601 


784 735 


172 


1083 


1946 


2602 


784 4418 


173 


1084 


1947 


2603 


784 4418 


174 


1085 








175 


1086 








176 


1087 


1948 


2604 


784 1006 


177 


1088 


1949 


2605 


784 1528 


178 


1089 








179 


1090 








180 


1091 


1950 


2606 


791 2053 


181 


1092 








182 


1093 


1951 


2607 


788 6860 


183 


1094 


1952 


2608 


785_1010 


184 


1095 


1953 


2609 


784 6854 


185 


1096 


1954 


2610 


784 8679 


186 [ 


1097 


1955 


2611 


790 19249 


187 


1098 


1956 


2612 


78455 66 


188 


1099 


1957 


2613 


790 3027 


189 


1100 


1958 


2614 


790 3027 


190 


1101 


1959 


2615 


790 3027 


191 


1102 








192 


1103 


1960 


2616 


791 2652 


193 


1104 


1961 


2617 


784 7116 


194 


1105 


1962 


2618 


789 1658 


195 


1106 


1963 


2619 


787 7673 


196 


1107 


1964 


2620 


790_3240 


197 


1108 








198 


1109 


1965 


2621 


784 6361 


199 


1110 


1966 


2622 


784 297 


200 


1111 


1967 


2623 


790 13930 


201 


1112 








202 


1113 


1968 


2624 


784_8933 


203 


1114 








204 


1115 
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Table 9 
1139 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No.JSEQ ID NO.) * 


205 


1116 








206 


1117 








207 


1118 


1969 


2625 


784_7603 


208 


11 19 


1970 


2626 


HOT C/1CO 

787 5453 


209 


1120 


1971 


2627 


789 637/ 


210 


1121 


1972 


2628 


<-7 0 A 1 A Ail 

784_7446 


211 


1122 


1973 


2629 


784 5422 


212 


1123 


1 AT /I 

1974 


2630 


TO A KO.A 

/o4 Zdd\j 


213 


1124 


1975 


2631 


HOI TTCT 

787 7257 


214 


1125 


1976 


2632 


'-JO A ZAT1 

784 54/3 


215 


1126 


1977 


1 /TO o 

2633 


TO A 1 T^l 

784 1793 


216 


1127 


1978 


2634 


no A 1 AA1T 

784 10027 


217 


1128 








218 


1129 


1979 


2635 


nnn TOAO 

787 2898 


219 


1130 


1980 


2636 


788 9208 


220 


1131 


1981 


2637 


787 2305 


221 


1132 


1982 


2638 


784 8556 


222 


1133 








223 


1134 


1983 


2639 


TOT C T f s~ 

787 5766 


224 


1135 


1984 


2640 


i-jr\f\ A CO 1 

79045 31 


225 


1136 


1985 


2641 


784_6708 


226 


1137 t 


1986 


2642 


TAA 1 AO 1 £1 

790 19316 


227 


1138 


1987 


2643 


TO A 1 TO A 

784 1784 


228 


1139 


1988 


2644 


TO A 1 £1 AO 

784 1698 


229 


1140 


1989 


2645 


TOT n/TO 

787 1368 


230 


1141 


1990 


2646 


TO A c i ni 

789 6192 


231 


1142 


1991 


2647 


TO A A A C\ O 

784 4498 


232 


1143 








233 


1144 


1992 


2648 


789 6042 


234 


1145 








235 


1146 


1 aai 

1993 


2649 


TA A A A £1 1 

790 4461 


236 


1147 








237 


1148 


1 A A A 

1994 


2650 


TO/1 1/T7C 

784 2675 


238 


1149 


1995 


2651 


TO A ,1 cm 

789 4591 


239 


1150 


1996 


2652 


TO A ,1 cm 

789 4591 


240 


1151 


1997 


2653 


TA A lliyH: 

790 13145 


241 


1152 








242 


1153 


1998 


2654 


TA 1 ,imo 

791 4932 


243 


1154 








244 


1155 


1999 


2655 


TO >l 1 A 1 jl 1 

784 10141 


245 


1156 


2000 


2656 


784 10225 


246 


1157 


2001 


2657 ! 


784 7722 


247 


1158 








z4o 


1 1 C A 

1159 








249 


1160 


2002 


2658 


787 3228 


250 


1161 


2003 


2659 


785 3642 


251 


1162 


2004 


2660 


784 6469 


252 


1163 


2005 


2661 


787 6106 


253 


1164 


2006 


2662 


791 2793 


254 


1165 








255 


1166 


2007 


2663 


784 3318 
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Table 9 
1140 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


256 


1167 








257 


1168 


2008 


2664 


789 6270 


258 


1169 


2009 


2665 


788 13058 


259 


1170 


2010 


2666 


787 5541 


260 


1171 








261 


1172 


2011 


2667 


784 7891 


262 


1173 


2012 


2668 


784 4405 


263 


1174 


2013 


2669 


784 7585 


264 


1175 


2014 


2670 


784 7585 


265 


1176 


2015 


2671 


785 3390 


266 


1177 


2016 


2672 


789 6254 


267 


1178 


2017 


2673 


785 3073 


268 


1179 


2018 


2674 


790 16443 


269 


1180 








270 


1181 


2019 


2675 


790 21097 


271 


1182 


2020 


2676 


787 8948 


272 


1183 


2021 


2677 


784 8624 


273 


1184 








274 


1185 


2022 


2678 


785 2309 


275 


1186 








276 


1187 


2023 


2679 


784 1834 


277 


1188 


2024 


2680 


787 8921 t 


278 


1189 


2025 


2681 


784 7125 


279 


1190 


2026 


2682 


785 2903 


280 


1191 


2027 


2683 


785 2903 


281 


1192 








282 


1193 








283 


1194 


2028 


2684 


791 2389 


284 


1195 








285 


1196 


2029 


2685 


784_7647 


286 


1197 


2030 


2686 


784 3528 


287 


1198 


2031 


2687 


784 5750 


288 


1199 








289 


1200 


2032 


2688 


789 3496 


290 


1201 








291 


1202 








292 


1203 


2033 


2689 


790 16699 


293 


1204 


2034 


2690 


785 3735 


294 


1205 


2035 


2691 


788 12224 


295 


1206 


2036 


2692 


787 6084 


296 


1207 








297 


1208 


2037 


2693 


784 330 


298 


1209 


2038 


2694 


791 2972 


299 


1210 


2039 


2695 


/y\J 24UUZ 


300 


1211 








301 


1212 


2040 


2696 


784 528 


302 


1213 


2041 


2697 


787 1611 


303 


1214 


2042 


2698 


785 1187 


304 


1215 


2043 


2699 


784 8087 


305 


1216 


2044 


2700 


789 4427 


306 


1217 


2045 


2701 


789 4427 
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Table 9 
1141 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 

TVT*> OTP*"* TTfc XTf^ \ * 

JMO. aiiiQ 111 1NU.) 


307 


1218 








308 


1219 


2046 


2702 


TOC O/CQI 

/o0 zoyi 


309 


1220 


2047 


2703 | 


TOC 1 A/1 A 


310 


1221 


2048 


2704 


TO A C1 T/C 

/o4 J> 1 /D 


311 


1222 


2049 


2705 


TO A 1 A C 1 

7o4 14M 


312 


1223 


2050 


2706 


TOO CT3£ 

/o9 3Z3o 


313 


1224 


2051 


2707 


TOT C 1 TC 

787 5175 


314 


1225 


2052 


2708 


TO A Olfll 

7o4 o3Ul 


315 


1226 


2053 


2709 


TOC T/TAO 


316 


1227 


2054 


2710 


TOC OQ1C 


317 


1228 









318 


1229 








319 


1230 


2055 


271 1 


TO/I 1A1A< 

/o4 lUiUD 


320 


1231 


2056 


2712 


no a a c\n a 
/o4 4y/4 


321 


1232 


2057 


2713 


TOC TAOC 

785 3UZD 


322 


1233 


2058 


2714 


toc into 
7o5 3UZo 


323 


1234 


2059 


T7 1 C 

2715 


TOC 1A11 
/QD 3U31 


324 


1235 


2060 


2716 


TOC T 1 C\C 


325 


1236 


2061 


2717 


TOC OAOO 

/oO 3U3Z 


326 


1237 


2062 


1T1 O 

271s 


TOO T/t^C/C 

/yz /4oo 


327 


1238 


2063 


2719 


TO C 11 AT 


328 


1239 


2064 


2720 


TOC 1AQ.C 


329 


1240 


2065 


2721 


TOC 1AOT 

/oj 3U3 / 


330 


1241 


2066 


2722 


TOC QAQO 

/5D 3U3o 


331 


1242 


2067 


2723 


TOC 1 AO O 

7o5 3U3o 


332 


1243 


2068 


2724 


TOC T 1 AO 


333 


1244 


2069 


2725 


TOC 1 A/1 O 

785 3043 


334 


1245 


2070 


2726 


TOC 1A/1C 

785 3045 


335 


1246 


2071 


2727 


TOC 1 1 1 /I 

785 Z114 


336 


1247 


2072 


2728 


TOC /lOI 

7o5 4o3 


337 


1248 


2073 


2729 


toc <no 
/oD ouy 


338 


1249 


2074 


2730 


TOC 1 CA1 


339 


1250 


2075 


2731 


TO C T1 CT 

785 215/ 


340 


1251 


2076 


2732 


toc ai r\tz 


341 


1252 


2077 


2733 


TOC QCAO 


342 


1253 








343 


1254 


2078 


2734 


i—iOC 111/1 

785 3114 


344 


1255 


2079 


2735 


TOC 11 K 

785 3115 


345 


1256 


2080 


2736 


TOC Ol^l 

785 2161 


346 


1257 








347 


1258 


2081 


2737 


TOC 1 1T3 

7o5 31Z3 


348 


1259 


2082 


2738 


TOC O-C A 

785 o54 


349 


1260 


2083 


2739 


TO C 1 C A/1 

7o5 1D04 


1 C A 

350 


1 O/SI 
IZOl 


ZUo4 


Z /4U 


700 I^S^ 
jjoj 


351 


1262 








352 


1263 


2085 


2741 


784 1062 


353 


1264 


2086 


2742 


784 9616 


354 


1265 








355 


1266 


2087 


2743 


788 604 


356 


1267 


2088 


2744 


784 9474 


357 


1268 


2089 


2745 


784 9474 
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Table 9 
1142 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


358 


1269 








359 


1270 








360 


1271 


2090 


2746 


788 6828 


361 


1272 


2091 


2747 


788 6828 


362 


1273 


2092 


2748 


784 8575 


363 


1274 


2093 


2749 


790 21803 


364 


1275 


2094 


2750 


784 5329 


365 


1276 


2095 


2751 


784 10122 


366 


1277 


2096 


2752 


784_3644 


367 


1278 








368 


1279 


2097 


2753 


787 2863 


369 


1280 


2098 


2754 


784 854 


370 


1281 








371 


1282 








372 


1283 


2099 


2755 


787 3968 


373 


1284 


2100 


2756 


785_2864 


374 


1285 


2101 


2757 


785 2864 


375 


1286 








376 


1287 


2102 


2758 


784_4739 


377 


1288 








378 


1289 


2103 


2759 


791 771 


379 


1290 


2104 


2760 


792_4497 


380 


1291 


2105 


2761 


784 4833 


381 


1292 


2106 


2762 


785 3195 


382 


1293 


2107 


2763 


784 6243 


383 


1294 


2108 


2764 


784 1134 


384 


1295 


2109 


2765 


784_10176 


385 


1296 


2110 


2766 


784 3781 


386 


1297 


2111 


2767 


785 2441 


387 


1298 


2112 


2768 


787 4817 


388 


1299 


2113 


2769 


784 3255 


389 


1300 








390 


1301 


2114 


2770 


784^_9436 


391 


1302 


2115 


2771 


784 5232 


392 


1303 


2116 


2772 


784 4604 


393 


1304 








394 


1305 


2117 


2773 


784 7986 


395 


1306 


2118 


2774 


787_10365 


396 


1307 


2119 


2775 


784 4897 


397 


1308 








398 


1309 


2120 


2776 


787 5541 


399 


1310 


2121 


2777 


787 7655 


400 


1311 


2122 


2778 


784 1002 


401 


1312 


2123 


2779 


787 8365 


402 


1313 








403 


1314 


2124 


2780 


784 9956 


404 


1315 


2125 


2781 


787 7927 


405 


1316 


2126 


2782 


787_10138 


406 


1317 


2127 


2783 


787__5878 


407 


1318 


2128 


2784 


784 6962 


408 


1319 


2129 


2785 


784 1341 
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Table 9 
1143 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: 01 

4\>1I Inn rrflt 

iuu-iengtn 

peptide 

sequence 


coniig 

iiiiflpni'iHp 

IlUtlCUllUC 

acuueute 


ct?o TD NO* of 
pnntip n en tide 
senuence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


4uy 




2130 


2786 


787 894 


/tin 

41U 


1 ^91 








411 


1 ^99 


2131 


2787 


784 5866 


A 1 9 
41Z 


1 ^9^ 


2132 


2788 


784 5734 


/J 1 1 


1^94 








A1 A 
414 


1 ^9*5 








A1 < 
41 J 


1 ^96 


2133 


2789 


788 11985 


A1 £ 
410 


1^97 








A1 7 
41 / 


1 ^98 


2134 


2790 


791 1605 


'flo 


1329 








41 Q 
41^ 




2135 


2791 


784 3232 


49 H 
4ZU 


I J J 1 


2136 


2792 


784 1670 


49 1 
4/1 


1119 








A99 
4ZZ 


M.JJJ 


2137 


2793 


784 2377 


A9"* 
4Z3 


1 114 ' 


2138 


2794 


788 2631 


A9A 
4Z4 


1 lis 


2139 


2795 


784 8666 


49 *\ 
4Z3 


1336 


2140 


2796 


785 1109 


49 
tzo 


1337 


2141 


2797 


785 1109 


497 

4 Z / 


1338 


2142 


2798 


784 6176 


49 S 
4Zo 


111Q 


2143 


2799 


785 1465 


A9Q 
4Zi? 


1 140 


2144 


2800 


785 1465 


4^n 


1141 


2145 


2801 


784 5369 


A'X 1 
4 J 1 


1 149 


2146 


2802 


787 8763 


/I ^19 
4.3Z 


1141 


2147 


2803 


784 5253 


/111 
4oJ 


1144 


2148 


2804 


789 2568 


4j4 


1 14S 


2149 


2805 


790 3055 


4 J J 


1 146 


2150 


2806 


790 10356 


4J>0 


1147 


2151 


2807 


785 2811 


A^7 
4 O / 


114R 


2152 


2808 


784 2496 


435 


1 14Q 


2153 


2809 


784 2530 


HO Z7 


1350 


2154 


2810 


784 2168 


440 


1351 


2155 


2811 


784 7505 


441 


1352 


2156 


2812 


784 6670 


AA9 
44Z 


1353 


2157 


2813 


787 2168 


AA^ 
44o 


1354 


2158 


2814 


784 3022 


AAA 

444 


1355 


2159 


2815 


790 1217 


AA^ 
44D 


1356 


2160 


2816 


784 5473 


AA^ 
44D 


1357 


2161 


2817 


784 7905 


A AH 
44 / 


1158 








AAC 
44 o 


1359 


2162 


2818 


784 9436 


AAQ 
44y 


1360 


2163 


2819 


787 2898 


A^n 


1361 








A^1 
4 J 1 


1362 








452 


1363 


2164 


2820 


784 4625 


453 


1364 


2165 


2821 


785 2622 


454 


1365 








455 


1366 


2166 


2822 


787 2386 


456 


1367 


2167 


2823 


788 9391 


457 


1368 


2168 


2824 


784 10104 


458 


1369 


2169 


2825 


784 10104 


459 


1370 


2170 


2826 


787 10366 
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Table 9 
1144 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No._SEQ ID NO.) * 


460 


1371 


2171 


2827 


784 6453 


461 


1372 


2172 


2828 


785 1337 


462 


1373 


2173 


2829 


787_5778 


463 


1374 


2174 


2830 


TO A £?f\f\t? 

784 5025 


464 


1375 








465 


1376 


2175 


2831 


785 739 


466 


1377 


2176 


2832 


787 1269 


467 


1378 








468 


1379 








469 


1380 








470 


1381 


2177 


2833 


785 326 


471 


1382 


2178 


2834 


784 8341 


472 


1383 


2179 


2835 


787 757 


473 


1384 








474 


1385 








475 


1386 


2180 


2836 


787 8454 


476 


1387 


2181 


2837 


790 8424 


477 


1388 








478 


1389 


2182 


2838 


787 9580 


479 


1390 


2183 


2839 


784 6906 


480 


1391 


2184 


2840 


787 I4ll 


481 


1392 


2185 


2841 


790 10999 


482 


1393 








483 


1394 








484 


1395 








485 


1396 








486 


1397 


2186 


2842 


787 2916 


487 


1398 


2187 


2843 


787 5986 


488 


1399 


2188 


2844 


nor irtnn 

785 2979 


489 


1400 


2189 


2845 


785 21 05 


490 


1401 








491 


1402 








492 


1403 


2190 


2846 


785 3046 


493 


1404 


2191 


2847 


788 9546 


494 


1405 


2192 


2848 


785 2158 


495 


1406 








496 1 


1407 


2193 


2849 


784 7549 


497 


1408 


2194 


2850 


784 1066 


498 


1409 


2195 


2851 


TO /f ^ ✓"TO 

784 2673 


499 


1410 


2196 


2852 


785 563 


500 


1411 








501 


1412 


2197 


2853 


789 630 


502 


1413 


2198 


2854 


785 259 


503 


1414 


2199 




/o4 DOj/ 


504 


1415 


2200 


2856 


790 7396 


505 


1416 








506 


1417 


2201 


2857 


790 11548 


507 


1418 








508 


1419 


2202 


2858 


784 6740 


509 


1420 








510 


1421 


2203 


2859 


785 1007 
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Table 9 
1145 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


511 


1422 








512 


1423 








513 


1424 


2204 


2860 


784 3641 


514 


1425 








515 


1426 


2205 


2861 


789 6066 


516 


1427 


2206 


2862 


788_13582 


517 


1428 


2207 


2863 


784 9762 


518 


1429 


2208 


2864 


787 2052 


519 


1430 


2209 


2865 


790 3055 


520 


1431 








521 


1432 


2210 


2866 


790 23800 


522 


1433 


2211 


2867 


790 23800 


523 


1434 








524 


1435 


2212 


2868 


787 2548 


525 


1436 








526 


1437 


2213 


2869 


784 2565 


527 


1438 


2214 


2870 


784_544 


528 


1439 


2215 


2871 


784 544 


529 


1440 








530 


1441 


2216 


2872 


789 1844 


531 


1442 


2217 


2873 


787 2733 


532 


1443 


2218 


2874 


785 493 


533 


1444 


2219 


2875 


790 J 8203 


534 


1445 








535 


1446 


2220 


2876 


784 3336 


536 


1447 


2221 


2877 


790 777 


537 


1448 


2222 


2878 


787 4220 


538 


1449 


2223 


2879 


790 26438 


539 


1450 


2224 


2880 


790 13586 


540 


1451 


2225 


2881 


784 270 


541 


1452 








542 


1453 


2226 


2882 


790 14176 


543 


1454 


2227 


2883 | 


784_3543 


544 


1455 


2228 


2884 


784 3575 


545 


1456 


2229 


2885 


787_5638 


546 


1457 | 


2230 


2886 


784 3826 


547 


1458 


2231 


2887 


787 6059 


548 


1459 


2232 


2888 


787 2585 


549 


1460 








550 


1461 








551 


1462 


2233 


2889 


787 2965 


552 


1463 








553 


1464 


2234 


2890 


790_13050 


554 


1465 








555 


1466 


2235 


2891 


784 3374 


556 


1467 


2236 


2892 


787 8240 


557 


1468 


2237 


2893 


784 4398 


558 


1469 








559 


1470 


2238 


2894 


788_10891 


560 


1471 


2239 


2895 


787 9805 


561 


1472 
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Table 9 
1146 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


562 


1473 








563 


1474 


2240 


2896 


787 7094 


564 


1475 


2241 


2897 


787 5209 


565 


1476 


2242 


2898 


784 582 


566 


1477 


2243 


2899 


790 10111 


567 


1478 


2244 


2900 


790 13563 


568 


1479 








569 


1480 


2245 


2901 


787 8234 


570 


1481 








571 


1482 


2246 


2902 


790 16394 


572 


1483 








573 


1484 


2247 


2903 


790 18452 


574 


1485 


2248 


2904 


790 211 


575 


1486 


2249 


2905 


790 14932 


576 


1487 








577 


1488 


2250 


2906 


787 7498 


578 


1489 


2251 


2907 


787 610 


579 


1490 


2252 


2908 


790_l9096 


580 


1491 


2253 


2909 


784 7363 


581 


1492 


2254 


2910 


792 8355 


582 


1493 


2255 


2911 


787 ll 79 


583 


1494 


2256 


2912 


790 6822 


584 


1495 


2257 


2913 


784 8670 


585 


1496 


2258 


2914 


784 2469 


586 


1497 








587 


1498 


2259 


2915 


790 15 720 


588 


1499 


2260 


2916 


790 11300 


589 


1500 


2261 


2917 


784 190 


590 


1501 








591 


1502 


2262 


2918 


789 6245 


592 


1503 


2263 


2919 


787 7385 


593 


1504 


2264 


2920 


790 281 


594 


1505 


2265 


2921 


784 6192 


595 


1506 


2266 


2922 


790_770 


596 


1507 


2267 


2923 


784 3364 


597 


1508 


2268 


2924 


784 5424 


598 


1509 








599 


1510 


2269 


2925 


790 10H8 


600 


1511 








601 


1512 








602 


1513 


2270 


2926 


785 2555 


603 


1514 








604 


1515 








605 


1516 


2271 


2927 


790 20185 


606 


1517 


2272 


2928 


784 333 


607 


1518 


2273 


2929 


792 3164 


608 


1519 








609 


1520 








610 


1521 


2274 


2930 


784 3455 


611 


1522 


2275 


2931 


784 506 


612 


1523 


2276 


2932 


788 7179 
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Table 9 
1147 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No. SEQ ID NO.) * 


613 


1524 








614 


1525 








615 


1526 


2277 


2933 


790 13445 


616 


1527 








617 


1528 


2278 


2934 


784 6181 


618 


1529 


2279 


2935 


784 4453 


619 


1530 








620 


1531 


2280 


2936 


784 8140 


621 


1532 








622 


1533 


2281 


2937 


784 3441 


623 


1534 


2282 


2938 


787 6194 


624 


1535 


2283 


2939 


784 1407 


625 


1536 








626 


1537 


2284 


2940 


790 8886 


627 


1538 








628 


1539 


2285 


2941 


785 2555 


629 


1540 


2286 


2942 


784 688 


630 


1541 


2287 


2943 


784 8302 


631 


1542 


2288 


2944 


787 2581 


632 


1543 


2289 


2945 


790 19616 


633 


1544 


2290 


2946 


784 4663 


634 


1545 


2291 


2947 


790 9138 


635 


1546 








636 


1547 


2292 


2948 


784_4743 


637 


1548 


2293 


2949 


790 12541 


638 


1549 








639 


1550 








640 


1551 


2294 


2950 


784 4128 


641 


1552 








642 


1553 


2295 


2951 


790 19867 


643 


1554 


2296 


2952 


784 7061 


644 


1555 


2297 


2953 


785 1399 


645 


1556 


2298 


2954 


784 5565 


646 


1557 


2299 


2955 


790 10780 


647 


1558 








648 


1559 








649 


1560 


2300 


2956 


784 2148 i 


650 


1561 


2301 


2957 


790 24174 


651 


1562 


2302 


2958 


784 4307 


652 


1563 


2303 


2959 


790 20063 


653 


1564 








654 


1565 


2304 


2960 


784 7654 


655 


1566 


2305 


2961 


784_7454 


656 


1567 




2962 


/y(J o324 


657 


1568 


2307 


2963 


784^5860 


658 


1569 








659 


1570 


2308 


2964 


788 11718 


660 


1571 


2309 


2965 


784 2105 


661 


1572 








662 


1573 


2310 


2966 


785 1924 


663 


1574 


2311 


2967 


784„8098 
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Table 9 
1148 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


664 


1575 


2312 


2968 


784 2320 


665 


1576 


2313 


2969 


784 7695 


666 


1577 


2314 


2970 


784 6847 


667 


1578 








668 


1579 








669 


1580 


2315 


2971 


784 8098 


670 


1581 


2316 


2972 


784 4781 


671 


1582 








672 


1583 


2317 


2973 


792 8335 


673 


1584 








674 


1585 


2318 


2974 


789 2488 


675 


1586 


2319 


2975 


785 2433 


t~ >-i s: 

676 


1587 








677 


1588 


2320 


2976 


787 3115 


678 


1589 








679 


1590 


2321 


2977 


787 5209 


680 


1591 


2322 


2978 


787 4223 


681 


1592 


2323 


2979 


784 5228 


682 


1593 


2324 


2980 


787 5094 


683 


1594 


2325 


2981 


790 10116 


684 


1595 








685 


1596 


2326 


2982 


785 1351 


686 


1597 








687 


1598 








688 


1599 


2327 


2983 


784 4288 


689 


1600 


2328 


2984 


784 4816 


690 


1601 


2329 


2985 


791 3535 


691 


1602 


2330 


2986 


784 4322 


692 


1603 


2331 


2987 


785 2272 


693 


1604 


2332 


2988 


790 12959 


694 


1605 


2333 


2989 


784 4642 


695 


1606 


2334 


2990 


784 3534 


696 


1607 








697 1 


1608 








698 


1609 








699 


1610 


2335 


2991 


784_2904 


700 


1611 


2336 


2992 


784_8547 


701 


1612 








702 


1)613 








703 


1614 


2337 


2993 


785_2907 


704 


1615 








705 


1616 


2338 


2994 


789_3856 


706 


1617 


2339 


2995 


785_2043 


icn 
i\j I 


lO lo 


OO Af\ 

ZJ4U 




784 4327 


708 


1619 


2341 


2997 


787 10225 


709 


1620 








710 


1621 


2342 


2998 


790 24136 


711 


1622 


2343 


2999 


785 172 


712 


1623 


2344 


3000 


790 24958 


713 


1624 


2345 


3001 


790 17848 


714 


1625 


2346 


3002 


790 10920 
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Table 9 
1149 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


715 


1626 


2347 


3003 


790 24477 


hi £. 

716 


1627 


2348 


3004 


790 21024 


111 


1628 


2349 


3005 


790_3053 


718 


1629 








7iy 


1630 


2350 


3006 


790 13392 


/2U 


1631 


2351 


3007 


790 28822 


721 


1632 


2352 


3008 


790 17805 


111 


1633 


2353 


3009 


784 1991 


723 


1634 


2354 


3010 


790 12778 


'70 /J 

/24 


1 /TO C 

1635 


2355 


3011 


784 5899 




1636 








hhc 


1 ^17 

1637 


2356 


3012 


784 4851 


727 


1638 


2357 


3013 


784 6070 


/2o 


1 /Tort 

1639 


2358 


3014 


784 9615 


729 


1640 


2359 


3015 


790 23719 




1 /C A 1 

1641 








HI 1 

/31 


1 ZT A 'I 

1642 


2360 


3016 


784 7918 


/32 


1 ZTyl O 

1643 


2361 


3017 


790 16176 


TIT 


1644 








/34 


1645 


2362 


3018 


784 4042 


735 


1646 








HI /C 

736 


1647 


2363 


3019 


790 4484 


737 


1648 








738 


1649 


2364 


3020 


790 16690 


739 


1650 


2365 


3021 


784 7703 


740 


1651 


2366 


3022 


785 2626 


741 


1652 








742 


1653 


2367 


3023 


784 1423 


743 


1654 








HA A 
/44 


1 /re c 

1655 








/45 


1656 


2368 


3024 


784 8178 


/46 


1 HCH 

1657 


2369 


3025 


784 6726 


/4/ 


1 rco 

1658 


2370 


3026 


792 7162 


HA O 

74o 


1659 


2371 


3027 


784 2948 


/49 


1660 








/5U 


1 ZT^T 1 

1661 


2372 


3028 


784 2988 


/D 1 


1 £LCH 

1662 








/52 


1663 


2373 


3029 


784 10047 


/D3 


1 CCA 

1664 








7f y| 

/34 


i ccz 
loo J 


2374 


3030 


787 3415 


7ff 


1 £CC 

1666 








/JO 


1 C_CH 

166/ 


2375 


3031 


787_10007 


7f7 

757 


1 ceo 
166c 








758 


1669 








759 


1670 


2376 


3032 


790 12779 


760 


1671 








761 


1672 


2377 


3033 


785 3699 


762 


1673 


2378 


3034 


784_4026 


763 


1674 


2379 


3035 


790_ 14323 


764 


1675 








765 


1676 
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Table 9 
1150 



SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


SEQ ID NO: of 


Identification of Priority 


full-length 


full-length 


contig 


contig peptide 


Application that contig 


nucleotide 


peptide 


nucleotide 


sequence 


nucleotide sequence was 


sequence 


sequence 


sequence 




filed (Attorney Docket 










No. SEQ ID NO.) * 


766 


1677 


2380 


3036 


792 2616 


767 


1678 


2381 


3037 


787 6570 


768 


1679 


2382 


3038 


784 2590 


769 


1680 


2383 


3039 


784 5216 


770 


1681 








771 


1682 


2384 


3040 


790 12514 


772 


1683 








773 


1684 








774 


1685 


2385 


3041 


785 3366 


775 


1686 


2386 


3042 


790 12446 


776 


1687 








111 


1688 








IIS 


1689 








119 


1690 








780 


1691 


2387 


3043 


784 6708 


781 


1692 


2388 


3044 


784 852 


782 


1693 








783 


1694 








784 


1695 








785 


1696 


2389 


3045 


784 9460 


786 


1697 


2390 


3046 


787 7895 


787 


1698 


2391 


3047 


784_6226 


788 


1699 


2392 


3048 


789 5263 


789 


1700 


2393 


3049 


784 4933 


790 i 


1701 








791 


1702 








792 


1703 








793 


1704 


2394 


3050 


784 7176 


794 


1705 








795 


1706 


2395 


3051 


784_3074 


796 


1707 








797 


1708 


2396 


3052 


785 1642 


798 


1709 








799 


1710 


2397 


3053 


790 12671 


800 


1711 








801 


1712 








802 


1713 


2398 


3054 


784 1127 


803 


1714 








804 


1715 








805 


1716 


2399 


3055 


790 14423 


806 


1717 


2400 


3056 


790 28166 


807 


1718 








808 


1719 








QAQ 

ouy 


1720 




3057 


784 5946 


810 


1721 


2402 


3058 


784 4850 


811 


1722 


2403 


3059 


784 744 


812 


1723 


2404 


3060 


790 13151 


813 


1724 








814 


1725 


2405 


3061 


787 7172 


815 


1726 


2406 


3062 


790 6384 


816 


1727 
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Table 9 
1151 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleoli uc sequence vvaa 


817 


1728 








818 


1729 


o a ao 
24U/ 


jUOj 


785 3645 


819 


1730 


O A AO 

240o 


3U04 


784 9630 


820 


1731 








821 


1732 


o/i nn 

24uy 


7fl6^ 


784 10027 

/ 0*+ 1 \J\JZd / 


822 


1733 


O/l 1 A 

24 1U 


1066 


784 10083 


823 


1734 


241 1 


jUO / 


787 7031 


824 


1735 


O/l 1 o 

2412 




784 4592 


825 


1736 








826 


1737 


241 3 




790 3308 


827 


1738 








828 


1739 


O/l 1 A 

2414 


^070 
3U/U 


784 4360 

/ O'T tJ 


829 


1740 


O A 1 C 

24 ID 


3U / 1 


785 2472 


830 


1741 


O A 1 /C 

241o 


Q AOO 
3U /Z 


784 7060 


831 


1742 


2417 


QA7-2 

3U /3 


784 3540 

/0*+ «->~>'t-7 


832 


1743 








833 


1744 


O A 1 O 

241 o 


11Y74 
3U/4 


784 1 872 


834 


1745 


O/l 1 A 

2419 


3U /0 


784 7995 

/ O'T / _/^w-' 


835 


1746 


O/l OA 

2420 


3U /O 


784 5564 

/ 0*T J w> VJ^ 


836 


1747 








837 


1748 








838 


1749 


O A O 1 

2421 


O AOO 


785 147? 
/ OJ it / ^ 


839 


1750 


o /in 

2422 


■5A70 

3U / o 


784 9937 


840 


1751 


O /I oo 

2423 


Q AOQ 

5U /y 


7Q0 9413? 


841 


1752 


o /i o /i 
2424 




790 6689 


842 


1753 


2425 


3Uo I 


787 3699 


843 


1754 


O/l o^: 
242o 


J)UoZ 


785 9955 


844 


1755 


O /lOO 

242/ 


oUoo 


7R4 4394 


845 


1756 


O/l OQ 

242 o 




784 9382 


846 


1757 


o a on 
242y 




787 7087 

/Of 1 V/O 1 


847 


1758 








848 


1759 


O/l Q A 

243 U 


■30,86 


784 4912 


849 


1760 








850 


1761 


O/l 1 

243 1 


^087 
jUo / 


790 17236 ' 


851 


1762 


O/IIO 

2432 




790 1914 


852 


1763 


O/l OQ 

2433 


5Uoy 


784 466 


853 


1764 


O/l O A 

2434 


O AAA 

3uyu 


790 1 1 1 64 


854 


1765 








855 


1766 


2435 


1AG1 

3uy I 


7R7 8100 

/O / 0 1UVJ > 


856 


1767 


2436 


AOO 

3Uy2 


7RA 0069 
/ oh- 


857 


1768 








858 


1769 


O A 1*7 

243 / 


3Uy3 


784 4141 

/OH- Tin 1 


859 


1770 








ODU 


1 771 
1 / / 1 


2438 


3094 


784 1365 


861 


1772 


2439 


3095 


784 6642 


862 


1773 








863 


1774 


2440 


3096 


789 3442 


864 


1775 


2441 


3097 


784 10198 


865 


1776 


2442 


3098 


790 11264 


866 


1777 








867 


1778 


2443 


3099 


1 788 13640 
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Table 9 
1152 



SEQ ID NO: of 
full-length 
nucleotide 
sequence 


SEQ ID NO: of 
full-length 
peptide 
sequence 


SEQ ID NO: of 
contig 
nucleotide 
sequence 


SEQ ID NO: of 
contig peptide 
sequence 


Identification of Priority 
Application that contig 
nucleotide sequence was 
filed (Attorney Docket 
No._ SEQ ID NO.) * 


868 


1779 








869 


1780 


2444 


3100 


785_238 


870 


1781 


2445 


3101 


787 10243 


871 


1782 


2446 


3102 


784 8555 


872 


1783 


2447 


3103 


789 5245 


873 


1784 


2448 


3104 


784 4968 


874 


1785 


2449 


3105 


789 5631 


875 


1786 


2450 


3106 


784_6373 


876 


1787 


2451 


3107 


784 7992 


877 


1788 


2452 


3108 


787_6002 


878 


1789 


2453 


3109 


784 1503 


879 


1790 


2454 


3110 


790 10504 


880 


1791 


2455 


3111 


790 18438 


881 


1792 


2456 


3112 


790_13790 


882 


1793 


2457 


3113 


790 21890 


883 


1794 


2458 


3114 


790 17754 


884 


1795 








885 


1796 


2459 


3115 


785 1016 


886 


1797 


2460 


3116 


785_14 


887 


1798 


2461 


3117 


790 18472 


888 


1799 


2462 


3118 


784 8233 


889 


1800 


2463 


3119 


784 177 


890 


1801 


2464 


3120 


784 3522 


891 


1802 


2465 


3121 


784_10067 


892 


1803 


2466 


3122 


784 7868 


893 


1804 








894 


1805 


2467 


3123 


787 7085 


895 


1806 


2468 


3124 


784_3739 


896 


1807 


2469 


3125 


784_4168 


897 


1808 


2470 


3126 


784 6642 


898 


1809 








899 


1810 


2471 


3127 


789 6290 


900 


1811 


2472 


3128 


791 2423 


901 


1812 








902 


1813 








903 


1814 








904 


1815 








905 


1816 


2473 


3129 


790 10106 


906 


1817 


2474 


3130 


787 3075 


907 


1818 


2475 


3131 


790 17762 


908 


1819 


2476 


3132 


784 482 


909 


1820 


2477 


3133 


790 21059 


910 


1821 


2478 


3134 


788 13847 


911 


1822 


i 





*784_XXX - SEQ ID NO: XXX of Attorney Docket No. 784, US Serial No. 09/488,725 
filed 01/21/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 
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Table 9 
1153 

785JXXX = SEQ ID NO: XXX of Attorney Docket No. 785, US Serial No. 09/491,404 
filed 01/25/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

787_XXX = SEQ ID NO: XXX of Attorney Docket No. 787, US Serial No. 09/496,914 
filed 02/03/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

788_XXX = SEQ ID NO: XXX of Attorney Docket No. 788, US Serial No. 09/515,126 
filed 02/28/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

789_XXX = SEQ ID NO: XXX of Attorney Docket No. 789, US Serial No. 09/519,705 
filed 03/07/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

790_XXX = SEQ ID NO: XXX of Attorney Docket No. 790, US Serial No. 09/540,217 
filed 03/31/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

791_XXX = SEQ ID NO: XXX of Attorney Docket No. 791, US Serial No. 09/552,929 
filed 04/18/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 

792_XXX = SEQ ID NO: XXX of Attorney Docket No. 792, US Serial No. 09/577,408 
filed 05/18/2000, the entire disclosure of which, including sequence listing, is 
incorporated herein by reference. 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,739 


1 


912 


1 


2 


913 


2 


3 


914 


3 


4 


915 


4 


5 


916 


5 


6 


917 


6 


7 


918 


7 


8 


919 


8 


9 


920 


9 


10 


921 


10 


11 


922 


11 


12 


923 


12 


13 


924 


13 


14 


925 


14 


15 


926 


15 


16 


927 


16 


17 


928 


17 


18 


929 


18 


19 


930 


19 


20 


931 


20 


21 


932 


21 


22 


933 


22 


23 


934 


23 


24 


935 


24 


25 


936 


25 


26 


937 


26 


27 ; 


938 


27 


28 


939 


28 


29 


940 


29 


30 


941 


30 


31 


942 


31 


32 


943 


32 


33 


944 


33 


34 


945 


34 


35 


946 


35 


36 


947 


36 


37 


948 


37 


38 


949 


38 


39 


950 


39 


40 


951 


40 


41 


952 


41 


42 


953 


42 


43 


954 


43 


44 


955 


44 


45 


956 


45 


46 


957 


46 


47 


958 


47 


48 


959 


48 


49 


960 


49 


50 


961 


50 


51 


962 


51 


52 


963 


52 
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Table 10 
1155 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 
Nucleotide 
Sequence 


Full-length 
Peptide 
Sequence 


Priority 
Application USSN 
60/339,739 


53 


964 


53 


54 


965 


54 


55 


966 


55 


56 


967 


56 


57 


968 


57 


58 


969 


58 


59 


970 


59 


60 


971 


60 


61 


972 


61 


62 


973 


62 


63 


974 


63 


64 


975 


64 


65 


976 


65 


66 


977 


66 


67 


978 


67 


68 


979 


68 


69 


980 


69 | 


70 


981 


70 


71 


982 


71 


72 


983 


72 


73 


984 


73 


74 


985 


74 


75 


986 


75 


76 


987 


76 


77 


988 


77 


78 


989 


78 


79 


990 


79 


80 


991 


80 


81 


992 


81 


82 


993 


82 


83 


994 


83 


84 


995 


84 


85 


996 


85 


86 


997 


86 


87 


998 


87 


88 


999 


88 


89 


1000 


89 


90 


1001 


90 


91 


1002 


91 


92 


1003 


92 


93 


1004 


93 


94 


1005 


94 


95 


1006 


95 


96 


1007 


96 


97 


1008 


97 


98 


1009 


98 


99 


1010 


99 


100 


1011 


100 


101 


1012 


101 


102 


1013 


102 


103 


1014 


103 


104 


1015 


104 
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Table 10 
1156 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,739 


105 


1016 


105 


106 


1017 


106 


107 


1018 


107 


108 


1019 


108 


109 


1020 


109 


110 


1021 


110 


111 


1022 


111 


112 


1023 


112 


113 


1024 


113 


114 


1025 


114 


115 


1026 


115 


116 


1027 


116 


117 


1028 


117 


118 


1029 


118 


119 


1030 


119 


120 


1031 


120 


121 


1032 


121 


122 


1033 


122 


123 


1034 


123 


124 


1035 


124 


125 


1036 


125 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,453 


126 


1037 


1 


127 


1038 


2 


128 


1039 


3 


129 


1040 


4 


130 


1041 


5 


131 


1042 


7 


132 


1043 


8 


133 


1044 


9 


134 


1045 


10 | 


135 


1046 


11 


136 


1047 


12 


137 


1048 


13 


138 


1049 


14 


139 


1050 


15 


140 


1051 


16 


141 


1052 


17 


142 


1053 


18 


143 


1054 


19 


144 


1055 


20 


145 


1056 


21 


146 


1057 


22 


147 


1058 


23 


148 


1059 


24 


149 


1060 


25 


150 


1061 


26 


151 


1062 


27 
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Table 10 
1157 



SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/339,453 


152 


1063 


28 


153 


1064 


29 


154 


1065 


30 


155 


1066 


31 


156 


1067 


32 


157 


1068 


33 


158 


1069 


34 


159 


1070 


35 


160 


1071 


36 


161 


1072 


37 


162 


1073 


38 


163 


1074 


39 


164 


1075 


40 


165 


1076 


41 


166 


1077 


42 


167 


1078 


43 


168 


1079 


44 


169 


1080 


45 


170 


1081 


46 


171 


1082 


47 


172 


1083 


48 


173 


1084 


49 


174 


1085 


50 


175 


1086 


51 


176 


1087 


52 | 


177 


1088 


53 


178 


1089 


54 


179 


1090 


55 


180 


1091 


56 


181 


1092 


57 


182 


1093 


58 


183 ! 


1094 


59 


184 


1095 


60 


185 


1096 


61 


186 


1097 


62 


187 


1098 


63 


188 


1099 


64 


189 


1100 


65 


190 


1101 


66 


191 


1102 


67 


192 


1103 


68 


193 


1104 


69 


194 


1105 


70 


195 


1106 


71 


196 


1107 


72 


197 


1108 


73 


198 


1109 


74 


199 


1110 


75 


200 


1111 


76 


201 


1112 


77 


202 


1113 


78 


203 


1114 


79 
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1158 



SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/339,453 


204 


1115 


80 


205 


1116 


81 


206 


1117 


82 


207 


1118 


83 


208 


1119 


84 


209 


1120 


85 


210 


1121 


86 


211 


1122 


87 


212 


1123 


88 


213 


1124 


89 


214 


1125 


90 


215 


1126 


91 


216 


1127 


92 


217 


1128 


93 


218 


1129 


94 


219 


1130 


95 


220 


1131 


96 


221 ) 


1132 


97 


222 


1133 


98 


223 


1134 


99 


224 


1135 


100 


225 


1136 


101 


226 


1137 


102 


227 


1138 


103 


228 


1139 


104 


229 


1140 


105 


230 


1141 


106 


231 


1142 


107 


232 


1143 


108 


233 


1144 


109 


234 


1145 


110 


235 


1146 


111 


236 


1147 


112 


237 | 


1148 


113 


238 


1149 


114 


239 


1150 


115 


240 


1151 


116 


241 


1152 


117 


242 


1153 


118 


243 


1154 


119 


244 


1155 


120 


245 


1156 


121 


246 


1157 


122 


247 


1158 


123 


248 


1159 


124 




SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,384 


249 


1160 


3 


250 


1161 


4 
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Table 10 
1159 



SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,384 


251 


1162 


5 


252 


1163 


6 


253 


1164 


8 


254 


1165 


12 


255 


1166 


14 


256 


1167 


15 


257 


1168 


17 


258 


1169 


18 


259 


1170 


19 


260 


1171 


20 


261 


1172 


21 


262 


1173 


22 


263 


1174 


23 


264 


1175 


24 


265 


1176 


25 


266 


1177 


26 


267 


1178 


27 


268 


1179 


28 


269 


1180 


29 


270 


1181 


30 


271 


1182 


31 


272 


1183 


32 


273 


1184 


33 


274 


1185 


34 


275 


1186 


35 


276 


1187 


36 


277 


1188 


37 


278 


1189 


38 


279 


1190 


39 


280 


1191 


40 


281 


1192 


41 


282 


1193 


42 


283 


1194 


43 


284 


1195 


44 


285 


1196 


45 


286 


1197 


46 


287 


1198 


47 


288 


1199 


48 


289 


1200 


49 


290 


1201 


50 


291 


1202 


51 


292 


1203 


52 


293 


1204 


53 


294 


1205 


54 


295 


1206 


55 


296 


1207 


56 


297 


1208 


57 


298 


1209 


58 


299 


1210 


59 


300 


1211 


60 


301 


1212 


61 


302 


1213 


62 
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Table 10 
1160 



SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,384 


303 


1214 


63 


304 


1215 


64 


305 


1216 


65 


306 


1217 


66 


307 


1218 


67 


308 


1219 


68 


309 


1220 


69 


310 


1221 


70 


311 


1222 ' 


71 


312 


1223 


72 


313 


1224 


73 


314 


1225 


74 


315 


1226 


75 


316 


1227 


76 


317 


1228 


77 


318 


1229 


78 


319 


1230 


79 


320 


1231 


80 


321 


1232 


81 


322 


1233 


82 


323 


1234 


83 


324 


1235 


84 


325 


1236 


85 


326 


1237 


86 


327 


1238 


87 


328 


1239 


88 


329 


1240 


89 


330 


1241 


90 


331 


1242 


91 


332 


1243 


92 


333 


1244 


93 


334 


1245 


94 


335 


1246 


95 


336 


1247 


96 


337 


1248 


97 


338 


1249 


98 


339 


1250 


99 


340 


1251 


100 


341 


1252 


101 


342 


1253 


102 


343 


1254 


103 


344 


1255 


104 


345 


1256 


105 


346 


1257 


106 


347 


1258 


107 


348 


1259 


108 


349 


1260 


109 


350 


1261 


110 


351 


1262 


111 


352 


1263 


112 


353 


1264 


113 


354 


1265 


114 
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Table 10 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,384 


355 


1266 


115 


356 


1267 


116 


357 


1268 


117 


358 


1269 


118 


359 


1270 


119 


360 


1271 


120 


361 


1272 


121 


362 


1273 


122 


363 


1274 


123 


364 


1275 


124 


365 


1276 


125 


366 


1277 


126 


367 


1278 


127 


368 


1279 


128 


369 


1280 


129 


370 


1281 


130 


371 


1282 ! 


131 


372 


1283 


132 


373 


1284 


133 


374 


1285 


134 


375 


1286 


135 


376 


1287 


136 


377 


1288 


137 


378 


1289 


138 


379 


1290 


139 


380 


1291 


140 


381 


1292 


141 


382 


1293 


142 


383 


1294 


143 


384 


1295 


144 


385 


1296 


145 


386 


1297 


146 


387 


1298 


147 


388 


1299 


148 


389 


1300 


149 


390 


1301 


150 


391 


1302 


151 


392 


1303 


152 


393 


1304 


153 


394 


1305 


154 


395 


1306 


155 


396 


1307 


156 


397 


1308 


157 


398 


1309 


158 


399 


1310 


159 


400 


1311 


160 


401 


1312 


161 


402 


1313 


162 


403 


1314 


163 


404 


1315 


164 


405 


1316 


165 


406 


1317 


166 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,384 


407 


1318 


167 


408 


1319 


169 


409 


1320 


170 


410 


1321 


171 


411 


1322 


172 


412 


1323 


173 


413 


1324 


174 


414 


1325 


175 


415 


1326 


176 


416 


1327 


177 


417 


1328 


178 


418 


1329 


179 




SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,091 


419 


1330 


2 


420 


1331 


4 


421 


1332 


5 


422 


1333 


6 


423 


1334 


8 


424 


1335 


9 


425 


1336 


10 


426 


1337 


11 


427 


1338 


12 


428 


1339 


14 


429 


1340 


15 


430 


1341 


16 


431 


1342 


17 


432 


1343 


18 


433 


1344 


19 


434 


1345 


20 


435 


1346 


21 


436 


1347 


22 


437 


1348 


24 


438 


1349 


25 


439 


1350 


26 


440 


1351 


27 


441 


1352 


28 


442 


1353 


29 


443 


1354 


32 


444 


1355 


33 


445 


1356 


34 


446 


1357 


35 


447 


1358 


36 


448 


1359 


37 


449 


1360 


38 


450 


1361 


39 


451 


1362 


40 


452 


1363 


41 


453 


1364 


42 
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Table 10 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/365,091 


454 


1365 


43 


455 


1366 


44 


456 


1367 


45 


457 


1368 


46 


458 


1369 


47 


459 


1370 


48 


460 


1371 


49 


461 


1372 


50 


462 


1373 


51 


463 


1374 


52 


464 


1375 


53 


465 


1376 


54 


466 


1377 


55 


467 


1378 


56 


468 


1379 


57 


469 


1380 


58 


470 


1381 


59 


471 


1382 


60 


472 


1383 


61 


473 


1384 


62 


474 


1385 


63 


475 


1386 


64 


476 


1387 


65 


477 


1388 


66 


478 


1389 


67 


479 


1390 


68 


480 


1391 


69 


481 


1392 


70 


482 


1393 


71 


483 


1394 


72 


484 


1395 


73 


485 


1396 


74 


486 


1397 


75 


487 


1398 


76 


488 


1399 


77 


489 


1400 


78 


490 


1401 


79 


491 


1402 


80 


492 


1403 


81 


493 


1404 


82 


494 


1405 


83 


495 


1406 


84 


496 


1407 


85 


497 


1408 


86 


498 


1409 


87 


499 


1410 


88 


500 


1411 


89 


501 


1412 


90 


502 


1413 


91 


503 


1414 


92 


504 


1415 


93 


505 


1416 


94 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/365,091 


506 


1417 


95 


507 


1418 


96 


508 


1419 


97 


509 


1420 


98 


510 


1421 


99 


511 


1422 


100 


512 


1423 


101 


513 


1424 


102 


514 


1425 


103 


515 


1426 


104 


516 


1427 


105 


517 


1428 


106 


518 


1429 


107 


519 


1430 


108 


520 


1431 


109 


521 


1432 


110 


522 


1433 


111 


523 


1434 


112 


524 


1435 


113 


525 


1436 


114 


526 


1437 


115 


527 


1438 


116 


528 


1439 


117 


529 


1440 


118 


530 


1441 


119 


531 


1442 


120 


532 


1443 


121 


533 


1444 


122 


534 


1445 


123 


535 


1446 


124 


536 


1447 


125 


537 


1448 


126 


538 


1449 


127 


539 


1450 


128 


540 


1451 


129 


541 


1452 


130 


542 


1453 


131 


543 


1454 


132 


544 


1455 


133 


545 


1456 


135 


546 


1457 


136 


547 


1458 


137 


548 


1459 


138 


549 


1460 


139 


550 


1461 


140 


551 


1462 


141 


552 


1463 


142 


553 


1464 


143 


554 


1465 


144 


555 


1466 


145 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/372,615 


556 


1467 


2 


557 


1468 


3 


558 


1469 


4 


559 


1470 


5 


560 


1471 


6 


561 


1472 


7 


562 


1473 


8 


563 


1474 


9 


564 


1475 


10 


565 


1476 


11 


566 


1477 


12 


567 


1478 


13 


568 


1479 


14 


569 


1480 


15 


570 


1481 


16 


571 


1482 


17 


572 


1483 


18 


573 


1484 


19 


574 


1485 


20 


575 


1486 


21 


576 


1487 


22 


577 


1488 


23 


578 


1489 


24 


579 


1490 


25 


580 


1491 


26 


581 


1492 


27 


582 


1493 


28 


583 


1494 


29 


584 


1495 


30 


585 


1496 


31 


586 


1497 


32 


587 


1498 


33 


588 


1499 


34 


589 


1500 


35 


590 


1501 


36 


591 


1502 


37 


592 


1503 


38 


593 


1504 


39 


594 


1505 


40 


595 


1506 


41 


596 


1507 


42 


597 


1508 


43 


598 


1509 


44 


599 


1510 


45 


600 


1511 


46 


601 


1512 


47 


602 


1513 


48 


603 


1514 


49 


604 


1515 


50 


605 


1516 


51 


606 


1517 


52 


607 


1518 


53 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


608 


1519 


54 


609 


1520 


55 


610 


1521 


56 


611 


1522 


57 


612 


1523 


58 


613 


1524 


59 


614 


1525 


60 


615 


1526 


61 


616 


1527 


62 


617 


1528 


63 


618 


1529 


64 


619 


1530 


65 


620 


1531 


66 


621 


1532 


67 


622 


1533 


68 


623 


1534 


69 


624 


1535 


70 


625 


1536 


71 


626 


1537 


72 


627 


1538 


73 


628 


1539 


74 


629 


1540 


75 


630 


1541 


76 


631 


1542 


77 


632 


1543 


78 


633 


1544 


79 


634 


1545 


80 


635 


1546 


81 


636 


1547 


82 


637 


1548 


83 


638 


1549 


84 


639 


1550 


85 


640 


1551 


86 


641 


1552 


87 


642 


1553 


88 


643 


1554 


89 


644 


1555 


90 


645 


1556 


91 


646 


1557 


92 


647 


1558 


93 


648 


1559 


94 


649 


1560 


95 


650 


1561 


96 


651 


1562 


97 


652 


1563 


98 


653 


1564 


99 


654 


1565 


100 


655 


1566 


101 


656 


1567 


102 


657 


1568 


103 


658 


1569 


104 


659 


1570 


105 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


660 


1571 


106 


661 


1572 


107 


662 


1573 


108 


663 


1574 


109 


664 


1575 


110 


665 


1576 


111 


666 


1577 


112 


667 


1578 


113 


668 


1579 


114 


669 


1580 


115 


670 


1581 


116 


671 


1582 


117 


672 


1583 


118 


673 


1584 


119 


674 


1585 


120 


675 


1586 


121 


676 


1587 


122 


677 


1588 


123 


678 


1589 


124 


679 


1590 


125 


680 


1591 


126 


681 


1592 


127 


682 


1593 


128 


683 


1594 


129 


684 


1595 


130 


685 


1596 


131 


686 


1597 


132 


687 


1598 


133 


688 


1599 


134 


689 


1600 


135 


690 


1601 


136 


691 


1602 


137 


692 


1603 


138 


693 


1604 


139 


694 


1605 


140 


695 


1606 


141 


696 


1607 


142 


697 


1608 


143 


698 


1609 


144 


699 


1610 


145 


700 


1611 


146 


701 


1612 


147 


702 


1613 


148 


703 


1614 


149 


704 


1615 


150 


705 


1616 


151 


706 


1617 


152 


707 


1618 


153 j 


708 


1619 


154 


709 


1620 


155 


710 


1621 


157 


711 


1622 


158 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,615 


712 


1623 


159 


713 


1624 


160 


714 


1625 


161 


715 


1626 


162 


716 


1627 


163 


717 


1628 


164 


718 


1629 


165 


719 


1630 


166 


720 


1631 


167 


721 


1632 


168 


722 


1633 


169 


, 723 


1634 


170 


724 


1635 


171 


725 


1636 


172 


726 


1637 


173 


727 


1638 


174 


728 


1639 


175 


729 


1640 


176 


730 


1641 


177 


731 


1642 


178 


732 


1643 


179 


733 


1644 


180 


734 


1645 


181 


735 


1646 


182 


736 


1647 


183 


737 


1648 


184 


738 


1649 


185 


739 


1650 


186 


740 


1651 


187 


741 


1652 


188 


742 


1653 


189 


743 


1654 


190 


744 


1655 


191 


745 


1656 


192 


746 


1657 


193 


747 


1658 


194 


748 


1659 


195 


749 


1660 


196 


750 


1661 


197 


751 


1662 


198 


752 


1663 


199 


753 


1664 


200 


754 


1665 


201 


755 


1666 


202 


756 


1667 


203 


757 


1668 


204 | 


758 


1669 


205 


759 


1670 


206 


760 


1671 


207 


761 


1672 


208 


762 


1673 


209 


763 


1674 


210 ; 
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SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/372,615 


764 


1675 


211 


765 


1676 


212 


766 


1677 


213 


767 


1678 


214 


768 


1679 


215 


769 


1680 


216 


770 


1681 


217 


771 


1682 


218 


772 


1683 


219 


773 


1684 


220 


774 


1685 


221 


775 


1686 


222 


776 


1687 


223 


111 


1688 


224 


IIS 


1689 


225 


119 


1690 


226 


780 


1691 


227 


781 


1692 


228 


782 


1693 


229 


783 


1694 


230 


784 


1695 


231 


785 


1696 


233 


786 


1697 


234 


787 


1698 


235 


788 


1699 


236 


789 


1700 


237 


790 


1701 


238 


791 


1702 


239 


792 


1703 


240 


793 


1704 


241 


794 


1705 


242 


795 


1706 


243 


796 


1707 


244 


797 


1708 


245 


798 


1709 


246 


799 


1710 


247 


800 


1711 


248 


801 


1712 


249 


802 


1713 


250 


803 


1714 


251 


804 


1715 


252 


805 


1716 


253 


806 


1717 


254 


807 


1718 


255 


808 


1719 


256 




SEQ ID NO of 
Full-length 
Nucleotide 
Sequence 


SEQ ID NO of 
Full-length 
Peptide 
Sequence 


SEQ ID NO in 
Priority 
Application USSN 
60/372,381 


809 


1720 


1 


810 


1721 


2 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 


Full-length 


Priority 


Nucleotide 


Peptide 


Application USSN 


Sequence 


Sequence 


60/372,381 


811 


1722 


3 


812 


1723 


4 


813 


1724 


5 


814 


1725 


6 


815 


1726 


7 


816 


1727 


8 


817 


1728 


9 


818 


1729 


10 


819 


1730 


11 


820 


1731 


12 


821 


1732 


13 


822 


1733 


14 


823 


1734 


15 


824 


1735 


16 


825 


1736 


17 


826 


1737 


18 


827 


1738 


19 


828 


1739 


20 


829 


1740 


21 


830 


1741 


22 


831 


1742 


23 


832 


1743 


24 


833 


1744 


25 


834 


1745 


26 


835 


1746 


27 


836 


1747 


28 


837 


1748 


29 


838 


1749 


30 


839 


1750 


31 


840 


1751 


32 


841 


1752 


33 


842 


1753 


34 


843 


1754 


35 


844 


1755 


36 


845 


1756 


37 


846 


1757 


38 


847 


1758 


39 


848 


1759 


40 


849 


1760 


41 


850 


1761 


42 


851 


1762 


43 


852 


1763 


44 


853 


1764 


45 


854 


1765 


46 


855 


1766 


47 


856 


1767 


48 


857 


1768 


49 


858 


1769 


50 


859 


1770 


51 


860 


1771 


52 


861 


1772 


53 


862 


1773 


54 
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SEQ ID NO of 


SEQ ID NO of 


SEQ ID NO in 


Full-length 
Nucleotide 
Sequence 


Full-length 
Peptide 
Sequence 


Priority 
Application USSN 
60/372,381 


863 


1774 


55 


864 


1775 


56 


865 


1776 


57 


866 


1777 


58 


867 


1778 


59 


868 


1779 


60 


869 


1780 


61 


870 


1781 


62 


871 


1782 


63 


872 


1783 


64 


873 


1784 


65 


874 


1785 


66 


875 


1786 


67 


876 


1787 


68 


877 


1788 


69 


878 


1789 


70 


879 


1790 


71 


880 


1791 


72 


881 


1792 


73 


882 


1793 


74 


883 


1794 


75 


884 


1795 


76 


885 


1796 


77 


886 


1797 


78 


887 


1798 


79 


888 


1799 


80 


889 


1800 


81 


890 


1801 


82 


891 


1802 


83 


892 


1803 


84 


893 


1804 


85 


894 


1805 


86 


895 


1806 


87 


896 


1807 


88 


897 


1808 


89 


898 


1809 


90 


899 


1810 


91 


900 


1811 


92 


901 


1812 


93 


902 


1813 


94 


903 


1814 


95 


904 


1815 


96 


905 


1816 


97 


906 


1817 


98 


907 


1818 


99 


908 


1819 ' 


100 


909 


1820 


101 


910 


1821 


102 


911 


1822 


103 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1 -9 1 1 . 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein 
said polynucleotide has greater than about 99% sequence identity with the polynucleotide of 
claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting 
of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1; 
and 

(b) a polypeptide encoded by a polynucleotide hybridizing under 
stringent conditions with any one of SEQ ID NO: 1-911. 
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11. A composition comprising the polypeptide of claim 10 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and 

b) detecting the complex, so that if a complex is detected, the 
polynucleotide of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribing an annealed RNA molecule into a cDNA 
polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 
complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to form a polypeptide/compound complex; and 

b) detecting the complex, so that if the polypeptide/compound complex 
is detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a cell, 
under conditions sufficient to form a polypeptide/compound complex, wherein the complex 
drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, 
so that if the polypeptide/compound complex is detected, a compound that binds to the 
polypeptide of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of any of the polynucleotides from SEQ ID NO: 1-911, under 
conditions sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of any one of the polypeptides SEQ ID NO: 912-1822. 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprising of at least one of 
SEQ ID NO: 1-911. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the 
polynucleotides in the collection. 
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25. The collection of claim 23, wherein the array detects mismatches to any one of the 
polynucleotides in the collection. 

26. The collection of claim 22, wherein the collection is provided in a computer-readable 
format. 



